Agent-JAE

jae/Agent-JAE

Fork 0

Commit graph

Author	SHA1	Message	Date
JAE	a2227c7659	feat: browser use - element extraction + index-based clicking for text models Some checks failed CI / build-check-test (push) Has been cancelled Details - tool-server.mjs: extractElements() scrapes all interactive elements with coordinates - tool-server.mjs: formatElements() returns numbered list for LLM to read - tool-server.mjs: click/type now support {index: N} for element-based interaction - tool-server.mjs: new /api/browser/elements and /api/browser/keypress endpoints - browser-tool.ts: updated schema with index, key params and elements/keypress actions - browser-tool.ts: elementsText included in every LLM response so model can see the page - browser-tool.ts: detailed workflow instructions in tool description - Enables text-only models (Llama 3.3 etc) to navigate and interact with web pages	2026-03-27 23:17:24 +00:00
JAE	fedc60fd0f	feat: unified tool-server + Agent Zero-inspired system prompt Some checks are pending CI / build-check-test (push) Waiting to run Details - Merge 3 servers into single tool-server.mjs on port 7700 - HTTP API: POST /api/bash, /api/browser/* - WebSocket: /ws/terminal (xterm.js panel) - WebSocket: /ws/browser (live browser panel) - SHARED Playwright instance between LLM browser tool and user panel - When AI navigates a page, user sees it live in browser panel - When user clicks in panel, AI tools see the same page state - Remove standalone terminal-server.mjs (was :7701) - Remove standalone browser-server.mjs (was :7702) - Update browser-panel.ts: ws://localhost:7700/ws/browser - Update terminal-panel.ts: ws://localhost:7700/ws/terminal - Agent Zero-inspired system prompt with: - Structured problem-solving methodology (analyse/plan/execute/verify/report) - Clear tool usage rules (no tools for casual chat) - Detailed tool descriptions with usage guidance - Resourceful retry behaviour on failures - npm run dev starts both vite + unified server via concurrently	2026-03-27 04:13:17 +00:00
JAE	00e9816e57	feat: add bash/browser agent tools + Docker support Some checks are pending CI / build-check-test (push) Waiting to run Details - bash-tool.ts: execute shell commands via tool-server HTTP API - browser-tool.ts: Playwright browser automation (navigate, click, type, screenshot) - tool-server.mjs: Node.js HTTP server for bash exec + Playwright control (port 7700) - Dockerfile + docker-compose.yml for containerised deployment - Register tools in agent toolchain (main.ts, index.ts) - Add dev:all script to run Vite + tool-server concurrently	2026-03-26 23:36:29 +00:00

Author

SHA1

Message

Date

JAE

a2227c7659

feat: browser use - element extraction + index-based clicking for text models

CI / build-check-test (push) Has been cancelled

Details

- tool-server.mjs: extractElements() scrapes all interactive elements with coordinates
- tool-server.mjs: formatElements() returns numbered list for LLM to read
- tool-server.mjs: click/type now support {index: N} for element-based interaction
- tool-server.mjs: new /api/browser/elements and /api/browser/keypress endpoints
- browser-tool.ts: updated schema with index, key params and elements/keypress actions
- browser-tool.ts: elementsText included in every LLM response so model can see the page
- browser-tool.ts: detailed workflow instructions in tool description
- Enables text-only models (Llama 3.3 etc) to navigate and interact with web pages

2026-03-27 23:17:24 +00:00

JAE

fedc60fd0f

feat: unified tool-server + Agent Zero-inspired system prompt

CI / build-check-test (push) Waiting to run

Details

- Merge 3 servers into single tool-server.mjs on port 7700
  - HTTP API: POST /api/bash, /api/browser/*
  - WebSocket: /ws/terminal (xterm.js panel)
  - WebSocket: /ws/browser (live browser panel)
- SHARED Playwright instance between LLM browser tool and user panel
  - When AI navigates a page, user sees it live in browser panel
  - When user clicks in panel, AI tools see the same page state
- Remove standalone terminal-server.mjs (was :7701)
- Remove standalone browser-server.mjs (was :7702)
- Update browser-panel.ts: ws://localhost:7700/ws/browser
- Update terminal-panel.ts: ws://localhost:7700/ws/terminal
- Agent Zero-inspired system prompt with:
  - Structured problem-solving methodology (analyse/plan/execute/verify/report)
  - Clear tool usage rules (no tools for casual chat)
  - Detailed tool descriptions with usage guidance
  - Resourceful retry behaviour on failures
- npm run dev starts both vite + unified server via concurrently

2026-03-27 04:13:17 +00:00

JAE

00e9816e57

feat: add bash/browser agent tools + Docker support

CI / build-check-test (push) Waiting to run

Details

- bash-tool.ts: execute shell commands via tool-server HTTP API
- browser-tool.ts: Playwright browser automation (navigate, click, type, screenshot)
- tool-server.mjs: Node.js HTTP server for bash exec + Playwright control (port 7700)
- Dockerfile + docker-compose.yml for containerised deployment
- Register tools in agent toolchain (main.ts, index.ts)
- Add dev:all script to run Vite + tool-server concurrently

2026-03-26 23:36:29 +00:00

3 commits