nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-21 09:02:32 +00:00

Author	SHA1	Message	Date
Xubin Ren	3b82e14f85	fix(shell): preserve login PATH for path append Made-with: Cursor	2026-04-26 20:32:38 +08:00
yorkhellen	2f2ac96ac7	fix: update tests for path_append env dict change	2026-04-26 20:32:38 +08:00
yorkhellen	23dde7b84c	fix: prevent shell injection via path_append in ExecTool	2026-04-26 20:32:38 +08:00
Xubin Ren	6036355ac5	fix(message): limit session recording to proactive sends Only mark message-tool deliveries for channel-session recording while cron jobs are running, avoiding duplicate session writes during normal user turns. Made-with: Cursor	2026-04-26 20:08:21 +08:00
Xubin Ren	403ce23d22	fix(agent): tighten ask_user CLI handling Made-with: Cursor	2026-04-25 22:10:19 +08:00
Xubin Ren	cfc76ffbbf	feat(agent): add ask_user tool Made-with: Cursor	2026-04-25 22:10:19 +08:00
Xubin Ren	be05189f39	feat(channels): add video support for Telegram and WebSocket Telegram previously sent all video files as documents via send_document, so users saw a file icon instead of an inline player. WebSocket only accepted image MIME types, rejecting video uploads entirely. Telegram: - Recognize video extensions (mp4/mov/avi/mkv/webm/3gp) in _get_media_type - Route videos through send_video with supports_streaming=True - Add VIDEO/VIDEO_NOTE/ANIMATION to inbound message filters - Add video MIME mappings to _get_extension - Fix: local file sends now use _call_with_retry (previously no retry) WebSocket: - Expand upload MIME whitelist with video/mp4, video/webm, video/quicktime - Add per-type size limits (_MAX_VIDEO_BYTES=20MB, _MAX_VIDEOS_PER_MESSAGE=1) - Expand media serving endpoint to serve video with correct Content-Type Agent: - Add "video" to message tool media parameter description - Add .mp4 example to identity.md system prompt Made-with: Cursor	2026-04-25 02:20:13 +08:00
Gunnar Thielebein	8d33c1cb37	feat(telegram): add inline keyboard buttons	2026-04-23 13:26:06 +08:00
k	03ec28dd49	fix(mcp): avoid WinError 193 for Windows stdio launchers	2026-04-22 14:50:55 +09:00
aiguozhi123456	53ba410e49	feat(read_file): add DOCX, XLSX, PPTX support via document.extract_text() Wire up the existing office document extractors in document.py to ReadFileTool by adding an extension guard and _read_office_doc() method that follows the established PDF pattern. Handles missing libraries, corrupt files, empty documents, and 128K truncation consistently.	2026-04-21 22:12:19 +08:00
jr_blue_551	ff8c28d5a8	agent: use ContextVar for tool routing context	2026-04-21 13:25:30 +08:00
hussein1362	368752e707	fix(mcp): retry once on transient connection errors When an MCP server restarts or a network connection drops between tool calls, the existing session throws ClosedResourceError, BrokenPipeError, ConnectionResetError, etc. Currently these are caught as generic exceptions and returned as permanent failures to the LLM, which then tells the user 'my tools are broken.' This change adds a single automatic retry with a 1-second backoff for transient connection-class errors in MCPToolWrapper, MCPResourceWrapper, and MCPPromptWrapper. Non-transient errors (ValueError, RuntimeError, McpError, etc.) are not retried. The retry is conservative: - Only 1 retry (not configurable, to keep the change minimal) - Only for a specific set of connection-class exceptions - Matched by exception class name to avoid importing anyio/etc. - 1s sleep between attempts to allow the server to recover - Clear logging distinguishes retried vs permanent failures In production this eliminates most 'MCP tool call failed: ClosedResourceError' noise when MCP bridge processes restart (e.g. after config changes or OOM kills). Tests: 22 new tests covering retry, exhaustion, non-transient bypass, timeout bypass, and all three wrapper types.	2026-04-21 13:24:40 +08:00
chengyongru	68466b1c2a	fix(agent): propagate effective session key through subagent pipeline The previous fix hardcoded session_key_override as channel:chat_id which broke unified session mode where pending queues use "unified:default". Propagate the effective key from _set_tool_context through SpawnTool into the origin dict so _announce_result routes to the correct pending queue in both normal and unified session modes.	2026-04-20 14:47:14 +08:00
coldxiangyu	7527961b19	fix(cron): drop top-level oneOf so OpenAI Codex/Responses accept tool schema PR #3125 added a top-level `oneOf` branch to `_CRON_PARAMETERS` to advertise per-action required fields. OpenAI Codex/Responses rejects `oneOf`/`anyOf`/`allOf`/`enum`/`not` at the root of function parameters, so any agent that registers the cron tool now fails to start with: HTTP 400: Invalid schema for function 'cron': schema must have type 'object' and not have 'oneOf'/'anyOf'/'allOf'/'enum'/'not' at the top level. Remove the top-level `oneOf`. The original intent of #3125 (stop LLMs from looping on the #3113 contract mismatch) is preserved by: - `validate_params` — runtime-enforces `message` for `action='add'` and `job_id` for `action='remove'` - field descriptions — each schema field already flags "REQUIRED when action='...'" so the LLM sees the contract The regression test is updated to lock the invariant in the other direction: the top-level schema must not contain `oneOf`/`anyOf`/`allOf`/`not`, and the REQUIRED hints must stay on `message` and `job_id`. Verified: - tests/cron/ 70 passed - tests/agent/test_loop_cron_timezone.py + tests/providers/ 232 passed Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2026-04-19 21:54:38 +08:00
Xubin Ren	adc1e843b4	Merge origin/main into fix/cron-contract-repeat-guard Made-with: Cursor	2026-04-18 19:42:48 +00:00
JunghwanNA	34fccb2ee9	Prevent self-inspection from leaking configured secrets MyTool blocks direct access to sensitive nested paths, but its formatter still printed scalar fields for small config objects. That let `my(action="check", key="web_config.search")` expose `api_key` in plain text even though the docs promise sensitive sub-fields are protected. This keeps the change narrow: sensitive nested config fields are omitted from MyTool's formatted output, and regression coverage locks the behavior in. Constraint: Must preserve existing read-only inspection behavior for non-sensitive fields Constraint: Keep scope limited to MyTool rather than introducing broader redaction plumbing Rejected: Rework global context/tool redaction around MyTool \| broader than needed for the leak path Confidence: high Scope-risk: narrow Reversibility: clean Directive: If more nested config rendering is added later, filter sensitive field names at the formatter boundary as well as the path resolver Tested: PYTHONPATH=$PWD pytest -q tests/agent/tools/test_self_tool.py /Users/jh0927/Workspace/nanobot-validation-artifacts-2026-04-18/test_my_tool_secret_leak_regression.py Not-tested: Full repository test suite Related: #3259	2026-04-18 00:59:08 +08:00
Steve	39dd59f2ba	fix(cron): state per-action requirements in descriptions, keep list/remove callable The previous patch promoted `message` into top-level `required`, which solved the `add` loop but broke `list` and `remove`: `ToolRegistry.prepare_call` enforces `required` via `validate_params`, so `cron(action="list")` and `cron(action="remove", job_id=...)` — both documented in `SKILL.md` — started failing schema validation with the same "missing required message" shape that #3113 describes for `add`. Instead: - Keep `required=["action"]` so `list`/`remove` stay callable. - Prefix `message`'s description with `REQUIRED when action='add'.` and `job_id`'s with `REQUIRED when action='remove'.` so LLMs see the real per-action contract up front. - Keep the improved runtime error message from the previous commit for the case an LLM still omits `message` on `add`. Also add `tests/cron/test_cron_tool_schema_contract.py` to lock in: - `list` and `remove` pass schema validation with no `message` - `add` with `message` passes - `add` without `message` surfaces the actionable runtime error - field descriptions carry the REQUIRED hints - top-level `required` stays `["action"]` Existing `tests/cron/test_cron_tool_list.py` cases bypass schema validation by calling `_list_jobs()` / `_remove_job()` directly, which is why CI didn't catch the regression; the new test goes through `ToolRegistry.prepare_call`.	2026-04-17 22:52:48 +08:00
Your Name	19dada927a	fix: make cron tool schema require message for add action Previously the JSON schema only required "action" but the runtime rejected empty messages, causing LLM retry loops. Making "message" required in the schema prevents the mismatch, and the improved error message guides the LLM to retry with the correct parameters. Fixes #3113 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 22:52:48 +08:00
Xubin Ren	5badb75f6c	review: tighten scope and add regression tests Follow-ups from review of #3194: - ci.yml: drop unconditional --ignore=tests/channels/test_matrix_channel.py. That test file already calls pytest.importorskip("nio") at module top, so it self-skips on Windows (where nio isn't installed) without also hiding 62 tests from Linux CI. - filesystem.py: hoist `import os` to the module top and drop the duplicate inline import in ReadFileTool.execute. Document the CRLF->LF normalization as intentional (primarily a Windows UX fix so downstream StrReplace/Grep match consistently regardless of where the file was written). - test_read_enhancements.py: lock down two new behaviors * TestFileStateHashFallback: check_read warns when content changes but mtime is unchanged (coarse-mtime filesystems on Windows). * TestReadFileLineEndingNormalization: ReadFileTool strips CRLF and preserves LF-only files untouched. - test_tool_validation.py: restore list2cmdline/shlex.quote in test_exec_head_tail_truncation. The temp_path-based form was correct, but dropping the quoting broke on any Windows path containing spaces (e.g. C:\Users\John Doe\...). CI runners happen not to have spaces so this slipped through. Tests: 1993 passed locally. Made-with: Cursor	2026-04-17 16:11:37 +08:00
Jiajun Xie	3db2eb66e4	ci: add Windows and Python 3.14 support	2026-04-17 16:11:37 +08:00
Xubin Ren	90b7d940e8	refactor(config): nest MyTool settings under tools.my (with legacy-key migration)	2026-04-16 15:58:20 +00:00
chengyongru	b51da93cbb	feat(agent): add SelfTool for runtime self-inspection and configuration Add a built-in tool that lets the agent inspect and modify its own runtime state (model, iterations, context window, etc.). Key features: - inspect: view current config, usage stats, and subagent status - modify: adjust parameters at runtime (protected by type/range validation) - Subagent observability: inspect running subagent tasks (phase, iteration, tool events, errors) — subagents are no longer a black box - Watchdog corrects out-of-bounds values on each iteration - Enabled by default in read-only mode (self_modify: false) - All changes are in-memory only; restart restores defaults - Comprehensive test suite (90 tests) Includes a self-awareness skill (always-on) with progressive disclosure: SKILL.md for core rules, references/examples.md for detailed scenarios.	2026-04-16 23:44:26 +08:00
Mohamed Elkholy	1304ff78cc	perf(tools): cache ToolRegistry.get_definitions() between mutations get_definitions() sorts tools on every LLM iteration for prompt cache stability. Cache the sorted result and invalidate on register/unregister so the sort only runs when the tool set actually changes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 21:52:36 +08:00
yeyitech	ee061f0595	fix(web): serialize duckduckgo search calls	2026-04-14 14:10:06 +08:00
yeyitech	655f3d2cc5	fix: harden cron tool contract and repeat guard	2026-04-14 12:40:23 +08:00
ramonpaolo	830644c352	fix: add guard for non-dict tool call parameters - Add type validation in registry.prepare_call() to catch list/other invalid params - Add logger.warning() in provider layer when non-dict args detected - Works for OpenAI-compatible and Anthropic providers - Registry returns clear error hint for model to self-correct	2026-04-13 09:55:05 +08:00
haosenwang1018	92ef594b6a	fix(mcp): hint on stdio protocol pollution	2026-04-13 09:41:55 +08:00
Xubin Ren	5dc238c7ef	fix(shell): allow read-only copies from internal state files Keep the new exec guard focused on writes to history.jsonl and .dream_cursor while still allowing read-only copy operations out of those files. Made-with: Cursor	2026-04-12 16:38:55 +08:00
04cb	3f59bd1443	fix(shell): reject LLM-supplied working_dir outside workspace (#2826 )	2026-04-12 16:38:55 +08:00
04cb	00fb491bc9	fix(shell): block exec writes to history.jsonl and cursor files (#2989 )	2026-04-12 16:38:55 +08:00
Xubin Ren	e0b9edf985	Merge PR #3017 : feat(tool): improve file editing and add notebook tool feat(tool): improve file editing and add notebook tool	2026-04-11 18:02:25 +08:00
Xubin Ren	322142f7ad	Merge origin/main into main	2026-04-11 09:32:05 +00:00
Mike Terhar	d3aa209cf6	add kagi web search tool	2026-04-11 16:53:05 +08:00
Xubin Ren	696b64b5a6	fix(notebook): remove unused imports Clean up unused imports in notebook_edit so the Ruff F401 check passes cleanly. Made-with: Cursor	2026-04-10 16:02:00 +00:00
worenidewen	a167959027	fix(mcp): support multiple MCP servers by connecting each in isolated task Each MCP server now connects in its own asyncio.Task to isolate anyio cancel scopes and prevent 'exit cancel scope in different task' errors when multiple servers (especially mixed transport types) are configured. Changes: - connect_mcp_servers() returns dict[str, AsyncExitStack] instead of None - Each server runs in separate task via asyncio.gather() - AgentLoop uses _mcp_stacks dict to track per-server stacks - Tests updated to handle new API	2026-04-10 23:51:50 +08:00
Xubin Ren	651aeae656	improve file editing and add notebook tool Enhance file tools with read tracking, PDF support, safer path handling, smarter edit matching/diagnostics, and introduce notebook_edit with tests.	2026-04-10 15:44:50 +00:00
chenyahui	0e6331b66d	feat(exec): support allowed_env_keys to pass specified env vars to subprocess Add allowed_env_keys config field to selectively forward host environment variables (e.g. GOPATH, JAVA_HOME) into the sandboxed subprocess environment, while keeping the default allow-list unchanged.	2026-04-09 23:35:44 +08:00
chensp	bfec06a2c1	Fix Windows exec env for Docker Desktop plugin discovery nanobot's Windows exec environment was not forwarding ProgramFiles and related variables, so docker desktop start could not discover the desktop CLI plugin and reported unknown command. Forward the missing variables and add a regression test that covers the Windows env shape.	2026-04-09 10:55:53 +08:00
Xubin Ren	edb821e10d	feat(agent): prompt behavior directives, tool descriptions, and loop robustness	2026-04-08 02:22:25 +08:00
Xubin Ren	ef0284a4e0	fix(exec): add Windows support for shell command execution ExecTool hardcoded bash, breaking exec on Windows. Now uses cmd.exe via COMSPEC on Windows with a curated minimal env (PATH, SYSTEMROOT, etc.) that excludes secrets. bwrap sandbox gracefully skips on Windows.	2026-04-08 01:48:55 +08:00
Xubin Ren	8871a57b4c	fix(mcp): forward prompt arg descriptions & standardise error format - Propagate `description` from MCP prompt arguments into the JSON Schema so LLMs can better understand prompt parameters. - Align generic-exception error message with tool/resource wrappers (drop redundant `{exc}` detail). - Extend test fixture to mock `mcp.shared.exceptions.McpError`. - Add tests for argument description forwarding and McpError handling. Made-with: Cursor	2026-04-08 00:28:04 +08:00
Tim O'Brien	7cc527cf65	feat(mcp): expose MCP resources and prompts as read-only tools Add MCPResourceWrapper and MCPPromptWrapper classes that expose MCP server resources and prompts as nanobot tools. Resources are read-only tools that fetch content by URI, and prompts are read-only tools that return filled prompt templates with optional arguments. - MCPResourceWrapper: reads resource content (text and binary) via URI - MCPPromptWrapper: gets prompt templates with typed arguments - Both handle timeouts, cancellation, and MCP SDK 1.x error types - Resources and prompts are registered during server connection - Gracefully handles servers that don't support resources/prompts	2026-04-08 00:28:04 +08:00
04cb	f4904c4bdf	fix(cron): add optional name parameter to separate job label from message (#2680 )	2026-04-07 13:22:20 +08:00
Leo fu	44c7992095	fix(filesystem): correct write success message from bytes to characters len(content) counts Unicode code points, not UTF-8 bytes. For non-ASCII content such as Chinese or emoji, the reported count would be lower than the actual bytes written to disk, which is misleading to the agent.	2026-04-07 13:22:00 +08:00
Xubin Ren	424b9fc262	refactor: extract _kill_process helper to DRY timeout/cancel cleanup Made-with: Cursor	2026-04-06 13:47:09 +08:00
Lingao Meng	0e617c32cd	fix(shell): kill subprocess on CancelledError to prevent orphan processes When an agent task is cancelled (e.g. via /stop), the ExecTool was only handling TimeoutError but not CancelledError. This left the child process running as an orphan. Now CancelledError also triggers process.kill() and waitpid cleanup before re-raising.	2026-04-06 13:47:09 +08:00
Xubin Ren	7ffd93f48d	refactor: move search_usage to utils/searchusage, remove brave stub - Rename agent/tools/search_usage.py → utils/searchusage.py (not an LLM tool, matches utils/ naming convention) - Remove redundant _fetch_brave_usage — handled by else branch - Move test to tests/utils/test_searchusage.py Made-with: Cursor	2026-04-06 13:37:55 +08:00
whs	bc0ff7f214	feat(status): add web search provider usage to /status command	2026-04-06 13:37:55 +08:00
Xubin Ren	28e0a76b80	fix: path_append must not clobber login shell PATH Seeding PATH in the env before bash -l caused /etc/profile to skip its default PATH setup, breaking standard commands. Move path_append to an inline export so the login shell establishes a proper base PATH first. Add regression test: ls still works when path_append is set. Made-with: Cursor	2026-04-06 13:20:53 +08:00
Ben Lenarts	be6063a142	security: prevent exec tool from leaking process env vars to LLM The exec tool previously passed the full parent process environment to child processes, which meant LLM-generated commands could access secrets stored in env vars (e.g. API keys from EnvironmentFile=). Switch from subprocess_shell with inherited env to bash login shell with a minimal environment (HOME, LANG, TERM only). The login shell sources the user's profile for PATH setup, making the pathAppend config option a fallback rather than the primary PATH mechanism.	2026-04-06 13:20:53 +08:00

1 2 3 4 5

204 Commits