PR #3493 promoted every shell `_guard_command` rejection to a turn-fatal
RuntimeError. The two heuristic outputs in that list -- `path outside
working dir` and `path traversal detected` -- routinely false-positive on
benign constructs (e.g. `2>/dev/null`, quoted `..` arguments to sed/find,
absolute paths inside inline scripts), so legitimate workspace commands
silently kill the user's turn (#3599) and the agent never gets a chance
to retry with a different approach (#3605).
Two changes, both narrowly scoped:
- `ExecTool._guard_command` now skips a small allow-list of kernel device
files (`/dev/null`, the standard streams, `/dev/random`, `/dev/fd/N`,
...) before the workspace path check, matched against the pre-resolve
string so symlinks like `/dev/stderr -> /proc/self/fd/2` still hit the
allow-list. Real outside writes such as `> /etc/issue` remain blocked.
- `AgentRunner._WORKSPACE_BLOCK_MARKERS` keeps only the four hard
path-resolution errors from filesystem.py / shell.py and the SSRF
marker. The two heuristic substrings move out of the fatal list, so
the LLM sees them as ordinary tool errors and can self-correct in the
next iteration. SSRF stays fatal because retrying an internal URL
with a different phrasing would defeat the safety boundary.
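The device-file allow-list above can be sketched as follows. This is an illustrative reconstruction, not the project's actual code: the names `_DEVICE_ALLOW_LIST` and `is_allowed_device` are hypothetical, and the real `ExecTool._guard_command` embeds this check before its workspace path resolution.

```python
import re

# Hypothetical allow-list of kernel device files; the real list may differ.
_DEVICE_ALLOW_LIST = {
    "/dev/null", "/dev/stdin", "/dev/stdout", "/dev/stderr",
    "/dev/random", "/dev/urandom",
}
_DEVICE_FD_RE = re.compile(r"^/dev/fd/\d+$")

def is_allowed_device(raw_path: str) -> bool:
    # Match against the pre-resolve string so a symlink like
    # /dev/stderr -> /proc/self/fd/2 still hits the allow-list.
    path = raw_path.strip()
    return path in _DEVICE_ALLOW_LIST or bool(_DEVICE_FD_RE.match(path))
```

Because the raw string is checked, `> /etc/issue` never matches and still falls through to the workspace path check.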
Tests:
- `tests/tools/test_exec_security.py`: parametrized regression for the
exact #3599 command sample plus other stdio redirects and device
reads; explicit negative case asserts `> /etc/issue` is still blocked.
- `tests/agent/test_runner.py`: `_is_workspace_violation` no longer
fatals on the two heuristic markers, plus an end-to-end case proving
the runner hands the guard error back to the LLM and finalizes the
next turn cleanly.
Two related bugs that together caused scheduled jobs to disappear after
a container restart:
1. `_save_store()` used `Path.write_text(...)`, which truncates the
destination in place. A SIGKILL or shutdown mid-write left
`jobs.json` either truncated or corrupt.
2. `_load_jobs()` caught any parse error, logged at WARNING, and
returned an empty list. `start()` then called `_save_store()`
immediately, overwriting the corrupt-but-recoverable file with an
empty job array. Every scheduled job was silently lost with only a
single warning line in the log.
Reproduction in production: container restart at 18:08, after which a
job that had fired correctly for two consecutive days never fired
again. `jobs.json` on disk was missing the job entirely.
Fix:
- `_save_store()` now writes via temp file + `os.replace` + `fsync`
(matches the session manager pattern from 512bf59,
"fix(session): fsync sessions on graceful shutdown to prevent data
loss"). An interrupted write cannot corrupt the live file.
- `_load_jobs()` now moves a corrupt store aside as
`jobs.json.corrupt-<ts>` and returns `None` instead of `[]`.
- `start()` aborts with a `RuntimeError` when the on-disk store is
corrupt, instead of starting empty and overwriting.
- `_load_store()` falls back to the previous in-memory snapshot when
a hot reload encounters a corrupt file, so a transient corruption
after start does not drop live jobs.
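The atomic-write pattern in the first fix bullet looks roughly like this. It is a minimal sketch assuming a plain-text store; the function name `atomic_write_text` is illustrative and the real `_save_store()` may differ in naming and error handling.

```python
import os
import tempfile

def atomic_write_text(path: str, data: str) -> None:
    # Write to a temp file in the same directory, fsync it, then
    # os.replace() it over the destination. Readers always see either
    # the old or the new file, never a truncated one.
    directory = os.path.dirname(path) or "."
    fd, tmp = tempfile.mkstemp(dir=directory, prefix=".jobs-")
    try:
        with os.fdopen(fd, "w") as f:
            f.write(data)
            f.flush()
            os.fsync(f.fileno())
        os.replace(tmp, path)  # atomic rename on POSIX
    except BaseException:
        os.unlink(tmp)
        raise
```

The temp file must live in the same directory as the destination, since `os.replace` is only atomic within a filesystem.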
Tests cover the atomic-write path, the corrupt-file preservation,
the start-time refusal, the in-memory fallback, and a basic save/load
round trip across two service instances. Existing 79 cron tests and
full suite (2553 tests) still pass.
Keep provider retry wait messages on the interactive progress path so they do not fall through as assistant responses.
Co-authored-by: Cursor <cursoragent@cursor.com>
Restore the npm lockfile that is already present on main so this PR only carries the WebUI turn-completion changes.
Co-authored-by: Cursor <cursoragent@cursor.com>
Keep the new turn-end signal scoped to WebSocket clients, preserve pending tool-call state across trailing tool result rows, and drop the accidental npm lockfile from the Bun-based WebUI.
Co-authored-by: Cursor <cursoragent@cursor.com>
* fix: allow_patterns take priority over deny_patterns in ExecTool
Previously deny_patterns were checked first with no bypass, meaning
allow_patterns could never exempt commands from the built-in deny list.
This made it impossible to whitelist destructive commands for specific
directories (e.g. build/cleanup tasks).
Changes:
- shell.py: check allow_patterns first; if matched, skip deny check
- shell.py: deny_patterns now appends to built-in list (not replaces)
- schema.py: add allow_patterns/deny_patterns to ExecToolConfig
- loop.py/subagent.py: pass allow_patterns/deny_patterns to ExecTool
- Add test_exec_allow_patterns.py covering priority semantics
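The priority semantics can be sketched as below, assuming glob-style patterns; the deny list contents and the `guard_command` signature are illustrative, not the project's actual API.

```python
from fnmatch import fnmatch

# Illustrative built-in deny list; the real ExecTool list is larger.
BUILTIN_DENY_PATTERNS = ["rm -rf *", "mkfs*", "dd if=*"]

def guard_command(command: str, allow_patterns=(), deny_patterns=()):
    """Return an error string if the command is blocked, else None."""
    if any(fnmatch(command, pat) for pat in allow_patterns):
        return None  # explicitly allowed: skip the deny check entirely
    # User deny_patterns extend (not replace) the built-in list.
    for pat in [*BUILTIN_DENY_PATTERNS, *deny_patterns]:
        if fnmatch(command, pat):
            return "Command blocked by deny pattern filter"
    return None
```

Checking the allow-list first is what makes it possible to whitelist a destructive command for a specific directory while keeping the built-in deny list intact everywhere else.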
* fix: separate deny pattern errors from workspace violation detection
The deny pattern error message "Command blocked by safety guard" was
included in _WORKSPACE_BLOCK_MARKERS, causing deny_pattern blocks to be
misclassified as fatal workspace violations. This meant LLMs had no
chance to retry with a different command — the turn was aborted
immediately.
Changes:
- shell.py: deny/allowlist error messages now use distinct phrasing
("blocked by deny pattern filter" / "blocked by allowlist filter")
- runner.py: remove "blocked by safety guard" from
_WORKSPACE_BLOCK_MARKERS so deny_pattern errors are treated as normal
tool errors (LLM can retry) instead of fatal violations
- workspace path errors still use "blocked by safety guard" and remain
fatal as intended
* fix: update test assertions to match new deny pattern error message
* fix: indentation error in test file
* fix: restore SSRF fatal classification and tidy exec pattern plumbing
Address review feedback on the deny/allow_patterns rework:
- runner.py: re-add "internal/private url detected" to
_WORKSPACE_BLOCK_MARKERS. The earlier marker removal also stripped
fatal classification from SSRF / internal-URL rejections (whose
message still says "blocked by safety guard"), turning a hard
security boundary into something the LLM could retry.
- loop.py / subagent.py: drop `or None` between ExecToolConfig and
ExecTool. The schema default is an empty list and ExecTool already
normalizes None back to [], so the indirection was a no-op.
- shell.py: extract `explicitly_allowed` flag in _guard_command so
allow_patterns are scanned once instead of twice and the control
flow no longer relies on a no-op `pass + else` branch.
- tests/agent/test_runner.py: add a regression test asserting that
the SSRF block message is treated as fatal, while deny/allowlist
filter messages are deliberately non-fatal.
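The fatal/non-fatal split reduces to a substring check against the marker list. In this sketch, "internal/private url detected" is the SSRF marker named above; the second marker string is an illustrative placeholder for the hard path-resolution errors, and `is_workspace_violation` stands in for the runner's real helper.

```python
# SSRF marker is from the change above; the other entry is illustrative.
_WORKSPACE_BLOCK_MARKERS = (
    "internal/private url detected",
    "path resolution failed",
)

def is_workspace_violation(error_text: str) -> bool:
    # Fatal markers abort the turn; anything else is an ordinary tool
    # error the LLM can see and retry on the next iteration.
    lowered = error_text.lower()
    return any(marker in lowered for marker in _WORKSPACE_BLOCK_MARKERS)
```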
* fix: remove unused exec allow-pattern test import
Keep the new ExecTool allow-pattern coverage clean under ruff.
Co-authored-by: Cursor <cursoragent@cursor.com>
---------
Co-authored-by: Xubin Ren <xubinrencs@gmail.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
The Anthropic SDK raises a client-side ValueError when a non-streaming
`messages.create` call could exceed the 10-minute server timeout (e.g.
high `max_tokens` combined with extended thinking budget). The error
text "Streaming is required for operations that may take longer than
10 minutes" was bubbling up to the user as an opaque LLM error in
channels that use the non-stream path (e.g. wecom in #2709).
Detect this specific ValueError in `chat()` and transparently retry
through `chat_stream()` (without `on_content_delta` so behavior matches
the non-stream contract). Other ValueErrors continue to flow through
`_handle_error` unchanged.
Closes #2709
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
When the Matrix homeserver returns M_UNKNOWN_TOKEN / M_FORBIDDEN /
M_UNAUTHORIZED (or soft_logout), the previous _sync_loop kept retrying
sync_forever every 2 seconds forever, spamming the homeserver and
filling logs (#1851). The auth state cannot recover by retrying, so
this is pure noise and a soft DoS on the homeserver.
- Extract `_is_fatal_auth_response()` helper
- In `_on_sync_error`, on fatal auth: set `_running=False` and call
`stop_sync_forever()` so the loop exits cleanly
- Add exponential backoff (2s → 60s cap) to the generic exception path
in `_sync_loop` so transient network blips also stop hammering
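The two helpers above can be sketched as follows; the signatures are illustrative, since the real `_is_fatal_auth_response()` presumably inspects a matrix-nio sync error response object rather than bare arguments.

```python
FATAL_AUTH_ERRCODES = {"M_UNKNOWN_TOKEN", "M_FORBIDDEN", "M_UNAUTHORIZED"}

def is_fatal_auth_response(errcode, soft_logout=False):
    # Auth state cannot recover by retrying, so treat these as terminal
    # and let the sync loop exit instead of hammering the homeserver.
    return bool(soft_logout) or errcode in FATAL_AUTH_ERRCODES

def next_backoff(current_delay: float, cap: float = 60.0) -> float:
    # Exponential backoff for the generic exception path:
    # 2s doubling up to a 60s cap.
    return min(current_delay * 2, cap)
```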
Closes #1851
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
LLM-generated tool calls may wrap URLs in markdown backticks or quotes
(e.g. passing `https://example.com` wrapped in backtick characters),
causing urlparse to produce an empty scheme and netloc, which leads to
all fetch attempts failing silently.
Add URL cleaning at the top of WebFetchTool.execute to strip whitespace,
backticks, double quotes, and single quotes, plus an early rejection guard
for non-http(s) URLs after cleaning.
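The cleaning step can be sketched like this; `clean_url` is an illustrative name for logic that the change places at the top of `WebFetchTool.execute`.

```python
def clean_url(raw: str) -> str:
    # Strip whitespace plus backtick / quote wrappers that LLM tool
    # calls sometimes add around URLs, then reject non-http(s) schemes
    # early instead of letting urlparse fail silently downstream.
    url = raw.strip().strip("`\"'")
    if not url.startswith(("http://", "https://")):
        raise ValueError(f"unsupported URL scheme: {url!r}")
    return url
```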
Matrix sync replays the room timeline on each startup or `/restart`,
causing already-handled messages to be reprocessed (#3553). Even with
`store_sync_tokens=True`, the sync token isn't reliably re-injected
when restoring a session via access_token + load_store(), so the
client re-reads recent timeline entries.
Filter `event.server_timestamp` against the process start time so old
events are dropped at the `_on_message` / `_on_media_message` entry
points. Trade-off: messages received during downtime won't be
processed, which matches the issue reporter's expectation.
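The filter amounts to a millisecond-timestamp comparison against a value captured once at startup; the names here are illustrative, and the real check lives at the `_on_message` / `_on_media_message` entry points.

```python
import time

# Captured once at process start; earlier events are sync replays.
PROCESS_START_MS = int(time.time() * 1000)

def is_stale_event(server_timestamp_ms: int,
                   start_ms: int = PROCESS_START_MS) -> bool:
    # Matrix event.server_timestamp is milliseconds since the epoch.
    return server_timestamp_ms < start_ms
```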
Closes #3553
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>