nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-28 04:21:17 +00:00

Author	SHA1	Message	Date
chengyongru	bc4cc49a59	feat(agent): mid-turn message injection for responsive follow-ups (#2985 ) * feat(agent): add mid-turn message injection for responsive follow-ups Allow user messages sent during an active agent turn to be injected into the running LLM context instead of being queued behind a per-session lock. Inspired by Claude Code's mid-turn queue drain mechanism (query.ts:1547-1643). Key design decisions: - Messages are injected as natural user messages between iterations, no tool cancellation or special system prompt needed - Two drain checkpoints: after tool execution and after final LLM response ("last-mile" to prevent dropping late arrivals) - Bounded by MAX_INJECTION_CYCLES (5) to prevent consuming the iteration budget on rapid follow-ups - had_injections flag bypasses _sent_in_turn suppression so follow-up responses are always delivered Closes #1609 * fix(agent): harden mid-turn injection with streaming fix, bounded queue, and message safety - Fix streaming protocol violation: Checkpoint 2 now checks for injections BEFORE calling on_stream_end, passing resuming=True when injections found so streaming channels (Feishu) don't prematurely finalize the card - Bound pending queue to maxsize=20 with QueueFull handling - Add warning log when injection batch exceeds _MAX_INJECTIONS_PER_TURN - Re-publish leftover queue messages to bus in _dispatch finally block to prevent silent message loss on early exit (max_iterations, tool_error, cancel) - Fix PEP 8 blank line before dataclass and logger.info indentation - Add 12 new tests covering drain, checkpoints, cycle cap, queue routing, cleanup, and leftover re-publish	2026-04-11 02:11:02 +08:00
chengyongru	df6f9dd171	fix(wecom): use reply_stream for progress messages to avoid errcode=40008 The plain reply() uses cmd="reply" which does not support "text" msgtype and causes WeCom API to return errcode=40008 (invalid message type). Unify both progress and final text messages to use reply_stream() (cmd="aibot_respond_msg"), differentiating via finish flag. Fixes #2999	2026-04-10 22:20:28 +08:00
chengyongru	6af81bc4a3	feat(agent): auto compact — proactive session compression to reduce token cost and latency (#2982 ) When a user is idle for longer than a configured TTL, nanobot proactively compresses the session context into a summary. This reduces token cost and first-token latency when the user returns — instead of re-processing a long stale context with an expired KV cache, the model receives a compact summary and fresh input.	2026-04-10 17:43:42 +08:00
chengyongru	4a33c1392b	test(channels): add media support tests for QQ and WeCom channels Cover helpers (sanitize_filename, guess media type), outbound send (exception handling, media-then-text order, fallback), inbound message processing (attachments, dedup, empty content), _post_base64file payload filtering, and WeCom upload/download flows.	2026-04-10 17:05:59 +08:00
chengyongru	4b0fdffe39	fix(wecom): harden upload/download, extract media type helper - Use asyncio.to_thread for file I/O to avoid blocking event loop - Add 200MB upload size limit with early rejection - Fix file handle leak by using context manager - Use memoryview for upload chunking to reduce peak memory - Add inbound download size check to prevent OOM - Use asyncio.to_thread for write_bytes in download path - Extract inline media_type detection to _guess_wecom_media_type()	2026-04-10 17:05:59 +08:00
chengyongru	4fe23a01c9	fix(wecom): harden upload and inbound media handling - Use asyncio.to_thread for file I/O to avoid blocking event loop - Add 200MB upload size limit with early rejection - Fix file handle leak by using context manager - Free raw bytes early after chunking to reduce memory pressure - Add file attachments to media_paths (was text-only, inconsistent with image) - Use robust _sanitize_filename() instead of os.path.basename() for path safety - Remove re-raise in send() for consistency with QQ channel - Fix truncated media_id logging for short IDs	2026-04-10 17:05:59 +08:00
gem12	973b888d39	feat(channels): Add full media support for QQ and WeCom channels QQ channel improvements (on top of nightly): - Add top-level try/except in _on_message and send() for resilience - Use defensive getattr() for attachment attributes (botpy version compat) - Skip file_name for image uploads to avoid QQ rendering as file attachment - Extract only file_info from upload response to avoid extra fields - Handle protocol-relative URLs (//...) in attachment downloads WeCom channel improvements: - Add _upload_media_ws() for WebSocket 3-step media upload protocol - Send media files (image/video/voice/file) via WeCom rich media API - Support progress messages (plain reply) vs final response (streaming) - Support proactive send when no frame available (cron push) - Pass media_paths to message bus for downstream processing	2026-04-10 17:05:59 +08:00
flobo3	7b1ce24600	fix: strip <thought> blocks from Gemma 4 and similar models	2026-04-10 00:58:00 +08:00
chengyongru	3bece171c2	docs(websocket): add WebSocket channel documentation Comprehensive guide covering wire protocol, configuration reference, token issuance, security notes, and common deployment patterns.	2026-04-09 15:56:34 +08:00
chengyongru	8f7ce9fef7	fix(websocket): harden security and robustness - Use hmac.compare_digest for timing-safe static token comparison - Add issued token capacity limit (_MAX_ISSUED_TOKENS=10000) with 429 response - Use atomic pop in _take_issued_token_if_valid to eliminate TOCTOU window - Enforce TLSv1.2 minimum version for SSL connections - Extract _safe_send helper for consistent ConnectionClosed handling - Move connection registration after ready send to prevent out-of-order delivery - Add HTTP-level allow_from check and client_id truncation in process_request - Make stop() idempotent with graceful shutdown error handling - Normalize path via validator instead of leaving raw value - Default websocket_requires_token to True for secure-by-default behavior - Add integration tests and ws_test_client helper - Refactor tests to use shared _ch factory and bus fixture	2026-04-09 15:56:34 +08:00
chengyongru	d327c19db0	fix(websocket): handle ConnectionClosed gracefully in send and send_delta	2026-04-09 15:56:34 +08:00
Jack Lu	e00dca2f84	feat(channels): add WebSocket server channel and tests Port Python implementation from a1ec7b192ad97ffd58250a720891ff09bbb73888 (websocket channel module and channel tests; excludes webui debug app).	2026-04-09 15:56:34 +08:00
Jiajun Xie	51200a954c	fix(feishu): improve voice message download with detailed logging - Add explicit error logging for missing file_key and message_id - Add logging for download failures - Change audio extension from .opus to .ogg for better Whisper compatibility - Feishu voice messages are opus in OGG container; .ogg is more widely recognized	2026-04-09 10:14:26 +08:00
chengyongru	c121547114	refactor(feishu): simplify tool hint to append-only, delegate to send_delta for throttling - Make tool_hint_prefix configurable in FeishuConfig (default: 🔧) - Delegate tool hint card updates from send() to send_delta() so hints automatically benefit from _STREAM_EDIT_INTERVAL throttling - Fix staticmethod calls to use self.__class__ instead of self - Document all supported metadata keys in send_delta docstring - Add test for empty/whitespace-only tool hint with active stream buffer	2026-04-08 21:02:21 +08:00
xzq.xu	dcc9c057bb	fix(tool-hints): deduplicate by formatted string + per-line inline display Two display fixes based on real-world Feishu testing: 1. tool_hints.py: format_tool_hints now deduplicates by comparing the fully formatted hint string instead of tool name alone. This fixes `ls /Desktop` and `ls /Downloads` being incorrectly merged as `ls /Desktop × 2`. Truly identical calls still fold correctly. (_group_consecutive and all abbreviation logic preserved unchanged.) 2. feishu.py: inline tool hints now display one tool per line with 🔧 prefix, and use double-newline trailing to prevent Setext heading rendering when followed by markdown `---`. Made-with: Cursor	2026-04-08 21:02:21 +08:00
xzq.xu	586d4e2411	fix(feishu): preserve tool hints in final card content Tool hints should be kept as permanent content in the streaming card so users can see which tools were called (matching the standalone card behavior). Previously, hints were stripped when new deltas arrived or when the stream ended, causing tool call information to disappear. Now: - New delta: hint becomes permanent content, delta appends after it - New tool hint: replaces the previous hint (unchanged) - Resuming/stream_end: hint is preserved in the final text Updated 3 tests to verify hint preservation semantics. Made-with: Cursor	2026-04-08 21:02:21 +08:00
xzq.xu	a4bb1923ac	fix(feishu): prevent tool hint stacking and clean hints on stream_end Three fixes for inline tool hints: 1. Consecutive tool hints now replace the previous one instead of stacking — the old suffix is stripped before appending the new one. 2. When _resuming flushes the buffer, any trailing tool hint suffix is removed so it doesn't persist into the next streaming segment. 3. When final _stream_end closes the card, tool hint suffix is cleaned from the text before the final card update. Adds 3 regression tests covering all three scenarios. Made-with: Cursor	2026-04-08 21:02:21 +08:00
xzq.xu	8d6f41e484	feat(feishu): streaming resuming + inline tool hints Two improvements to Feishu streaming card experience: 1. Handle _resuming in send_delta: when a mid-turn _stream_end arrives with resuming=True (tool call between segments), flush current text to the card but keep the buffer alive so subsequent segments append to the same card instead of creating a new one. 2. Inline tool hints into streaming cards: when a tool hint arrives while a streaming card is active, append it to the card content (e.g. "🔧 web_fetch(...)") instead of sending a separate card. The hint is automatically stripped when the next delta arrives. Made-with: Cursor	2026-04-08 21:02:21 +08:00
chengyongru	4962867112	fix(tool-hint): fold paths in exec commands instead of blind truncation exec tool hints previously used val[:40] which cut paths mid-segment (e.g. "D:\Documents\GitHub\nanobot.worktree…"). Now uses regex to detect file paths in commands and abbreviates them properly, with smart truncation at chain separators (&&, \|, ;) as fallback.	2026-04-08 11:37:17 +08:00
Jiajun	473637ceff	feat(feishu): add done emoji support for reaction lifecycle (#2899 ) * feat(feishu): add done emoji support for reaction lifecycle * feat(feishu): add done emoji support and update documentation	2026-04-07 23:56:23 +08:00
chengyongru	c44d4f2b2b	fix(test): fix two flaky tests on Windows - test_exec_head_tail_truncation: use temp script file instead of python -c to avoid cmd.exe quote-parsing issues after PR #2893 - test_grep_files_with_matches_supports_head_limit_and_offset: query full result set first to avoid mtime-dependent sort assumption	2026-04-07 21:30:45 +08:00
Xubin Ren	ae27d69ecb	fix(exec): add Windows support for shell command execution ExecTool hardcoded bash, breaking exec on Windows. Now uses cmd.exe via COMSPEC on Windows with a curated minimal env (PATH, SYSTEMROOT, etc.) that excludes secrets. bwrap sandbox gracefully skips on Windows.	2026-04-07 21:13:32 +08:00
chengyongru	ba38d41ad1	Merge remote-tracking branch 'origin/main' into nightly	2026-04-07 20:49:53 +08:00
chengyongru	ad4d095080	feat(memory):dream enhancement (#2887 ) * feat(dream): enhance memory cleanup with staleness detection - Phase 1: add [FILE-REMOVE] directive and staleness patterns (14-day threshold, completed tasks, superseded info, resolved tracking) - Phase 2: add explicit cleanup rules, file paths section, and deletion guidance to prevent LLM path confusion - Inject current date and file sizes into Phase 1 context for age-aware analysis - Add _dream_debug() helper for observability (dream-debug.log in workspace) - Log Phase 1 analysis output and Phase 2 tool events for debugging Tested with glm-5-turbo: MEMORY.md reduced from 149 to 108-129 lines across two rounds, correctly identifying and removing weather data, detailed incident info, completed research, and stale discussions. * refactor(dream): replace _dream_debug file logger with loguru Remove the custom _dream_debug() helper that wrote to dream-debug.log and use the existing loguru logger instead. Phase 1 analysis is logged at debug level, tool events at info level — consistent with the rest of the codebase and no extra log file to manage. * fix(dream): make stale scan independent of conversation history Reframe Phase 1 from a single comparison task to two independent tasks: history diff AND proactive stale scan. The LLM was skipping stale content that wasn't referenced in conversation history (e.g. old triage snapshots). Now explicitly requires scanning memory files for staleness patterns on every run. * fix(dream): correct old_text param name and truncate debug log - Phase 2 prompt: old_string -> old_text to match EditFileTool interface - Phase 1 debug log: truncate analysis to 500 chars to avoid oversized lines * refactor(dream): streamline prompts by separating concerns Phase 1 owns all staleness judgment logic; Phase 2 is pure execution guidance. Remove duplicated cleanup rules from Phase 2 since Phase 1 already determines what to add/remove. Fix remaining old_string -> old_text. Total prompt size reduced ~45% (870 -> 480 tokens). * fix(dream): add FILE-REMOVE execution guidance to Phase 2 prompt Phase 2 was only processing [FILE] additions and ignoring [FILE-REMOVE] deletions after the cleanup rules were removed. Add explicit mapping: [FILE] → add content, [FILE-REMOVE] → delete content.	2026-04-07 15:41:54 +08:00
Xubin Ren	82dec12f66	refactor: extract tool hint formatting to utils/tool_hints.py - Move _tool_hint implementation from loop.py to nanobot/utils/tool_hints.py - Keep thin delegation in AgentLoop._tool_hint for backward compat - Update test imports to test format_tool_hints directly Made-with: Cursor	2026-04-07 15:15:07 +08:00
chengyongru	3e3a7654f8	fix(agent): address code review findings for tool hint enhancement - C1: Fix IndexError on empty list arguments via _get_args() helper - I1: Remove redundant branch in _fmt_known - I2: Export abbreviate_path from nanobot.utils.__init__ - I3: Fix _abbreviate_url negative-budget format consistency - S1: Move FORMATS to class-level _TOOL_HINT_FORMATS constant - S2: Add list_dir to FORMATS registry (ls path) - G1-G5: Add tests for empty list args, None args, URL edge cases, mixed folding groups, and list_dir format	2026-04-07 15:15:07 +08:00
chengyongru	b1d3c00deb	test(feishu): add compatibility tests for new tool hint format	2026-04-07 15:15:07 +08:00
chengyongru	238a9303d0	test: update tool_hint assertion to match new format	2026-04-07 15:15:07 +08:00
chengyongru	8ca9960077	feat(agent): rewrite _tool_hint with registry, path abbreviation, and call folding	2026-04-07 15:15:07 +08:00
chengyongru	f452af6c62	feat(utils): add abbreviate_path for smart path/URL truncation	2026-04-07 15:15:07 +08:00
Xubin Ren	02597c3ec9	fix(runner): silent retry on empty response before finalization	2026-04-07 15:03:41 +08:00
Xubin Ren	0355f20919	test: add regression tests for _resolve_mentions 7 tests covering: single mention, dual IDs, no-id skip, multiple mentions, no mentions, empty text, and key-not-in-text edge case. Made-with: Cursor	2026-04-07 14:03:55 +08:00
wudongxue	b3294f79aa	fix(feishu): ensure access token is initialized before fetching bot open_id The lark-oapi client requires token types to be explicitly configured so that the SDK can obtain and attach the tenant_access_token to raw requests. Without this, `_fetch_bot_open_id()` would fail with "Missing access token for authorization" because the token had not been provisioned at the time of the call.	2026-04-07 14:03:55 +08:00
wudongxue	0291d1f716	feat: resolve mentions data	2026-04-07 14:03:55 +08:00
Xubin Ren	075bdd5c3c	refactor: move SafeFileHistory to module level + add regression tests - Promote _SafeFileHistory to module-level SafeFileHistory for testability - Add 5 regression tests: surrogates, normal text, emoji, mixed CJK, multi-surrogates Made-with: Cursor	2026-04-07 13:57:34 +08:00
bahtya	64bd7234b3	fix(cli): sanitize surrogate characters in prompt history to prevent UnicodeEncodeError On Windows, certain Unicode input (emoji, mixed-script text, surrogate pairs) causes prompt_toolkit's FileHistory to crash with UnicodeEncodeError when writing the history file. Fix: wrap FileHistory with a _SafeFileHistory subclass that sanitizes surrogate characters before writing, replacing invalid sequences instead of crashing. Fixes #2846	2026-04-07 13:57:34 +08:00
flobo3	67e6f8cc7a	fix(docker): strip Windows CRLF from entrypoint.sh	2026-04-07 13:32:01 +08:00
Jiajun Xie	5ee96721f7	ci: add ruff lint check for unused imports and variables Add CI step to detect unused imports (F401) and unused variables (F841) with ruff. Clean up existing violations: - Remove unused Consolidator import in agent/__init__.py - Remove unused re import in agent/loop.py - Remove unused Path import in channels/feishu.py - Remove unused ContentRepositoryConfigError import in channels/matrix.py - Remove unused field and CommandHandler imports in channels/telegram.py - Remove unused exception variable in channels/weixin.py	2026-04-07 13:30:49 +08:00
04cb	f4904c4bdf	fix(cron): add optional name parameter to separate job label from message (#2680 )	2026-04-07 13:22:20 +08:00
Leo fu	44c7992095	fix(filesystem): correct write success message from bytes to characters len(content) counts Unicode code points, not UTF-8 bytes. For non-ASCII content such as Chinese or emoji, the reported count would be lower than the actual bytes written to disk, which is misleading to the agent.	2026-04-07 13:22:00 +08:00
bahtya	cefeddab8e	fix(matrix): correct e2eeEnabled camelCase alias mapping The pydantic to_camel function generates 'e2EeEnabled' (treating 'ee' as a word boundary) for the field 'e2ee_enabled'. Users writing 'e2eeEnabled' in their config get the default value instead. Fix: add explicit alias='e2eeEnabled' to override the incorrect auto-generated alias. Both 'e2eeEnabled' and 'e2ee_enabled' now work. Fixes #2851	2026-04-07 13:20:55 +08:00
Xubin Ren	bf459c7887	fix(docker): fix volume mount path and add permission error guidance	2026-04-06 13:15:40 +00:00
Xubin Ren	4dac0a8930	docs: update nanobot docs badge	2026-04-06 11:55:47 +00:00
Xubin Ren	a30e84bfd1	docs: update v0.1.5 release news	2026-04-06 11:46:16 +00:00
Xubin Ren	6269876bc7	docs: update v0.1.5 release news	2026-04-06 11:45:37 +00:00
Xubin Ren	bc2253c83f	docs: update v0.1.5 release news	2026-04-06 11:45:08 +00:00
Xubin Ren	b719da7400	fix(feishu): use RawRequest for bot info API	2026-04-06 11:39:23 +00:00
Xubin Ren	79234d237e	chore: bump version to 0.1.5 v0.1.5	2026-04-06 11:26:07 +00:00
Xubin Ren	1243c08745	docs: update news section	2026-04-06 11:22:20 +00:00
Xubin Ren	dad9c07843	fix(tests): update Tavily usage tests to match actual API response shape The _parse_tavily_usage implementation was updated to use the real {account: {plan_usage, plan_limit, ...}} structure, but the tests still used the old flat {used, limit, breakdown} format. Made-with: Cursor	2026-04-06 19:17:55 +08:00

1 2 3 4 5 ...

1776 Commits