nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-19 16:12:30 +00:00

Author	SHA1	Message	Date
chengyongru	08c5ce95f2	feat(feishu): per-message session for group top-level messages Align with deer-flow: group top-level messages (no root_id) now get their own session keyed by message_id instead of sharing a single group-wide session. Topic replies continue to share session via root_id.	2026-04-26 03:18:13 +08:00
chengyongru	d5122f6df8	chore(test): remove stale reaction_id from test metadata The production code no longer reads reaction_id from metadata, so remove the leftover key from the test_no_removal_when_message_id_missing test case.	2026-04-26 03:18:13 +08:00
chengyongru	1b231eb69f	fix(test): update reaction cleanup test for _reaction_ids dict The stream-end reaction cleanup now reads from _reaction_ids instead of metadata, so pre-populate the dict in the test instead of passing reaction_id via metadata.	2026-04-26 03:18:13 +08:00
chengyongru	91d5f14fbd	fix(feishu): use message_id as reply target and fix keyword-only arg Align reply targeting with deer-flow: always reply to the inbound message_id (not root_id). The Feishu Reply API keeps responses in the same topic automatically when the target message is inside a topic. Also fix run_in_executor calls that passed reply_in_thread as a positional arg to a keyword-only parameter, and route standalone tool hints through the reply API for group chats.	2026-04-26 03:18:13 +08:00
chengyongru	3ece7256d1	perf(feishu): make reaction non-blocking to speed up inbound dispatch Reaction emoji is now added as a fire-and-forget background task instead of blocking the inbound message pipeline. This removes one API round-trip from the critical path before the agent starts processing.	2026-04-26 03:18:13 +08:00
chengyongru	a0e97e360e	feat(feishu): add reply_in_thread for visual topic grouping When reply_to_message config is enabled, the bot's first reply now uses reply_in_thread=True to create a visual topic/thread in the Feishu client. Subsequent chunks fall back to regular create. The reply_to_message default remains False for backward compatibility. Failed replies still fall back to regular send — messages are never silently dropped.	2026-04-26 03:18:13 +08:00
chengyongru	934372d90b	feat(feishu): add thread-scoped session isolation for group chats Thread replies (messages with root_id != message_id) in group chats now get their own session key: feishu:{chat_id}:{root_id}. This means each Feishu thread has an independent conversation context. Top-level group messages and all private chat messages keep the default session key (no override), consistent with Telegram and Slack channel behavior. Co-authored-by: shenchengtsi <228445050+shenchengtsi@users.noreply.github.com>	2026-04-26 03:18:13 +08:00
T3chC0wb0y	9eff9a70bb	fix(msteams): normalize nbsp in inbound text	2026-04-25 15:26:27 +08:00
T3chC0wb0y	5c2c1bb9ef	fix(msteams): prune bad notify refs	2026-04-25 15:26:27 +08:00
T3chC0wb0y	d40ce81a3d	fix(msteams): send threaded replies via replyToId	2026-04-25 15:26:27 +08:00
chengyongru	8a646d9aec	fix(agent): cap recent history section in system prompt Truncate the "Recent History" section injected by build_system_prompt() to 32K chars. Without this, many accumulated history.jsonl entries could still bloat the system prompt even with per-entry truncation in place.	2026-04-24 01:53:31 +08:00
chengyongru	93bcb0a649	fix(agent): prevent history.jsonl bloat from raw_archive and stuck consolidation Root cause: when consolidation LLM fails, raw_archive() dumped full message content (~1MB) into history.jsonl with no size limit. Since build_system_prompt() injects history.jsonl into every system prompt, all subsequent LLM calls exceeded the 200K context window with error 1261. Additionally, _cap_consolidation_boundary's 60-message cap caused consolidation to get stuck on sessions with long tool chains (200+ iterations), triggering the raw_archive fallback in the first place. Three-layer fix: - Remove _cap_consolidation_boundary: let pick_consolidation_boundary drive chunk sizing based solely on token budget - Truncate archive() input: use tiktoken to cap formatted text to the model's input token budget before sending to consolidation LLM - Truncate raw_archive() output: cap history.jsonl entries at 16K chars	2026-04-24 01:41:01 +08:00
chengyongru	da0ebc64fb	fix(agent): prevent duplicate responses when sub-agents complete concurrently When the main agent spawns multiple sub-agents, each completion independently triggered a new _dispatch, causing 3-4 user-visible responses instead of a single comprehensive report. - Extend _drain_pending to block-wait on pending_queue when sub-agents are still running, keeping the runner loop alive for in-order injection - Pass pending_queue in the system message path so subsequent sub-agent results can still be injected mid-turn via a new dispatch	2026-04-22 18:20:36 +08:00
chengyongru	9bf7f3b420	Merge remote-tracking branch 'origin/main' into nightly	2026-04-22 13:29:10 +08:00
hussein1362	0932189860	fix: handle Windows PermissionError on directory fsync On Windows, opening a directory with O_RDONLY raises PermissionError. Wrap the directory fsync in a try/except PermissionError — NTFS journals metadata synchronously so the directory sync is unnecessary there. Also adjust test assertions to expect 1 fsync call (file only) on Windows vs 2 (file + directory) on POSIX.	2026-04-22 13:19:53 +08:00
hussein1362	512bf59b3c	fix(session): fsync sessions on graceful shutdown to prevent data loss On filesystems with write-back caching (rclone VFS, NFS, FUSE mounts) the OS page cache may buffer recent session writes. If the process is killed before the cache flushes, the most recent conversation turns are silently lost — causing the agent to "forget" recent context and respond to stale history on the next startup. Changes: - session/manager.py: add fsync=True option to save() that flushes the file and its parent directory to durable storage. Add flush_all() that re-saves every cached session with fsync. Default save() behavior is unchanged (no fsync) to avoid performance regression in normal operation. - cli/commands.py: call agent.sessions.flush_all() in the gateway shutdown finally block, after stopping heartbeat/cron/channels. - tests/session/test_session_fsync.py: 8 tests covering fsync flag behavior, flush_all with empty/multiple/errored sessions, and data survival across simulated process restart. - tests/cli/test_commands.py: add sessions attribute to _FakeAgentLoop so the gateway health endpoint test passes with the new shutdown flush.	2026-04-22 13:19:53 +08:00
Xubin Ren	ef8bbab7b3	test(cli): lock _render_interactive_ansi force_terminal to isatty Made-with: Cursor	2026-04-22 13:12:29 +08:00
wood3n	2e419f9ba2	fix(cli): respect sys.stdout.isatty() in commands.py	2026-04-22 13:12:29 +08:00
Xubin Ren	88c619901e	review(providers): tighten comments in reasoning_effort normalize path Made-with: Cursor	2026-04-22 12:49:55 +08:00
hlg	28c42628b0	fix: normalize DashScope reasoning_effort (minimal vs minimum) DashScope rejects the OpenAI-style value "minimal" with `'reasoning_effort.effort' must be one of: 'none', 'minimum', 'low', 'medium', 'high', 'xhigh'`, but nanobot was passing the string through verbatim. Users who tried the documented "minimal" to disable thinking got a 400; users who tried the DashScope-native "minimum" to work around it got `enable_thinking=True` because the internal comparison was a hard string match on "minimal". Introduce a semantic/wire split in `_build_kwargs`: - `semantic_effort` is the internal canonical form (OpenAI vocabulary). "minimum" on the way in is normalized to "minimal" here so both spellings share one meaning. - `wire_effort` is what we actually serialize. For DashScope with semantic_effort == "minimal" we translate to "minimum" on the way out; other providers are unchanged. - `thinking_enabled` and the Kimi thinking branch now compare on `semantic_effort`, so either user spelling correctly disables provider-side thinking. Tests: - Strengthen `test_dashscope_thinking_disabled_for_minimal` to assert the wire value is "minimum" in addition to the extra_body signal; the original version only checked extra_body and let the invalid-value bug slip through. - Add `test_dashscope_thinking_disabled_for_minimum_alias` so a user who read the DashScope docs and configured "minimum" still gets thinking off. - Add `test_non_dashscope_minimal_not_retranslated` to pin down that the DashScope-specific translation does not leak to OpenAI et al.	2026-04-22 12:49:55 +08:00
chengyongru	f6a417e77d	fix(transcription): harden language parameter validation and tests - Add ISO-639 pattern validation (2-3 lowercase letters) to schema - Normalize empty language to None in provider constructors - Extract shared httpx mock stubs, parameterize provider tests - Add test for language=None omitting field from multipart body - Add test for Pydantic pattern validation rejecting invalid codes	2026-04-22 12:41:32 +08:00
k	123d69bfb7	fix: allow specifying transcription language	2026-04-22 12:41:32 +08:00
flobo3	1826ab44fa	feat(transcription): add language parameter for Groq Whisper STT	2026-04-22 12:41:32 +08:00
chengyongru	a4a197fea5	fix(transcription): harden language parameter validation and tests - Add ISO-639 pattern validation (2-3 lowercase letters) to schema - Normalize empty language to None in provider constructors - Extract shared httpx mock stubs, parameterize provider tests - Add test for language=None omitting field from multipart body - Add test for Pydantic pattern validation rejecting invalid codes	2026-04-22 11:02:07 +08:00
k	bc3d734df5	fix: allow specifying transcription language	2026-04-22 11:02:07 +08:00
flobo3	1835f94d8e	feat(transcription): add language parameter for Groq Whisper STT	2026-04-22 11:02:07 +08:00
Xubin Ren	f5b8ee9f78	docs: update v0.1.5.post2 release news	2026-04-21 17:50:54 +00:00
Xubin Ren	950dddec49	chore: bump version to 0.1.5.post2 (tag: v0.1.5.post2)	2026-04-21 17:25:08 +00:00
k	e5b288c6eb	fix: map MiniMax reasoning_effort to reasoning_split	2026-04-22 00:52:56 +08:00
Xubin Ren	558aa98491	chore: temporary keep WebUI source-only	2026-04-21 14:33:44 +00:00
aiguozhi123456	53ba410e49	feat(read_file): add DOCX, XLSX, PPTX support via document.extract_text() Wire up the existing office document extractors in document.py to ReadFileTool by adding an extension guard and _read_office_doc() method that follows the established PDF pattern. Handles missing libraries, corrupt files, empty documents, and 128K truncation consistently.	2026-04-21 22:12:19 +08:00
彭星杰	46864b0911	fix: use try/finally in _extract_xlsx to prevent resource leak	2026-04-21 22:01:17 +08:00
彭星杰	a00beebd06	fix: use context manager in _extract_xlsx to prevent resource leak	2026-04-21 22:01:17 +08:00
chengyongru	e15705b471	fix(tests): add _cancel_active_tasks mock to cmd_new test fixtures The existing test_unified_session tests construct a SimpleNamespace loop mock that now needs _cancel_active_tasks since cmd_new calls it.	2026-04-21 21:50:37 +08:00
chengyongru	d4e34f8c67	fix(commands): intercept non-priority commands during active turn Non-priority slash commands (e.g. /new, /help, /dream-log) arriving while a session has an active LLM turn were silently queued into the pending injection buffer and later injected as raw user messages into the LLM conversation. This caused the model to respond to "/new" as plain text instead of executing the command. Root cause: the run() loop only checked priority commands (/stop, /restart, /status) before routing messages to the pending queue. All other command tiers (exact, prefix) bypassed command dispatch entirely. Changes: - Add CommandRouter.is_dispatchable_command() to match exact/prefix tiers, mirroring the existing is_priority() pattern. - In run(), intercept dispatchable commands before pending queue insertion and dispatch them directly via _dispatch_command_inline(). - Extract _cancel_active_tasks() from cmd_stop for reuse; cmd_new now cancels active tasks before clearing the session to prevent shared mutable state corruption from concurrent asyncio coroutines. - Update /new semantics: stops active task first, then clears session. - Update documentation in help text, docs, and Discord command list.	2026-04-21 21:50:37 +08:00
hussein1362	f8a023218d	fix(telegram): improve markdown rendering for modern LLM output Problem: Modern LLMs (GPT-5.4, Claude, Gemini) produce markdown-heavy responses with numbered lists, headers, and nested formatting. The Telegram channel's _markdown_to_telegram_html() converter has gaps that leave these poorly formatted: 1. Numbered lists (1. 2. 3.) have zero handling and are sent as raw text 2. Headers (# Title) are stripped to plain text, losing visual hierarchy 3. Mid-stream edits send raw markdown (users see **bold** and ### headers while the response generates, before the final HTML conversion) Root Cause: _markdown_to_telegram_html() handles bullets (- *) but skips numbered lists entirely. Headers are stripped of # but not given any emphasis. The streaming path in send_delta() sends buf.text as-is during mid-stream edits (plain text, no parse_mode); only the final _stream_end edit converts to HTML. Fix: 1. Headers now render as <b>bold</b> in the final HTML (using placeholder markers that survive HTML escaping, restored after all other processing) 2. Numbered lists are normalized (extra whitespace after the dot is cleaned) 3. New _strip_md_block() function strips markdown syntax for readable plain-text preview during streaming mid-edits The final _stream_end HTML conversion is unchanged; it still produces full HTML with parse_mode=HTML. Only the intermediate edits are improved. Tests: Added 10 new tests covering: - Headers converting to bold HTML - Numbered list preservation and whitespace normalization - Headers with HTML special characters - Mixed formatting (headers + bullets + numbers + bold) - _strip_md_block for inline formatting, headers, bullets, numbers, links - Streaming mid-edit markdown stripping (initial send + edit)	2026-04-21 21:35:34 +08:00
chengyongru	37ea8b8f5b	fix(retry): recognize ZhiPu 1302 rate-limit error for retry ZhiPu API returns code 1302 with Chinese text "速率限制" instead of standard HTTP 429 + "rate limit", causing the retry engine to treat it as non-transient and fail immediately.	2026-04-21 21:23:20 +08:00
Xubin Ren	1b692debdc	docs(webui): revise README to clarify WebSocket channel setup and sequence of startup steps	2026-04-21 12:46:17 +00:00
chengyongru	c51b653154	fix(retry): recognize ZhiPu 1302 rate-limit error for retry ZhiPu API returns code 1302 with Chinese text "速率限制" instead of standard HTTP 429 + "rate limit", causing the retry engine to treat it as non-transient and fail immediately.	2026-04-21 17:39:51 +08:00
Xubin Ren	c1957e14ff	refactor(memory): centralize cursor validation behind a single gate Move the non-int cursor guard out of the two consumer sites and into a shared ``_iter_valid_entries`` iterator so the invariant lives in one place. Closes three gaps left by the original fix: * ``bool`` is now rejected — ``isinstance(True, int)`` is ``True`` in Python, so the previous guard silently treated ``{"cursor": true}`` as cursor ``1``. * Recovery now returns ``max(valid cursors) + 1``. Under adversarial corruption "first int scanning in reverse" is not the same thing, and only ``max`` keeps the recovered cursor strictly greater than every legitimate cursor still on disk. * Non-int cursors are logged exactly once per ``MemoryStore``. Silently dropping corrupted entries hides the root cause (an external writer to ``memory/history.jsonl``); rate-limiting keeps the log clean when the same poisoned file is read every turn. All 7 tests from the original fix pass unchanged; 3 new tests pin the invariants above. Made-with: Cursor	2026-04-21 14:02:53 +08:00
Muata Kamdibe	c0a11c7cf4	fix(memory): harden cursor recovery against non-integer corruption _next_cursor now checks isinstance(cursor, int) before arithmetic, falling back to a reverse scan of all entries when the last entry's cursor is corrupted. read_unprocessed_history skips entries with non-int cursors instead of crashing on comparison. Root cause: external callers (cron jobs, plugins) occasionally wrote string cursors to history.jsonl, which blocked all subsequent append_history calls with TypeError/ValueError. Includes 7 regression tests covering string, float, null, and list cursor types.	2026-04-21 14:02:53 +08:00
chengyongru	409afe1a3d	test(tools): add basic regression tests for ContextVar routing context	2026-04-21 13:25:30 +08:00
jr_blue_551	ff8c28d5a8	agent: use ContextVar for tool routing context	2026-04-21 13:25:30 +08:00
Xubin Ren	82aa9efc02	test(mcp): pin CancelledError short-circuits the retry loop The retry branch is only reachable via `except Exception`, and `CancelledError` inherits from `BaseException`, so today it naturally bypasses the retry path and /stop still works. Add one focused regression test so any future refactor that widens the retry catch to `BaseException`, re-orders the handlers, or adds `CancelledError` to `_TRANSIENT_EXC_NAMES` fails CI instead of silently swallowing /stop. Made-with: Cursor	2026-04-21 13:24:40 +08:00
hussein1362	368752e707	fix(mcp): retry once on transient connection errors When an MCP server restarts or a network connection drops between tool calls, the existing session throws ClosedResourceError, BrokenPipeError, ConnectionResetError, etc. Currently these are caught as generic exceptions and returned as permanent failures to the LLM, which then tells the user 'my tools are broken.' This change adds a single automatic retry with a 1-second backoff for transient connection-class errors in MCPToolWrapper, MCPResourceWrapper, and MCPPromptWrapper. Non-transient errors (ValueError, RuntimeError, McpError, etc.) are not retried. The retry is conservative: - Only 1 retry (not configurable, to keep the change minimal) - Only for a specific set of connection-class exceptions - Matched by exception class name to avoid importing anyio/etc. - 1s sleep between attempts to allow the server to recover - Clear logging distinguishes retried vs permanent failures In production this eliminates most 'MCP tool call failed: ClosedResourceError' noise when MCP bridge processes restart (e.g. after config changes or OOM kills). Tests: 22 new tests covering retry, exhaustion, non-transient bypass, timeout bypass, and all three wrapper types.	2026-04-21 13:24:40 +08:00
chengyongru	51cb260f05	test(tools): add basic regression tests for ContextVar routing context	2026-04-21 11:25:57 +08:00
jr_blue_551	e4fa58ef45	agent: use ContextVar for tool routing context	2026-04-21 11:25:57 +08:00
Xubin Ren	6c24f24e9e	feat(models): add support for kimi-k2.6 with temperature override and update documentation	2026-04-20 18:18:06 +00:00
Xubin Ren	009cce78ad	fix(anthropic): also enforce leading-user + empty-array recovery Extend `_merge_consecutive` so the three invariants from `LLMProvider._enforce_role_alternation` all hold for Anthropic: 1. collapse consecutive same-role turns (unchanged) 2. no trailing assistant — Anthropic rejects prefill (unchanged) 3. no leading assistant — Anthropic requires the first turn be user 4. non-empty messages array — recover the last stripped assistant as a user turn when every turn got stripped, so callers don't hit a secondary "messages array empty" 400 Anthropic-specific wrinkle: `tool_use` blocks live inside `content` (not a separate `tool_calls` field) and are illegal inside user turns, so both recovery paths skip any message carrying them rather than silently producing a malformed request. Adds 4 unit tests covering the new branches, including the tool_use opt-outs, and updates the existing `test_single_assistant_stripped` to reflect the new rerouting contract. Made-with: Cursor	2026-04-21 01:32:32 +08:00
hussein1362	2f02342083	fix(anthropic): strip trailing assistant messages to prevent prefill error Anthropic does not support assistant-message prefill and returns a 400 error when the conversation ends with an assistant turn. This commonly happens when heartbeat/system messages accumulate trailing assistant replies in the session history. The _merge_consecutive method already handles same-role merging but did not strip trailing assistant messages. The base provider's _enforce_role_alternation (used by OpenAI-compat) does strip them, but AnthropicProvider uses its own _merge_consecutive instead. Add a trailing-assistant stripping loop to _merge_consecutive, matching the behavior already present in _enforce_role_alternation. Includes 7 new tests covering merge + strip behavior.	2026-04-21 01:32:32 +08:00
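
The cursor-validation rule described in commit c1957e14ff (reject non-int cursors, reject bool explicitly, recover with max(valid cursors) + 1) can be sketched as follows. This is a minimal illustration, not the repository's code: the function names iter_valid_entries and next_cursor, the in-memory list of dicts, and the starting cursor of 1 for an empty history are all assumptions made for the example.

```python
from typing import Any, Dict, Iterable, Iterator


def iter_valid_entries(entries: Iterable[Dict[str, Any]]) -> Iterator[Dict[str, Any]]:
    """Yield only entries whose "cursor" is a genuine int (hypothetical helper)."""
    for entry in entries:
        cursor = entry.get("cursor")
        # isinstance(True, int) is True in Python, so bool is excluded explicitly.
        if isinstance(cursor, int) and not isinstance(cursor, bool):
            yield entry


def next_cursor(entries: Iterable[Dict[str, Any]]) -> int:
    """Return a cursor strictly greater than every valid cursor seen."""
    cursors = [entry["cursor"] for entry in iter_valid_entries(entries)]
    # Assumption: an empty or fully corrupted history restarts at 1.
    return max(cursors) + 1 if cursors else 1


history = [
    {"cursor": 3},
    {"cursor": "7"},   # string cursor written by an external caller
    {"cursor": True},  # bool would otherwise pass as the int 1
    {"cursor": 5.0},   # float
    {"cursor": None},
    {"cursor": 2},     # valid, but not the maximum
]
print(next_cursor(history))
```

Taking the maximum rather than the first valid cursor found in a reverse scan matters in the example above: the last valid entry has cursor 2, yet a larger legitimate cursor (3) is still present earlier in the list.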


2121 Commits
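
The semantic/wire split for reasoning_effort described in commit 28c42628b0 can be illustrated with a small sketch. It is written under assumptions: the function name split_effort, the provider identifier strings, and the tuple return shape are hypothetical, and the real `_build_kwargs` is not reproduced here. Only the two translations stated in the commit message are modeled.

```python
from typing import Optional, Tuple


def split_effort(provider: str, effort: Optional[str]) -> Tuple[Optional[str], Optional[str]]:
    """Return (semantic_effort, wire_effort) for a configured effort value.

    Hypothetical helper: names and provider strings are illustrative only.
    """
    # Inbound: the DashScope-native spelling "minimum" is treated as the
    # canonical OpenAI-style "minimal" so both spellings share one meaning.
    semantic = "minimal" if effort == "minimum" else effort

    # Outbound: only DashScope gets "minimum" on the wire; every other
    # provider receives the canonical value unchanged.
    wire = semantic
    if provider == "dashscope" and semantic == "minimal":
        wire = "minimum"
    return semantic, wire


for provider, effort in [("dashscope", "minimal"), ("dashscope", "minimum"), ("openai", "minimal")]:
    semantic, wire = split_effort(provider, effort)
    # Decisions such as disabling thinking compare on the semantic value,
    # so either user spelling behaves the same way.
    print(provider, effort, "->", semantic, wire, "thinking off:", semantic == "minimal")
```

Keeping comparisons on the semantic value and translating only at serialization time is what lets one internal string match cover both user spellings without leaking the DashScope vocabulary to other providers.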