nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-19 16:12:30 +00:00

Author	SHA1	Message	Date
chengyongru	ff6b014a07	refactor: allow model/context_window_tokens override in from_config() - Pop model and context_window_tokens from extra kwargs before forwarding to __init__, allowing callers like _run_gateway to pass snapshot-derived values instead of config defaults - _run_gateway now explicitly passes model/context_window_tokens from provider_snapshot to preserve pre-refactor behavior	2026-05-09 15:30:48 +08:00
chengyongru	733b34d685	refactor: address code review feedback on AgentLoop.from_config() - Accept optional `provider` kwarg in from_config() to avoid double instantiation in _run_gateway (which already builds provider_snapshot) - Restore try/except ValueError wrappers in serve() and agent() for clean error messages on provider creation failure - Update test: _FakeAgentLoop captures provider from kwargs, restore strong assertion (seen["provider"] is provider)	2026-05-09 15:30:48 +08:00
chengyongru	3202f58c41	refactor: introduce AgentLoop.from_config() to centralize loop assembly Extract duplicated bus/provider/loop initialization from CLI commands (serve, _run_gateway, agent) and Nanobot facade into a single AgentLoop.from_config() classmethod. - Remove _make_provider() from cli/commands.py and nanobot.py - Remove inline provider creation in all three CLI entry points - AgentLoop.from_config() creates MessageBus, calls make_provider(), and assembles AgentLoop with all standard config-derived parameters - Supports **extra overrides for callers that need custom args (e.g. cron_service, session_manager, provider_snapshot_loader) - Update tests to mock make_provider at nanobot.providers.factory and add from_config classmethod to _FakeAgentLoop fixtures This is PR 1/4 of the model-preset feature decomposition.	2026-05-09 15:30:48 +08:00
Xubin Ren	9252f4d826	Revert "fix(agent): persist _last_summary across restarts with used sentinel" This reverts commit e5a1416a37b423de95b0fa279e9473110a678112.	2026-05-09 15:00:54 +08:00
chengyongru	e5a1416a37	fix(agent): persist _last_summary across restarts with used sentinel The previous implementation popped _last_summary from session.metadata after injecting it into the prompt, then saved the session. This caused the summary to be permanently lost after a process restart, making the AI forget archived context and appear to ignore memory or reference non-existent previous messages. Replace the destructive pop with a _last_summary_used sentinel: - _last_summary stays in metadata for restart survival - _last_summary_used prevents duplicate injection within the same turn - Clear the sentinel whenever a new summary is generated Updates tests to match the new persistence behavior.	2026-05-09 14:58:38 +08:00
Xubin Ren	56eee06736	feat(webui): add BYOK web search settings Let WebUI users configure the single web search provider credential from BYOK while keeping saved secrets masked and hot-reloaded for new searches. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-09 14:52:48 +08:00
Xubin Ren	7c1aa5ae31	docs: refine AI contributor guidance Clarify nanobot's preference for small core changes, reviewable PR boundaries, and careful handling of prompt/context surfaces so AI contributors preserve the project's maintenance philosophy. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-09 14:00:32 +08:00
chengyongru	6eef3d0f15	docs: add CLAUDE.md and .agent/ guides for AI contributors Add CLAUDE.md at the repository root to orient future Claude Code instances, and split detailed constraints into .agent/: - .agent/design.md — architectural constraints (core small, duplication over abstraction, minimal changes, explicit over magical) - .agent/security.md — workspace/SSRF/shell sandbox boundaries - .agent/gotchas.md — config ${VAR}, Windows compat, templates, heartbeat virtual tool call, atomic writes, ruff format warning, skills extension point Also updates .gitignore to not ignore .agent/.	2026-05-09 14:00:32 +08:00
Eugene Chae	4d7bf5bb8a	fix(cli): handle retry-wait messages in interactive mode	2026-05-09 13:50:39 +08:00
Xubin Ren	3231aaf9ee	fix(image): prevent duplicate delivery and replay artifacts	2026-05-09 05:45:13 +00:00
Vilius Vystartas	4d168c571c	fix: replace raise with logger.error + return fail in exception handlers The previous version changed return fail/pass to raise, which broke graceful degradation — tests expect upload/content failures to be caught and handled, not propagated. Now logs errors with exc_info=True while preserving existing control flow (return fail for upload/content send, stop typing for stream).	2026-05-09 01:04:20 +08:00
Vilius Vystartas	31c45fe798	fix: raise instead of swallowing on outbound-message path errors Per reviewer request (chengyongru): raise exceptions on the outbound message path so ChannelManager can trigger retry logic, matching the pattern from commit 98c2f7cc (Weixin channel cleanup). Changes: - _resolve_server_upload_limit_bytes: warning → error (non-fatal config) - _upload_and_send_attachment media upload: raise instead of swallow - _upload_and_send_attachment room send: raise instead of swallow - send_delta stream edit: error + raise after cleanup - weixin _load_state: warning → error (non-fatal state load)	2026-05-09 01:04:20 +08:00
Vilius Vystartas	ba1e5036f5	fix: log errors in silent exception handlers (matrix + weixin channels) The Matrix channel had 4 bare except blocks that silently swallowed transport errors with no logging — stream send/edit failures, media upload failures, server config fetch failures, and room content send failures. The Weixin channel had 1 silent state-load failure. This mirrors commit 98c2f7cc ('fix(weixin): raise exceptions instead of silently dropping messages') for the Matrix channel and adds a warning for the remaining silent catch in Weixin's _load_state. All failures now log at warning level with exc_info=True so operators can diagnose intermittent Matrix/Weixin transport issues.	2026-05-09 01:04:20 +08:00
yorkhellen	843e96f09d	fix(feishu): send all messages to topic when in thread	2026-05-09 01:03:57 +08:00
chengyongru	908f1246d8	fix(cli): sanitize surrogate code points before entering message bus On Windows, prompt_toolkit produces lone surrogate code points (e.g. \ud83d\udc08) for emoji input. These propagate through the message bus and crash at json.dumps() / file write time because surrogates cannot be encoded as UTF-8. Extract _sanitize_surrogates() that round-trips through UTF-16 to reconstruct paired surrogates into real characters (e.g. \ud83d\udc08 → 🐈), replacing unpaired surrogates with U+FFFD. Apply it at the CLI input path and reuse in SafeFileHistory.	2026-05-09 01:03:34 +08:00
Xubin Ren	bbdf1db30d	fix(webui): render generated images as rounded previews Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 23:48:01 +08:00
Xubin Ren	151c3d5ad0	fix(webui): restore chat selection after settings Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 23:48:01 +08:00
Xubin Ren	2cc32ca07c	feat(webui): redesign settings and BYOK configuration Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 23:48:01 +08:00
Xubin Ren	451d740849	fix(webui): polish delete dialog and sidebar toggles Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 13:28:34 +00:00
Xubin Ren	cbd5b06075	fix(memory): align replay overflow with history trimming Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 20:37:03 +08:00
Xubin Ren	24daf9a51c	test(memory): accept replay window in consolidation assertion Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 20:37:03 +08:00
Xubin Ren	91ade9eaac	fix(memory): consolidate history hidden by replay window Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 20:37:03 +08:00
Xubin Ren	2c830ca817	test(weixin): stabilize typing keepalive assertion Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 20:06:23 +08:00
Xubin Ren	e936ed48bd	feat: add image generation tool and WebUI mode Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 20:06:23 +08:00
chengyongru	3a2f47d720	fix(onboard): allow empty strings and falsy values in input fields Fixes two related input-handling bugs in the onboard wizard: 1. _input_text treated "" as None, preventing users from clearing optional string fields or entering empty strings intentionally. 2. _input_model_with_autocomplete used `if value else None`, which discarded falsy values such as empty strings or 0. To support clearing optional string fields, add _is_str_or_none() and normalize empty strings to None inside _configure_pydantic_model only when the field annotation is `str | None`. Required str fields keep "" as a valid value. Also included: - Remember last selected item in provider/channel/model menus for better UX when configuring multiple items. - Rename _SIMPLE_TYPES and _MENU_DISPATCH to lowercase to follow Python naming conventions (they are local variables, not constants). - Remove unused imports in test file. Extracted from PR #3358.	2026-05-08 13:21:51 +08:00
zhonghongwei	6a3069514c	fix(api): remove enable_compression to restore real SSE streaming The HTTP compression buffer in aiohttp held all SSE chunks until the stream ended, making streaming appear batched instead of incremental. SSE payloads are small and frequent, so compression provides negligible benefit while breaking real-time delivery.	2026-05-07 22:03:27 +08:00
chengyongru	536c456e5e	fix(channels): restore bound logger in discord and websocket PR introduced module-level logger in static methods, which drops the channel context bound by BaseChannel.__init__. Revert to self._channel.logger / self.logger to preserve log labels. Also remove @staticmethod since these methods legitimately need instance access (F821 was the real issue, not the logger source).	2026-05-07 13:07:22 +08:00
yorkhellen	a2f5de6838	refactor: fix import order for logger in discord.py	2026-05-07 13:07:22 +08:00
yorkhellen	10a0bb0fb3	refactor: use module-level logger in static methods	2026-05-07 13:07:22 +08:00
yorkhellen	4773589685	fix: F821 undefined name errors in channels	2026-05-07 13:07:22 +08:00
yorkhellen	4a4e0af0ba	ci: Enable full ruff -F (all F rules) checks	2026-05-07 13:07:22 +08:00
chengyongru	9a8c4da0c4	refactor(logging): preserve tracebacks in remaining except blocks Follow-up to PR #3651: - Replace logger.error with logger.exception inside except blocks so stack traces are no longer lost: - providers/transcription.py (5 occurrences) - agent/tools/mcp.py (1 occurrence) - Replace stdlib logging.getLogger with loguru logger in providers/openai_compat_provider.py for consistency.	2026-05-07 13:06:59 +08:00
Jefsky	44a341335a	fix(dream): restore cursor with memory state Track the Dream cursor in memory versioning so restores do not skip history after rolling back Dream commits. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-07 01:06:05 +08:00
Xubin Ren	ac18a8baad	feat(webui): add localized slash commands Add a session-scoped slash command palette sourced from backend command metadata, and keep welcome-page quick actions localized across all WebUI languages. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-07 00:20:28 +08:00
chengyongru	49c07aa45a	style: address code review feedback - Consistent "WeChat" prefix in context_token error message - Use object() instead of httpx.AsyncClient() in new tests to avoid resource leak warnings	2026-05-06 23:52:50 +08:00
chengyongru	98c2f7cc27	fix(weixin): raise exceptions instead of silently dropping messages _send_text() swallowed API errors (non-zero errcode) with just a warning log, and send() had three silent return paths (no client, session paused, no context_token). Neither triggered ChannelManager's retry logic, causing persistent message loss until a new inbound message refreshed the context_token. Now all failure paths raise RuntimeError, matching BaseChannel's contract and enabling proper retry behavior.	2026-05-06 23:52:50 +08:00
chengyongru	4efd904ccc	fix(webui): require token_issue_secret for LAN access with frontend auth When host is set to 0.0.0.0, the gateway now enforces that either token or token_issue_secret must be configured — it refuses to start otherwise. Bootstrap endpoint behavior: - token_issue_secret configured: always validate regardless of source IP (handles reverse-proxy scenarios where all connections appear as localhost) - No secret: only localhost can bootstrap (local dev mode) The frontend shows an authentication form when bootstrap returns 401/403, persists the secret in localStorage, and retries automatically on reload.	2026-05-06 23:51:51 +08:00
chengyongru	034bea1a44	fix(webui): require token_issue_secret for non-localhost bootstrap The previous LAN-access fix (PR #3656) relaxed the bootstrap localhost check when host was 0.0.0.0, but did not require any authentication — any device on the network could obtain a token without credentials. New behavior: - token_issue_secret configured: always validate, regardless of source IP (handles reverse-proxy scenarios where all connections appear as localhost). - No secret configured: only localhost can bootstrap (local dev mode). This supersedes the host-based check from PR #3656.	2026-05-06 23:51:51 +08:00
chengyongru	bad584cb0e	fix(webui): allow LAN access when host is 0.0.0.0 The webui bootstrap endpoint (/webui/bootstrap) rejected all non-localhost connections with HTTP 403, preventing the embedded webui from working when accessed from another device on the LAN — even when host was set to 0.0.0.0. Skip the localhost check when the server is explicitly bound to 0.0.0.0 or ::, since that signals intent to accept external connections.	2026-05-06 23:00:23 +08:00
Xubin Ren	790a03ec28	feat(webui): polish chat layout and titles Align the WebUI sidebar and chat chrome with the updated design, and generate WebUI session titles asynchronously without blocking turns. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-06 22:20:35 +08:00
Xubin Ren	d8fd4c80bf	Merge PR #3646: fix(transcription): retry Whisper calls on transient failures	2026-05-06 21:52:33 +08:00
chengyongru	40b4e01b13	merge: resolve conflict with main in transcription.py Keep _post_transcription_with_retry from PR branch, drop inline httpx calls that were replaced by the shared retry helper.	2026-05-06 21:26:28 +08:00
chengyongru	4fad19dc17	fix: use sequential MCP server connections to prevent CPU spin asyncio.create_task in connect_mcp_servers creates child tasks for each MCP server, but close_mcp calls stack.aclose() from the main task. anyio CancelScope requires enter/exit in the same task, so the cross-task exit raises RuntimeError which gets silently caught. The orphaned cancel scope keeps retrying via call_soon on every event loop tick, consuming 100% CPU. Fix: remove create_task/gather and connect servers sequentially in the caller task. MCP servers are typically 1-2, so parallel connection provides negligible benefit while introducing the cancel scope hazard. Closes #3638	2026-05-06 21:18:51 +08:00
Tim O'Brien	99209a806d	fix(tool_hints): pass max_length to abbreviate_path for is_path tools The is_path branch in _fmt_known was not passing max_length to abbreviate_path, so read_file, write_file, edit, list_dir, and web_fetch always truncated paths at 40 chars regardless of config. Now all three branches (is_path, is_command, fallback) honor the configured toolHintMaxLength.	2026-05-06 21:18:39 +08:00
Tim O'Brien	67875d7a15	fix: wire toolHintMaxLength through AgentLoop constructors The config field was added but never passed from config to AgentLoop. The value was always falling back to the default (40) regardless of what was set in config.json. Now passes tool_hint_max_length through all AgentLoop() call sites: - nanobot/nanobot.py (main bot) - nanobot/cli/commands.py (CLI agent, dev, webui commands) Also adds documentation in docs/configuration.md.	2026-05-06 21:18:39 +08:00
Tim O'Brien	daa4a25c9b	feat(config): add toolHintMaxLength to control tool hint truncation Add toolHintMaxLength to config (default: 40, range: 20-500). Controls how many characters of tool hints are shown in progress updates (e.g. '$ cd …/project && npm test'). Set to 120+ to see full commands instead of truncated hints: ```json { "agents": { "defaults": { "toolHintMaxLength": 120 } } } ``` - Thread max_length through format_tool_hints → _fmt_known/_fmt_mcp/_fmt_fallback - Make path abbreviation in _abbreviate_command proportional to max_length - Add TestToolHintMaxLength test class with 5 tests - All 41 existing tests pass	2026-05-06 21:18:39 +08:00
hanyuanling	653de4a7ef	fix(agent): gate provider progress deltas	2026-05-06 21:18:30 +08:00
chengyongru	05e0106592	refactor(logging): preserve tracebacks and add channel context - Preserve tracebacks: logger.error in except blocks → logger.exception - Channel context: BaseChannel injects self.logger = logger.bind(channel=name) - Third-party bridge: redirect_lib_logging() replaces ad-hoc stdlib-to-loguru bridges - Log levels: network timeouts downgraded from ERROR → WARNING - Fix --verbose flag to actually work with loguru (set handler to DEBUG)	2026-05-06 21:17:45 +08:00
chengyongru	3437ff273f	fix(transcription): address review nits on PR #3253 - Correct api_key type hint to str | None in _post_transcription_with_retry - Remove unreachable final return "" - Fix test_openai_missing_api_key_short_circuits to actually test missing-key path (use audio_file fixture so file exists) - Fix PermissionError patch for Windows (patch class method instead of instance attribute)	2026-05-06 15:52:29 +08:00
mohamed-elkholy95	7ebf611be8	fix(transcription): retry Whisper calls and guard malformed responses A single transient failure between the agent and an OpenAI/Groq Whisper endpoint currently vanishes as `return ""` in transcribe(). The voice message arrives as the empty string and there is no way to tell real silence apart from a failed upload. A malformed but successful response body is even worse: the JSON-decode error escapes the helper unhandled. Add a shared `_post_transcription_with_retry` used by both providers. Retry behaviour: - exponential backoff 1s -> 2s -> 4s, up to 3 retries (4 attempts) - retryable HTTP statuses: 408, 429, 500, 502, 503, 504 - retryable exceptions: TimeoutException, ConnectError, ReadError, WriteError, RemoteProtocolError Non-transient failures short-circuit to "" on the first attempt -- retrying a misconfigured key or a broken upload only burns rate-limit quota. Branches that short-circuit: - missing API key, missing audio file - file-read errors (PermissionError, OSError) on the audio path, preserving the nightly contract for direct provider callers - HTTP auth/4xx body issues via raise_for_status() - response.json() parse failures - non-dict JSON payloads Sharing one helper means OpenAI and Groq cannot drift apart silently. Thread `language` through the helper. The multipart files dict is rebuilt inside the per-attempt loop, so when a caller sets self.language the `language` field is sent on every attempt -- not just the first. Tests cover: - every advertised retryable status and exception, parameterized - language present on attempts 1 and 2 of a 503->200 sequence - language absent when unset; present when set (both providers) - malformed JSON body and non-dict JSON body short-circuit to "" - PermissionError on file read short-circuits with no HTTP attempt - max-attempts give-up, exponential-backoff schedule, auth no-retry, missing-key / missing-file short-circuit Test stub fix: the _StubResponse in tests/channels/test_channel_plugins.py declared no status_code, which the new helper reads for retry classification. Set status_code = 200 so the stub advertises the successful response that those tests already simulate. Also moved the two transcription-provider imports to the top of that file (previously placed mid-file) so the file is ruff-clean (E402).	2026-05-06 15:52:25 +08:00
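
The retry policy described in commit 7ebf611be8 can be sketched roughly as below. This is an illustrative sketch, not the code from that commit: `transcribe_with_retry`, `TransientError`, and the injected `send` / `sleep` callables are hypothetical names chosen so the example runs without httpx or a network. Only the status list, the 1s -> 2s -> 4s schedule, and the short-circuit-to-"" behaviour come from the commit message.

```python
from typing import Any, Callable

# Values taken from the commit message.
RETRYABLE_STATUSES = frozenset({408, 429, 500, 502, 503, 504})
MAX_RETRIES = 3       # 3 retries, so 4 attempts in total
BASE_DELAY_S = 1.0    # delays between attempts: 1s, 2s, 4s


class TransientError(Exception):
    """Hypothetical stand-in for timeout / connection-level errors."""


def transcribe_with_retry(
    send: Callable[[], tuple[int, Any]],
    sleep: Callable[[float], None],
) -> str:
    """Call send() until it succeeds, fails permanently, or attempts run out.

    send() is assumed to return (status_code, decoded_json_payload).
    """
    for attempt in range(MAX_RETRIES + 1):
        try:
            status, payload = send()
        except TransientError:
            status, payload = None, None  # treat as retryable

        if status is not None and 200 <= status < 300:
            # Guard against non-dict or malformed payloads.
            if isinstance(payload, dict):
                text = payload.get("text", "")
                return text if isinstance(text, str) else ""
            return ""

        if status is not None and status not in RETRYABLE_STATUSES:
            # Non-transient failure (e.g. auth error): do not retry.
            return ""

        if attempt < MAX_RETRIES:
            sleep(BASE_DELAY_S * 2 ** attempt)

    return ""  # all attempts exhausted
```

Passing `sleep` in as a parameter keeps the backoff schedule observable in tests without real delays; in an async codebase the same shape would presumably use an awaitable sleep instead.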

2456 Commits