nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-20 00:22:31 +00:00

Author	SHA1	Message	Date
Xubin Ren	13eede5803	refactor(agent): inject runtime model publisher Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	6554c1f832	refactor(agent): move preset helpers out of loop Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	e6103d9312	fix(agent): separate preset snapshots from config reload Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	8fcb24bb7c	refactor(agent): trim model preset runtime wiring Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	70b8daaee6	fix(command): show default as current model preset Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	c9b84c7b11	fix(config): reserve implicit default model preset Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	1d14c2ba40	fix(config): accept modelPresets root alias Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	bcc4b97183	fix(webui): broadcast runtime model updates Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	c92345bbb1	fix(webui): sync model badge after preset switch Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	b61c6304c3	fix(config): reconcile presets with settings reload Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	c450d6fd3f	fix(config): make model preset switching atomic Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
chengyongru	6f78267c82	feat(config): add ModelPresetConfig and runtime preset switching - Add `ModelPresetConfig` schema for named model presets - Add `model_presets` dict to `Config` and `model_preset` field to `AgentDefaults` - Add `resolve_preset()` to return effective model params from preset or defaults - Add `@model_validator` to reject unknown preset names - Update `_match_provider()` to use resolved preset model/provider - Update `make_provider()` and `provider_signature()` to use `resolve_preset()` - Add `model_preset` property to `AgentLoop` for atomic runtime switching - Update `AgentLoop.from_config()` to inject a runtime `default` preset - Wire self-tool to inspect/clear preset state - Update CLI display strings to show active preset	2026-05-12 20:06:22 +08:00
yorkhellen	03b357b12d	feat(feishu): add topic_isolation config switch	2026-05-12 11:51:25 +08:00
Albert Wang	dd4def25fa	fix(providers): set supports_max_completion_tokens for VolcEngine providers VolcEngine's OpenAI-compatible gateway rejects requests when both max_tokens and max_completion_tokens are present (the latter added by openai-python SDK v2.x serialization). Set the flag so nanobot sends max_completion_tokens instead of max_tokens for volcengine, volcengine_coding_plan, and by extension byteplus variants. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-12 11:35:52 +08:00
Xubin Ren	23312d683e	fix(tools): isolate plugin runtime state Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 11:28:20 +08:00
chengyongru	043f0e67f7	feat(tools): introduce plugin-based tool discovery and runtime context protocol This commit implements a progressive refactoring of the tool system to support plugin discovery, scoped loading, and protocol-driven runtime context injection. Key changes: - Add Tool ABC metadata (tool_name, _scopes) and ToolContext dataclass for dependency injection. - Introduce ToolLoader with pkgutil-based builtin discovery and entry_points-based third-party plugin loading. - Add scope filtering (core/subagent/memory) so different contexts load appropriate tool sets. - Introduce ContextAware protocol and RequestContext dataclass to replace hardcoded per-tool context injection in AgentLoop. - Add RuntimeState / MutableRuntimeState protocols to decouple MyTool from AgentLoop. - Migrate all built-in tools to declare scopes and implement create()/enabled() hooks. - Migrate MessageTool, SpawnTool, CronTool, and MyTool to ContextAware. - Refactor AgentLoop to use ToolLoader and protocol-driven context injection. - Refactor SubagentManager to use ToolLoader(scope="subagent") with per-run FileStates isolation. - Register all built-in tools via pyproject.toml entry_points. - Add comprehensive tests for loader scopes, entry_points, ContextAware, subagent tools, and runtime state sync.	2026-05-12 11:28:20 +08:00
04cb	bd0ba745dd	fix(wecom): preserve real filename from SDK when payload omits name (#3737 )	2026-05-12 10:27:32 +08:00
chengyongru	49f85f5c23	docs(schema,config): clarify reasoning_effort semantics for MiMo thinking mode - Update AgentDefaults.reasoning_effort comment to document "none" (disable) and None (preserve provider default). - Add configuration.md tip explaining MiMo thinking mode behavior.	2026-05-11 14:38:28 +08:00
Alfredo Arenas	c6b7a9524c	fix(providers): wire MiMo to thinking_type to allow disabling reasoning (#3585 ) The hosted Xiaomi MiMo API accepts {"thinking": {"type": "enabled"\|"disabled"}} to toggle reasoning, which is exactly the shape produced by the existing thinking_type style. The xiaomi_mimo ProviderSpec just needed to opt in. Before this fix, setting reasoning_effort="none" had no effect on MiMo because no thinking_style was configured, so the disable signal never reached the server. Default-on models (mimo-v2.5-pro and friends) kept reasoning regardless of user configuration. Source: https://platform.xiaomimimo.com/docs/en-US/api/chat/openai-api Co-authored with Claude Opus 4.7. Strategy and review via Claude Desktop, implementation via Claude Code.	2026-05-11 14:38:28 +08:00
Alfredo Arenas	271b674bf1	feat(cli): pass bot_name/bot_icon from config to StreamRenderer (#3650 ) Both StreamRenderer instantiations in the agent command (single-message mode and interactive mode) now read bot_name and bot_icon from config.agents.defaults and forward them to the renderer. This is the wiring step that makes the schema fields actually take effect at runtime. With safe defaults of "nanobot" and "🐈", existing users see no change.	2026-05-11 11:50:18 +08:00
Alfredo Arenas	86693f5422	feat(cli): make stream renderer use bot_name and bot_icon (#3650 ) Threads bot_name/bot_icon through ThinkingSpinner and StreamRenderer with safe defaults that preserve current behavior. - ThinkingSpinner uses bot_name in its status text - StreamRenderer header is "<icon> <name>" when icon is set, or just "<name>" when icon is empty - Removes the now-unused __logo__ import (the cat emoji is the default value of bot_icon, not a hardcoded constant)	2026-05-11 11:50:18 +08:00
Alfredo Arenas	fcf9d110dd	feat(schema): add bot_name and bot_icon to AgentDefaults (#3650 ) Two new fields with safe defaults that preserve current branding: - bot_name: str = "nanobot" - bot_icon: str = "🐈" Empty string for bot_icon is allowed and lets users opt out of the leading icon. camelCase keys (botName, botIcon) bind via the existing to_camel alias generator.	2026-05-11 11:50:18 +08:00
barreler126	046d0831ef	feat: add NVIDIA NIM provider support	2026-05-11 01:25:44 +08:00
chengyongru	a6e993df25	fix(agent): move archived summary into system prompt for KV cache stability - Append [Archived Context Summary] to system prompt instead of injecting it into the user message runtime context, improving KV cache reuse across turns and avoiding consecutive same-role messages. - _last_summary persists in metadata (no pop) for restart survival; summary is re-injected every turn via the stable system prompt. - Remove dynamic "Inactive for X minutes" from _format_summary — use static last_active timestamp instead to preserve KV cache stability. - Pass session_summary through build_messages() so both normal and ask_user paths receive the archived summary in the system prompt. - estimate_session_prompt_tokens now reads _last_summary from metadata to include the summary in token budget estimation. - Remove obsolete session_summary parameter from maybe_consolidate_by_tokens and estimate_session_prompt_tokens call sites in loop.py (summary flows through build_messages instead). - Ensure /new (session.clear()) clears _last_summary from metadata.	2026-05-11 01:25:15 +08:00
chengyongru	73a8d8a875	fix(utils): remove unreachable dead code in find_legal_message_start The for loop at line 168 never executes because start is assigned i + 1 immediately before slicing messages[start : i + 1], which is always an empty list. Remove the dead code. Fixes #3716	2026-05-09 18:53:13 +08:00
chengyongru	de13e72e15	refactor(loop): log turn completion with state count	2026-05-09 17:15:23 +08:00
chengyongru	728d837e4e	refactor(loop): add turn_id for trace correlation - TurnContext now carries a turn_id (session_key:time_ns) - All state transition debug logs include [turn_id] prefix - RuntimeError messages also include turn_id for observability	2026-05-09 17:15:23 +08:00
chengyongru	5327f5e1a0	refactor(loop): event-driven state transitions + trace logging - State handlers now return event strings ('ok', 'dispatch', 'shortcut') - Driver loop uses _TRANSITIONS lookup table: (state, event) -> next_state - State graph is centralized and visible at a glance - Added StateTraceEntry to record per-state timing and events - Driver loop logs state duration + event at debug level - Exception paths are traced with error field for observability	2026-05-09 17:15:23 +08:00
chengyongru	6ef1b2c842	refactor(loop): address code review nits - Fix _assemble_outbound on_stream type annotation (Callable[[str], Awaitable[None]] \| None) - Use last_msg consistently in _state_save instead of re-indexing - Remove dead fallback in _state_respond (guaranteed non-None by _state_save) - Change pending_summary type from Any to str \| None - Make session optional in TurnContext to avoid redundant fetch - Add defensive dispatch with RuntimeError for missing handlers	2026-05-09 17:15:23 +08:00
chengyongru	8a6b769219	refactor(loop): fix line length in state handlers	2026-05-09 17:15:23 +08:00
chengyongru	02443ca208	refactor(loop): convert _process_message to functional state machine - Extract TurnState enum and TurnContext dataclass - Extract state handlers: _state_restore, _state_compact, _state_command, _state_build, _state_run, _state_save, _state_respond - Extract _process_system_message for system message short-circuit - Driver loop uses getattr dispatch over explicit state transitions - Preserve all existing behavior (794 tests passing)	2026-05-09 17:15:23 +08:00
chengyongru	9fb9f53147	refactor(loop): add TurnState and TurnContext	2026-05-09 17:15:23 +08:00
chengyongru	88cf8db164	refactor(loop): extract _assemble_outbound	2026-05-09 17:15:23 +08:00
chengyongru	0124c94d19	refactor(loop): extract _build_initial_messages	2026-05-09 17:15:23 +08:00
chengyongru	ce52070fcf	refactor(loop): extract _persist_user_message_early	2026-05-09 17:15:23 +08:00
chengyongru	d2cb8ac17f	refactor(loop): extract _build_retry_wait_callback	2026-05-09 17:15:23 +08:00
chengyongru	b2fb776a68	refactor(loop): extract _build_bus_progress_callback	2026-05-09 17:15:23 +08:00
chengyongru	ff6b014a07	refactor: allow model/context_window_tokens override in from_config() - Pop model and context_window_tokens from extra kwargs before forwarding to __init__, allowing callers like _run_gateway to pass snapshot-derived values instead of config defaults - _run_gateway now explicitly passes model/context_window_tokens from provider_snapshot to preserve pre-refactor behavior	2026-05-09 15:30:48 +08:00
chengyongru	733b34d685	refactor: address code review feedback on AgentLoop.from_config() - Accept optional `provider` kwarg in from_config() to avoid double instantiation in _run_gateway (which already builds provider_snapshot) - Restore try/except ValueError wrappers in serve() and agent() for clean error messages on provider creation failure - Update test: _FakeAgentLoop captures provider from kwargs, restore strong assertion (seen["provider"] is provider)	2026-05-09 15:30:48 +08:00
chengyongru	3202f58c41	refactor: introduce AgentLoop.from_config() to centralize loop assembly Extract duplicated bus/provider/loop initialization from CLI commands (serve, _run_gateway, agent) and Nanobot facade into a single AgentLoop.from_config() classmethod. - Remove _make_provider() from cli/commands.py and nanobot.py - Remove inline provider creation in all three CLI entry points - AgentLoop.from_config() creates MessageBus, calls make_provider(), and assembles AgentLoop with all standard config-derived parameters - Supports **extra overrides for callers that need custom args (e.g. cron_service, session_manager, provider_snapshot_loader) - Update tests to mock make_provider at nanobot.providers.factory and add from_config classmethod to _FakeAgentLoop fixtures This is PR 1/4 of the model-preset feature decomposition.	2026-05-09 15:30:48 +08:00
Xubin Ren	9252f4d826	Revert "fix(agent): persist _last_summary across restarts with used sentinel" This reverts commit e5a1416a37b423de95b0fa279e9473110a678112.	2026-05-09 15:00:54 +08:00
chengyongru	e5a1416a37	fix(agent): persist _last_summary across restarts with used sentinel The previous implementation popped _last_summary from session.metadata after injecting it into the prompt, then saved the session. This caused the summary to be permanently lost after a process restart, making the AI forget archived context and appear to ignore memory or reference non-existent previous messages. Replace the destructive pop with a _last_summary_used sentinel: - _last_summary stays in metadata for restart survival - _last_summary_used prevents duplicate injection within the same turn - Clear the sentinel whenever a new summary is generated Updates tests to match the new persistence behavior.	2026-05-09 14:58:38 +08:00
Xubin Ren	56eee06736	feat(webui): add BYOK web search settings Let WebUI users configure the single web search provider credential from BYOK while keeping saved secrets masked and hot-reloaded for new searches. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-09 14:52:48 +08:00
Eugene Chae	4d7bf5bb8a	fix(cli): handle retry-wait messages in interactive mode	2026-05-09 13:50:39 +08:00
Xubin Ren	3231aaf9ee	fix(image): prevent duplicate delivery and replay artifacts	2026-05-09 05:45:13 +00:00
Vilius Vystartas	4d168c571c	fix: replace raise with logger.error + return fail in exception handlers The previous version changed return fail/pass to raise, which broke graceful degradation — tests expect upload/content failures to be caught and handled, not propagated. Now logs errors with exc_info=True while preserving existing control flow (return fail for upload/content send, stop typing for stream).	2026-05-09 01:04:20 +08:00
Vilius Vystartas	31c45fe798	fix: raise instead of swallowing on outbound-message path errors Per reviewer request (chengyongru): raise exceptions on the outbound message path so ChannelManager can trigger retry logic, matching the pattern from commit 98c2f7cc (Weixin channel cleanup). Changes: - _resolve_server_upload_limit_bytes: warning → error (non-fatal config) - _upload_and_send_attachment media upload: raise instead of swallow - _upload_and_send_attachment room send: raise instead of swallow - send_delta stream edit: error + raise after cleanup - weixin _load_state: warning → error (non-fatal state load)	2026-05-09 01:04:20 +08:00
Vilius Vystartas	ba1e5036f5	fix: log errors in silent exception handlers (matrix + weixin channels) The Matrix channel had 4 bare except blocks that silently swallowed transport errors with no logging — stream send/edit failures, media upload failures, server config fetch failures, and room content send failures. The Weixin channel had 1 silent state-load failure. This mirrors commit 98c2f7cc ('fix(weixin): raise exceptions instead of silently dropping messages') for the Matrix channel and adds a warning for the remaining silent catch in Weixin's _load_state. All failures now log at warning level with exc_info=True so operators can diagnose intermittent Matrix/Weixin transport issues.	2026-05-09 01:04:20 +08:00
yorkhellen	843e96f09d	fix(feishu): send all messages to topic when in thread	2026-05-09 01:03:57 +08:00
chengyongru	908f1246d8	fix(cli): sanitize surrogate code points before entering message bus On Windows, prompt_toolkit produces lone surrogate code points (e.g. 🐈) for emoji input. These propagate through the message bus and crash at json.dumps() / file write time because surrogates cannot be encoded as UTF-8. Extract _sanitize_surrogates() that round-trips through UTF-16 to reconstruct paired surrogates into real characters (e.g. 🐈 → 🐈), replacing unpaired surrogates with U+FFFD. Apply it at the CLI input path and reuse in SafeFileHistory.	2026-05-09 01:03:34 +08:00

1 2 3 4 5 ...

1636 Commits