nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-19 16:12:30 +00:00

Author	SHA1	Message	Date
chengyongru	6a4ed255de	fix(mcp): probe HTTP port before connecting to prevent event-loop crash When an MCP server configured as streamableHttp or SSE is unreachable, streamable_http_client's anyio task group cleanup raises RuntimeError / ExceptionGroup that escapes the caller's try/except and crashes the event loop with "Unhandled exception in event loop". Fix: add a lightweight TCP probe (_probe_http_url) before entering the MCP SDK transport. If the port is closed, the server is skipped with a warning instead of crashing. stdio transport is not probed (local process). Closes #3739	2026-05-13 23:39:07 +08:00
Xubin Ren	921fe259f4	Merge PR #3756 : feat(runner): model failover with fallback_models feat(runner): model failover with fallback_models	2026-05-13 23:38:14 +08:00
Xubin Ren	5efd67919b	feat(runner): support fallback candidates Resolve fallbackModels as preset references or explicit inline provider configs so failover uses complete model settings without exposing fallback logic to the agent loop. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 15:34:03 +00:00
Xubin Ren	43db848db0	Revert "feat(runner): support structured fallback models" This reverts commit 02b059a616dc6dc82ad15282102c7b27a5a34e40.	2026-05-13 14:11:08 +00:00
Xubin Ren	02b059a616	feat(runner): support structured fallback models Bind fallback model chains to the active model configuration so defaults and presets do not inherit or merge fallback behavior implicitly. Require explicit fallback providers while preserving per-fallback generation overrides and context-window safety. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 13:57:30 +00:00
Xubin Ren	eaa8ebd5d3	Merge remote-tracking branch 'origin/main' into pr-3756	2026-05-13 13:12:56 +00:00
Xubin Ren	fb508a302a	feat(webui): refresh session titles from live updates	2026-05-13 13:10:21 +00:00
chengyongru	913b0774d8	feat(runner): add model failover with fallback_models When the primary model returns a non-transient error and no content has been streamed yet, the runner now tries each model listed in the active preset's fallback_models in order. Each fallback model may reside on a different provider — a temporary provider instance is created on-the-fly via make_provider(config, model=...). Key design: - Failover is request-scoped (does not affect subagents/dream/consolidator) - Provider is restored via try/finally after each fallback attempt - Skipped when content was already streamed to avoid duplicate output - Recursive failover prevented by clearing fallback_models on fallback spec - Circuit breaker trips open after 3 consecutive primary failures (60s cooldown) - Cross-provider routing: fallback model prefix (e.g. groq/) determines provider Fixes: cross-provider fallback was broken because the factory passed the original preset (with provider forced to primary's provider) when creating fallback providers. Now uses provider="auto" so the model string prefix correctly routes to the right provider. Also fixes: log messages now distinguish between primary-failed, previous-fallback-failed, and circuit-open scenarios. closes: https://github.com/HKUDS/nanobot/issues/3376	2026-05-13 17:30:49 +08:00
Xubin Ren	79e528119c	Merge PR #3655 : feat(reason): display model reasoning content during streaming feat(reason): display model reasoning content during streaming	2026-05-13 17:19:30 +08:00
Xubin Ren	567e95dee6	fix(cli): stop spinner before resumed answer deltas Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 09:18:59 +00:00
Xubin Ren	53831e1611	fix(cli): clear thinking spinner before trace output Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 09:15:53 +00:00
Xubin Ren	3fab736262	fix(cli): keep trace output under assistant header Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 09:13:16 +00:00
Xubin Ren	9d50f1b933	feat: polish trace delivery and slash menu UX Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 08:47:34 +00:00
Xubin Ren	321c565ec4	fix(webui): normalize thinking trace row box model Thinking and Used tools are both auxiliary rows, but Thinking still carried an internal mb-2 even when it was standalone. That made collapsed Thinking rows visually taller than tool trace rows despite the shared thread spacing. Only add the extra bottom margin when a Thinking bubble has answer content below it in the same assistant message. Standalone Thinking rows now share the same outer box model as Used tools. Tests lock both standalone and answer-backed cases. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 08:12:44 +00:00
Xubin Ren	82ba63e148	fix(webui): compact spacing between auxiliary trace rows Thinking and Used tools are both auxiliary trace rows, but the thread list was applying the same large gap used between full chat turns. That made alternating Thinking / Used tools sequences look uneven and too airy. Move row spacing from a fixed flex gap to per-row margins: full chat turns keep mt-5, while consecutive auxiliary rows use mt-2. Add coverage for Thinking -> Used tools -> Thinking spacing. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 08:05:34 +00:00
Xubin Ren	c7ec5d3b75	fix(webui): align thinking and tool trace affordances Tool trace groups are supporting details, so default them to collapsed. Match the Thinking bubble's expanded body to the tool trace affordance by using the same grouped header and animated fade/slide body treatment. Update MessageBubble tests to assert tool traces start collapsed and expand on click. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:58:24 +00:00
Xubin Ren	521aaa5ecf	fix(webui): split reasoning at tool trace boundaries Live rendering merged reasoning chunks by scanning backward to the latest assistant row. That fixed late reasoning, but the scan skipped trace rows, so reasoning after a tool call crossed the Used tools block and attached to the previous assistant iteration. Refresh looked correct because persisted history reconstructs assistant/tool boundaries. Treat trace rows as hard phase boundaries, just like user messages. A reasoning_delta after Used tools now starts a fresh assistant placeholder, so live rendering matches replay: Thinking -> Used tools -> Thinking -> Used tools / answer. Add a regression for reasoning_delta -> reasoning_end -> tool_hint -> reasoning_delta. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:49:44 +00:00
Xubin Ren	278affc25e	fix(webui): hydrate reasoning and tool traces from history Live reasoning/tool frames were rendering correctly, but refreshing WebUI replayed only role/content/media from `/api/sessions/:key/messages`. Assistant `reasoning_content` / `thinking_blocks` and `tool_calls` were already persisted by the backend and returned by the history endpoint, but useSessionHistory discarded them. Hydrate persisted assistant reasoning into `UIMessage.reasoning` and reconstruct assistant tool calls as `kind: "trace"` rows so the replayed thread keeps the same Thinking bubble and Used tools block as the live stream. Tool result rows remain hidden from the conversation view to avoid replaying raw tool output as chat text. Adds regression coverage for both persisted reasoning and historical tool call trace hydration. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:33:52 +00:00
Xubin Ren	0033a8a185	fix(webui): keep reasoning scoped to the current user turn The post-hoc reasoning fix allowed late reasoning frames to attach back to the nearest assistant message, but the scan crossed a newer user message. That made the next turn's Thinking bubble render above the previous assistant reply. Treat the latest user message as a hard boundary: reasoning after it must start a new assistant placeholder and can no longer attach to earlier assistant turns. Add a regression covering previous assistant -> new user -> reasoning_delta. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:28:54 +00:00
Xubin Ren	9829cf66d2	fix(webui): keep late reasoning attached above the answer Some providers only surface structured `reasoning_content` after answer text has already streamed. The WebUI was treating those late `reasoning_delta` frames as a fresh assistant placeholder, so the Thinking bubble rendered below the already-visible answer. Attach late reasoning back to the active assistant turn instead. The bubble still renders above the message content, preserving the expected Thinking -> answer order even when the provider protocol delivers the reasoning post-hoc. Added a regression test for answer-first followed by reasoning_delta/reasoning_end. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:20:36 +00:00
Xubin Ren	458b4ba235	feat(reasoning): stream reasoning content as a first-class channel Reasoning now flows as its own stream — symmetric to the answer's ``delta`` / ``stream_end`` pair — instead of being shipped as one oversized progress message. This lets WebUI render a live "Thinking…" bubble that updates in place, then auto-collapses when the stream closes. Other channels remain plugin no-ops by default. ## Protocol New metadata: ``_reasoning_delta`` (chunk) and ``_reasoning_end`` (close marker). ChannelManager routes both to the dedicated plugin hooks below; the legacy one-shot ``_reasoning`` is kept for back-compat and BaseChannel expands it into a single delta + end pair so plugins only ever implement the streaming primitives. WebSocket emits two new events: - ``reasoning_delta`` (event, chat_id, text, optional stream_id) - ``reasoning_end`` (event, chat_id, optional stream_id) ## BaseChannel surface - ``send_reasoning_delta(chat_id, delta, metadata)`` — no-op default - ``send_reasoning_end(chat_id, metadata)`` — no-op default - ``send_reasoning(msg)`` — back-compat wrapper, base impl forwards to the streaming primitives A channel adds reasoning support by overriding the two streaming primitives. Telegram / Slack / Discord / Feishu / WeChat / Matrix keep the base no-ops until their bubble UIs are adapted; reasoning silently drops at dispatch, never as a stray text message. ## AgentHook Adds ``emit_reasoning_end`` to the hook lifecycle. ``_LoopHook`` tracks whether a reasoning segment is open and closes it on: - the first answer delta arriving (so the UI locks the bubble before the answer renders below), - ``on_stream_end``, - one-shot ``reasoning_content`` / ``thinking_blocks`` after a single non-streaming response. ## WebUI - ``UIMessage.reasoning`` is now a single accumulated string with a companion ``reasoningStreaming`` flag. - ``useNanobotStream`` consumes ``reasoning_delta`` / ``reasoning_end``; legacy ``kind: "reasoning"`` is auto-translated to a delta + end. - New ``ReasoningBubble``: shimmer header + auto-expanded while streaming, collapses to a clickable "Thinking" pill once closed, respects ``prefers-reduced-motion``. - Answer deltas adopt the reasoning placeholder so the bubble and the answer share one assistant row. ## Tests - ``tests/channels/test_channel_manager_reasoning.py`` — manager routes delta + end, drops on channel opt-out, expands one-shot back-compat. - ``tests/channels/test_websocket_channel.py`` — new ``reasoning_delta`` / ``reasoning_end`` frames, empty-chunk safety, no-subscriber safety, back-compat expansion. - ``tests/agent/test_runner_reasoning.py`` — runner closes the segment on streaming answer start and after one-shot reasoning. - WebUI ``useNanobotStream`` + ``message-bubble`` cover the new protocol and the shimmer styling. ## Docs ``docs/configuration.md`` and ``docs/websocket.md`` document the new events and the plugin contract. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:13:43 +00:00
Xubin Ren	a6b059d379	refactor(reasoning): make channel plugins own reasoning rendering Reasoning was being shipped to every channel as a generic progress message with a `_reasoning: true` flag. Two problems with that: 1. Channels without a low-emphasis UI primitive (Telegram, Slack, Discord, Feishu...) would dump raw model thoughts as ordinary replies, polluting the conversation. 2. The agent loop double-gated by inspecting `channels_config`, which coupled the loop to display policy. Treat reasoning as its own plugin action — `BaseChannel.send_reasoning` defaults to a documented no-op; channels that have a fitting affordance override. ChannelManager routes `_reasoning` outbounds to that method only when the channel opts in via `show_reasoning` (camelCase alias `showReasoning` mirrors `sendProgress`). Plugins that don't override silently drop reasoning — "no fit, no leak" is the contract. Reference implementation lands for WebSocket / WebUI: a new `kind: "reasoning"` frame, parked on the active assistant bubble as a collapsible `Thinking` group above the answer. CLI keeps its existing direct path (it doesn't go through the bus). `ChannelsConfig.show_reasoning` flips to `true` by default — only adapted channels surface anything, others stay quiet. Loop net diff is -3 lines: the `channels_config.show_reasoning` check moves out, leaving emit_reasoning a one-liner that publishes and trusts the channel to decide. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 06:27:53 +00:00
Xubin Ren	01fa362c03	Merge origin/main into feat/show-reasoning Resolves conflicts after main landed the state-machine turn refactor and the test_runner.py 9-file split: - nanobot/agent/loop.py: take main's `_state_build`/`_persist_user_message_early` flow; restore the `reasoning: bool` parameter on `_build_bus_progress_callback` so the loop hook can mark progress as reasoning-channel without coupling to the answer stream. - nanobot/cli/stream.py: keep main's configurable `bot_name`/`bot_icon` header while preserving the PR's `transient=True` Live + `self._console` routing + `_renderable()` final-render path that fixed TUI duplication. - tests/agent/test_runner.py was deleted on main and split into 9 focused files; relocated all 6 reasoning tests into a new `test_runner_reasoning.py` matching the new layout, deduplicated the per-test `ReasoningHook` boilerplate through a shared `_RecordingHook` helper. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 05:07:14 +00:00
chengyongru	99cc6ee808	test(agent): expand coverage and refactor test structure - Add 42 tests for ContextBuilder (context.py: 0→42 tests) - Add 37 tests for SubagentManager lifecycle (subagent.py: 2→37 tests) - Add 42 unit tests for AutoCompact in isolation - Split monolithic test_runner.py (3313 lines) into 9 focused files: test_runner_core, test_runner_hooks, test_runner_errors, test_runner_safety, test_runner_persistence, test_runner_governance, test_runner_tool_execution, test_runner_injections, test_loop_runner_integration - Add 3 config passthrough tests (temperature/max_tokens/reasoning_effort) - Fix fragile patch.object(__init__) in test_stop_preserves_context - Create shared conftest.py with make_provider/make_loop factories Total: 934 tests passing, 0 regressions	2026-05-13 12:49:17 +08:00
Xubin Ren	352aaf0627	refactor(reasoning): unify reasoning extraction across providers Reasoning surfacing was split across three branches in runner.py plus two separate streaming buffers (loop hook and runner progress stream), with three independent display-side gates in the CLI. This collapsed the policy into one source of truth and fixed two real bugs: - Structured `reasoning_content` was suppressed whenever the answer was streamed, because the runner gated emission on `streamed_content`. Providers don't stream `reasoning_content`; it only arrives on the final response, so the answer stream and the reasoning channel are independent. Added `streamed_reasoning` to `AgentHookContext` to track the right bit. - `channels.showReasoning` was subordinated to `sendProgress`. They are orthogonal — turning off progress streaming shouldn't silence reasoning. Reworked the CLI gates accordingly. Single-helper consolidation: - `extract_reasoning(reasoning_content, thinking_blocks, content)` returns `(reasoning_text, cleaned_content)` with a defined fallback order: dedicated field → Anthropic thinking_blocks → inline `<think>`/`<thought>` tags. Models that expose none of these short-circuit to `(None, content)` — zero overhead. - `IncrementalThinkExtractor` replaces the ad-hoc `emit_incremental_think` function and its hand-rolled "emitted cursor" state in both the loop hook and the runner progress stream. Also documented the new `showReasoning` channel option in docs/configuration.md and noted its independence from sendProgress. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 17:14:19 +00:00
彭星杰	00597fccd6	fix(webui): default to new chat on load and preserve scroll on settings return - Remove auto-selection of the most recent session on initial load, so the app opens to a blank new-chat page instead of the last session. - Preserve active session state when navigating to/from settings: keep ThreadShell mounted (hidden via CSS) so scroll position, message cache, and streaming state are not lost. - Update onBackToChat to return to blank page when no session was active instead of falling back to the most recent session. - Update related test expectations to match the new navigation behavior.	2026-05-12 23:13:11 +08:00
Flinn Xie	3a851f8f8d	feat(reasoning): add inline think tag extraction and Anthropic thinking_blocks support Add extract_think() and emit_incremental_think() helpers to extract thinking content from inline <think> and <thought> tags in the content field. This handles models served via Ollama, self-hosted vLLM, or other compatible endpoints that embed reasoning as inline tags instead of using the dedicated reasoning_content API field. Also adds Anthropic thinking_blocks support for extended thinking via the thinking content blocks array. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-05-12 23:02:59 +08:00
chengyongru	9e15925cf4	refactor(agent): remove ask_user tool The ask_user tool used AskUserInterrupt(BaseException) for mid-turn blocking, creating heavy coupling across runner, loop, and session management. The model now asks questions naturally in response text, the turn ends normally, and the user's next message starts a new turn with session history providing continuity. Removed: - nanobot/agent/tools/ask.py (tool, interrupt, helpers) - tests/agent/test_ask_user.py - webui/src/components/thread/AskUserPrompt.tsx - AskUserInterrupt handling in runner.py - Dual-path message building in loop.py - Pending ask detection via history scanning - button_prompt/buttons emission in WebSocket channel - ask_user references in Slack channel docstrings Preserved (MessageTool uses these independently): - OutboundMessage.buttons field - Channel button rendering (Telegram, Slack, WebSocket)	2026-05-12 22:48:26 +08:00
Xubin Ren	07f9ab580a	fix(provider): preserve Bedrock tool config for history Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:59:01 +08:00
chengyongru	ef268f47d2	chore: remove dead code identified by vulture + coverage cross-validation Remove unused code confirmed dead via vulture scan, grep verification, and coverage analysis: - _get_bridge_dir (cli/commands.py): 82-line function with zero callers - add_assistant_message (agent/context.py): method body never executed, also removed now-unused build_assistant_message import - _tool_parameters_schema (agent/tools/base.py): redundant copy of schema already exposed via the `parameters` property - MSTEAMS_REF_TTL_S (channels/msteams.py): unused constant (production uses config.ref_ttl_days directly); inlined in test - MESSAGE_TYPE_USER (channels/weixin.py): unused constant	2026-05-12 20:52:48 +08:00
Xubin Ren	35f64cd828	docs(config): document model presets Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	079b37aac5	test(config): cover legacy model defaults without presets Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	13eede5803	refactor(agent): inject runtime model publisher Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	6554c1f832	refactor(agent): move preset helpers out of loop Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	e6103d9312	fix(agent): separate preset snapshots from config reload Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	8fcb24bb7c	refactor(agent): trim model preset runtime wiring Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	70b8daaee6	fix(command): show default as current model preset Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	c9b84c7b11	fix(config): reserve implicit default model preset Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	1d14c2ba40	fix(config): accept modelPresets root alias Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	bcc4b97183	fix(webui): broadcast runtime model updates Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	c92345bbb1	fix(webui): sync model badge after preset switch Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	b61c6304c3	fix(config): reconcile presets with settings reload Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	c450d6fd3f	fix(config): make model preset switching atomic Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
chengyongru	6f78267c82	feat(config): add ModelPresetConfig and runtime preset switching - Add `ModelPresetConfig` schema for named model presets - Add `model_presets` dict to `Config` and `model_preset` field to `AgentDefaults` - Add `resolve_preset()` to return effective model params from preset or defaults - Add `@model_validator` to reject unknown preset names - Update `_match_provider()` to use resolved preset model/provider - Update `make_provider()` and `provider_signature()` to use `resolve_preset()` - Add `model_preset` property to `AgentLoop` for atomic runtime switching - Update `AgentLoop.from_config()` to inject a runtime `default` preset - Wire self-tool to inspect/clear preset state - Update CLI display strings to show active preset	2026-05-12 20:06:22 +08:00
Xubin Ren	1175420339	test(feishu): cover topic isolation alias Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 11:51:25 +08:00
yorkhellen	a32be99ddc	test(feishu): add config and helper tests for topic_isolation	2026-05-12 11:51:25 +08:00
yorkhellen	03b357b12d	feat(feishu): add topic_isolation config switch	2026-05-12 11:51:25 +08:00
Xubin Ren	fd6887c274	test(providers): cover VolcEngine token parameter Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 11:35:52 +08:00
Albert Wang	dd4def25fa	fix(providers): set supports_max_completion_tokens for VolcEngine providers VolcEngine's OpenAI-compatible gateway rejects requests when both max_tokens and max_completion_tokens are present (the latter added by openai-python SDK v2.x serialization). Set the flag so nanobot sends max_completion_tokens instead of max_tokens for volcengine, volcengine_coding_plan, and by extension byteplus variants. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-12 11:35:52 +08:00
Xubin Ren	23312d683e	fix(tools): isolate plugin runtime state Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 11:28:20 +08:00

1 2 3 4 5 ...

2486 Commits