nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-19 16:12:30 +00:00

Author	SHA1	Message	Date
Xubin Ren	5d7f3f2751	fix(webui): stabilize live thread rendering and navigation	2026-05-13 16:39:07 +00:00
Xubin Ren	fb508a302a	feat(webui): refresh session titles from live updates	2026-05-13 13:10:21 +00:00
Xubin Ren	9d50f1b933	feat: polish trace delivery and slash menu UX Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 08:47:34 +00:00
Xubin Ren	321c565ec4	fix(webui): normalize thinking trace row box model Thinking and Used tools are both auxiliary rows, but Thinking still carried an internal mb-2 even when it was standalone. That made collapsed Thinking rows visually taller than tool trace rows despite the shared thread spacing. Only add the extra bottom margin when a Thinking bubble has answer content below it in the same assistant message. Standalone Thinking rows now share the same outer box model as Used tools. Tests lock both standalone and answer-backed cases. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 08:12:44 +00:00
Xubin Ren	82ba63e148	fix(webui): compact spacing between auxiliary trace rows Thinking and Used tools are both auxiliary trace rows, but the thread list was applying the same large gap used between full chat turns. That made alternating Thinking / Used tools sequences look uneven and too airy. Move row spacing from a fixed flex gap to per-row margins: full chat turns keep mt-5, while consecutive auxiliary rows use mt-2. Add coverage for Thinking -> Used tools -> Thinking spacing. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 08:05:34 +00:00
Xubin Ren	c7ec5d3b75	fix(webui): align thinking and tool trace affordances Tool trace groups are supporting details, so default them to collapsed. Match the Thinking bubble's expanded body to the tool trace affordance by using the same grouped header and animated fade/slide body treatment. Update MessageBubble tests to assert tool traces start collapsed and expand on click. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:58:24 +00:00
Xubin Ren	521aaa5ecf	fix(webui): split reasoning at tool trace boundaries Live rendering merged reasoning chunks by scanning backward to the latest assistant row. That fixed late reasoning, but the scan skipped trace rows, so reasoning after a tool call crossed the Used tools block and attached to the previous assistant iteration. Refresh looked correct because persisted history reconstructs assistant/tool boundaries. Treat trace rows as hard phase boundaries, just like user messages. A reasoning_delta after Used tools now starts a fresh assistant placeholder, so live rendering matches replay: Thinking -> Used tools -> Thinking -> Used tools / answer. Add a regression for reasoning_delta -> reasoning_end -> tool_hint -> reasoning_delta. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:49:44 +00:00
Xubin Ren	278affc25e	fix(webui): hydrate reasoning and tool traces from history Live reasoning/tool frames were rendering correctly, but refreshing WebUI replayed only role/content/media from `/api/sessions/:key/messages`. Assistant `reasoning_content` / `thinking_blocks` and `tool_calls` were already persisted by the backend and returned by the history endpoint, but useSessionHistory discarded them. Hydrate persisted assistant reasoning into `UIMessage.reasoning` and reconstruct assistant tool calls as `kind: "trace"` rows so the replayed thread keeps the same Thinking bubble and Used tools block as the live stream. Tool result rows remain hidden from the conversation view to avoid replaying raw tool output as chat text. Adds regression coverage for both persisted reasoning and historical tool call trace hydration. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:33:52 +00:00
Xubin Ren	0033a8a185	fix(webui): keep reasoning scoped to the current user turn The post-hoc reasoning fix allowed late reasoning frames to attach back to the nearest assistant message, but the scan crossed a newer user message. That made the next turn's Thinking bubble render above the previous assistant reply. Treat the latest user message as a hard boundary: reasoning after it must start a new assistant placeholder and can no longer attach to earlier assistant turns. Add a regression covering previous assistant -> new user -> reasoning_delta. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:28:54 +00:00
Xubin Ren	9829cf66d2	fix(webui): keep late reasoning attached above the answer Some providers only surface structured `reasoning_content` after answer text has already streamed. The WebUI was treating those late `reasoning_delta` frames as a fresh assistant placeholder, so the Thinking bubble rendered below the already-visible answer. Attach late reasoning back to the active assistant turn instead. The bubble still renders above the message content, preserving the expected Thinking -> answer order even when the provider protocol delivers the reasoning post-hoc. Added a regression test for answer-first followed by reasoning_delta/reasoning_end. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:20:36 +00:00
Xubin Ren	458b4ba235	feat(reasoning): stream reasoning content as a first-class channel Reasoning now flows as its own stream — symmetric to the answer's ``delta`` / ``stream_end`` pair — instead of being shipped as one oversized progress message. This lets WebUI render a live "Thinking…" bubble that updates in place, then auto-collapses when the stream closes. Other channels remain plugin no-ops by default. ## Protocol New metadata: ``_reasoning_delta`` (chunk) and ``_reasoning_end`` (close marker). ChannelManager routes both to the dedicated plugin hooks below; the legacy one-shot ``_reasoning`` is kept for back-compat and BaseChannel expands it into a single delta + end pair so plugins only ever implement the streaming primitives. WebSocket emits two new events: - ``reasoning_delta`` (event, chat_id, text, optional stream_id) - ``reasoning_end`` (event, chat_id, optional stream_id) ## BaseChannel surface - ``send_reasoning_delta(chat_id, delta, metadata)`` — no-op default - ``send_reasoning_end(chat_id, metadata)`` — no-op default - ``send_reasoning(msg)`` — back-compat wrapper, base impl forwards to the streaming primitives A channel adds reasoning support by overriding the two streaming primitives. Telegram / Slack / Discord / Feishu / WeChat / Matrix keep the base no-ops until their bubble UIs are adapted; reasoning silently drops at dispatch, never as a stray text message. ## AgentHook Adds ``emit_reasoning_end`` to the hook lifecycle. ``_LoopHook`` tracks whether a reasoning segment is open and closes it on: - the first answer delta arriving (so the UI locks the bubble before the answer renders below), - ``on_stream_end``, - one-shot ``reasoning_content`` / ``thinking_blocks`` after a single non-streaming response. ## WebUI - ``UIMessage.reasoning`` is now a single accumulated string with a companion ``reasoningStreaming`` flag. - ``useNanobotStream`` consumes ``reasoning_delta`` / ``reasoning_end``; legacy ``kind: "reasoning"`` is auto-translated to a delta + end. - New ``ReasoningBubble``: shimmer header + auto-expanded while streaming, collapses to a clickable "Thinking" pill once closed, respects ``prefers-reduced-motion``. - Answer deltas adopt the reasoning placeholder so the bubble and the answer share one assistant row. ## Tests - ``tests/channels/test_channel_manager_reasoning.py`` — manager routes delta + end, drops on channel opt-out, expands one-shot back-compat. - ``tests/channels/test_websocket_channel.py`` — new ``reasoning_delta`` / ``reasoning_end`` frames, empty-chunk safety, no-subscriber safety, back-compat expansion. - ``tests/agent/test_runner_reasoning.py`` — runner closes the segment on streaming answer start and after one-shot reasoning. - WebUI ``useNanobotStream`` + ``message-bubble`` cover the new protocol and the shimmer styling. ## Docs ``docs/configuration.md`` and ``docs/websocket.md`` document the new events and the plugin contract. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:13:43 +00:00
Xubin Ren	a6b059d379	refactor(reasoning): make channel plugins own reasoning rendering Reasoning was being shipped to every channel as a generic progress message with a `_reasoning: true` flag. Two problems with that: 1. Channels without a low-emphasis UI primitive (Telegram, Slack, Discord, Feishu...) would dump raw model thoughts as ordinary replies, polluting the conversation. 2. The agent loop double-gated by inspecting `channels_config`, which coupled the loop to display policy. Treat reasoning as its own plugin action — `BaseChannel.send_reasoning` defaults to a documented no-op; channels that have a fitting affordance override. ChannelManager routes `_reasoning` outbounds to that method only when the channel opts in via `show_reasoning` (camelCase alias `showReasoning` mirrors `sendProgress`). Plugins that don't override silently drop reasoning — "no fit, no leak" is the contract. Reference implementation lands for WebSocket / WebUI: a new `kind: "reasoning"` frame, parked on the active assistant bubble as a collapsible `Thinking` group above the answer. CLI keeps its existing direct path (it doesn't go through the bus). `ChannelsConfig.show_reasoning` flips to `true` by default — only adapted channels surface anything, others stay quiet. Loop net diff is -3 lines: the `channels_config.show_reasoning` check moves out, leaving emit_reasoning a one-liner that publishes and trusts the channel to decide. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 06:27:53 +00:00
彭星杰	00597fccd6	fix(webui): default to new chat on load and preserve scroll on settings return - Remove auto-selection of the most recent session on initial load, so the app opens to a blank new-chat page instead of the last session. - Preserve active session state when navigating to/from settings: keep ThreadShell mounted (hidden via CSS) so scroll position, message cache, and streaming state are not lost. - Update onBackToChat to return to blank page when no session was active instead of falling back to the most recent session. - Update related test expectations to match the new navigation behavior.	2026-05-12 23:13:11 +08:00
chengyongru	9e15925cf4	refactor(agent): remove ask_user tool The ask_user tool used AskUserInterrupt(BaseException) for mid-turn blocking, creating heavy coupling across runner, loop, and session management. The model now asks questions naturally in response text, the turn ends normally, and the user's next message starts a new turn with session history providing continuity. Removed: - nanobot/agent/tools/ask.py (tool, interrupt, helpers) - tests/agent/test_ask_user.py - webui/src/components/thread/AskUserPrompt.tsx - AskUserInterrupt handling in runner.py - Dual-path message building in loop.py - Pending ask detection via history scanning - button_prompt/buttons emission in WebSocket channel - ask_user references in Slack channel docstrings Preserved (MessageTool uses these independently): - OutboundMessage.buttons field - Channel button rendering (Telegram, Slack, WebSocket)	2026-05-12 22:48:26 +08:00
Xubin Ren	bcc4b97183	fix(webui): broadcast runtime model updates Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	c92345bbb1	fix(webui): sync model badge after preset switch Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
Xubin Ren	6d07aa6059	test(webui): cover randomUUID entry shim fallback Add a focused regression test for the non-secure-context WebUI entry shim so missing crypto.randomUUID no longer depends on manual verification. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-11 15:39:05 +08:00
NearlCrews	5ea2c37325	fix(webui): shim crypto.randomUUID for non-secure contexts `crypto.randomUUID` only exists in secure contexts (HTTPS or localhost). Over LAN HTTP it is undefined, so `ChatPane`'s welcome-message flush and streaming-message handlers crash mid-render with `TypeError`, unmounting the React tree and leaving the user a blank page. Install a Math.random-backed v4-ish fallback at app entry, gated on the feature being missing. This mirrors the shim already used in the test setup and covers all six call sites (`ChatPane.tsx`, `useNanobotStream.ts`) without touching them. These IDs are client-side message keys with no security role, so non-cryptographic randomness is fine. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-11 15:39:05 +08:00
Xubin Ren	56eee06736	feat(webui): add BYOK web search settings Let WebUI users configure the single web search provider credential from BYOK while keeping saved secrets masked and hot-reloaded for new searches. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-09 14:52:48 +08:00
Xubin Ren	bbdf1db30d	fix(webui): render generated images as rounded previews Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 23:48:01 +08:00
Xubin Ren	151c3d5ad0	fix(webui): restore chat selection after settings Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 23:48:01 +08:00
Xubin Ren	2cc32ca07c	feat(webui): redesign settings and BYOK configuration Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 23:48:01 +08:00
Xubin Ren	451d740849	fix(webui): polish delete dialog and sidebar toggles Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 13:28:34 +00:00
Xubin Ren	e936ed48bd	feat: add image generation tool and WebUI mode Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 20:06:23 +08:00
Xubin Ren	ac18a8baad	feat(webui): add localized slash commands Add a session-scoped slash command palette sourced from backend command metadata, and keep welcome-page quick actions localized across all WebUI languages. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-07 00:20:28 +08:00
chengyongru	4efd904ccc	fix(webui): require token_issue_secret for LAN access with frontend auth When host is set to 0.0.0.0, the gateway now enforces that either token or token_issue_secret must be configured — it refuses to start otherwise. Bootstrap endpoint behavior: - token_issue_secret configured: always validate regardless of source IP (handles reverse-proxy scenarios where all connections appear as localhost) - No secret: only localhost can bootstrap (local dev mode) The frontend shows an authentication form when bootstrap returns 401/403, persists the secret in localStorage, and retries automatically on reload.	2026-05-06 23:51:51 +08:00
chengyongru	034bea1a44	fix(webui): require token_issue_secret for non-localhost bootstrap The previous LAN-access fix (PR #3656) relaxed the bootstrap localhost check when host was 0.0.0.0, but did not require any authentication — any device on the network could obtain a token without credentials. New behavior: - token_issue_secret configured: always validate, regardless of source IP (handles reverse-proxy scenarios where all connections appear as localhost). - No secret configured: only localhost can bootstrap (local dev mode). This supersedes the host-based check from PR #3656.	2026-05-06 23:51:51 +08:00
chengyongru	bad584cb0e	fix(webui): allow LAN access when host is 0.0.0.0 The webui bootstrap endpoint (/webui/bootstrap) rejected all non-localhost connections with HTTP 403, preventing the embedded webui from working when accessed from another device on the LAN — even when host was set to 0.0.0.0. Skip the localhost check when the server is explicitly bound to 0.0.0.0 or ::, since that signals intent to accept external connections.	2026-05-06 23:00:23 +08:00
Xubin Ren	790a03ec28	feat(webui): polish chat layout and titles Align the WebUI sidebar and chat chrome with the updated design, and generate WebUI session titles asynchronously without blocking turns. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-06 22:20:35 +08:00
Xubin Ren	7faa339902	fix(webui): keep existing package lockfile Restore the npm lockfile that is already present on main so this PR only carries the WebUI turn-completion changes. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-03 22:28:40 +08:00
Xubin Ren	96da6d8190	fix(webui): tighten turn completion handling Keep the new turn-end signal scoped to WebSocket clients, preserve pending tool-call state across trailing tool result rows, and drop the accidental npm lockfile from the Bun-based WebUI. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-03 22:28:40 +08:00
ramonpaolo	be83525f99	test(webui): cover turn-end streaming regressions	2026-05-03 22:28:40 +08:00
ramonpaolo	08744ce408	fix(webui): isolate thread cache during chat switches	2026-05-03 22:28:40 +08:00
ramonpaolo	76e3f74df7	feat(webui): improve beta turn completion and streaming UX	2026-05-03 22:28:40 +08:00
Xubin Ren	b440e76d2f	feat(webui): add model settings runtime refresh	2026-04-25 18:05:06 +00:00
Xubin Ren	a58d9fd357	feat(webui): render ask_user choices Made-with: Cursor	2026-04-25 15:46:47 +00:00
Xubin Ren	e52fe2a8e2	feat(webui): render video media attachments Add signed media URLs to live WebSocket replies and teach the WebUI to classify and render video attachments, so bot-sent videos can play inline in both live chats and session history. Made-with: Cursor	2026-04-25 03:20:40 +08:00
Xubin Ren	185a8fd34d	fix(webui): opaque composer, equal-width message area, cleaner user pill	2026-04-23 07:48:32 +00:00
Xubin Ren	e3bca929fb	fix(webui): left-align prose inside user message pill	2026-04-23 00:07:27 +08:00
Xubin Ren	e493eb09e7	test(webui): realign thread-composer attach test with current types	2026-04-23 00:07:27 +08:00
Xubin Ren	61a28c2c0a	feat(webui): support image uploads in composer and message bubbles	2026-04-23 00:07:27 +08:00
Xubin Ren	558aa98491	chore: temporary keep WebUI source-only	2026-04-21 14:33:44 +00:00
Xubin Ren	1b692debdc	docs(webui): revise README to clarify WebSocket channel setup and sequence of startup steps	2026-04-21 12:46:17 +00:00
chengyongru	8eddacf2f8	fix(webui): sync code block theme with dark mode toggle instantly - Replace one-time DOM read with MutationObserver on <html> class - Remove hardcoded #0a0a0a background, let oneDark/oneLight own it - Add light-mode header/copy-button colors (bg-zinc-100 for light) - Bump font size from 13px to 14px, line-height from 1.55 to 1.6 - Add subtle border to distinguish code block edges	2026-04-20 00:21:07 +08:00
chengyongru	a3adec08a9	style(webui): improve typography with Apple-inspired font stack and CJK support - Add explicit CJK fonts (PingFang SC, Noto Sans SC, Microsoft YaHei) and programmer fonts (JetBrains Mono, Fira Code, Cascadia Code) to Tailwind config - Bump prose base size from prose-sm (14px) to prose-lg (18px) for sharper CJK rendering - Unify user/assistant message font size at 18px with CJK-aware line-height (1.8) - Replace pure black/white foreground with Apple-style warm grays (#1d1d1f / #f5f5f7) - Override Tailwind Typography colors to use design tokens for consistency - Add negative letter-spacing on headings for tighter, more polished look	2026-04-20 00:21:07 +08:00
Xubin Ren	b3049f7323	fix(webui): stabilize empty session history state	2026-04-19 13:38:47 +00:00
Xubin Ren	f9e1d92abd	docs: update README and webui documentation for WebUI development workflow	2026-04-19 13:10:36 +00:00
Xubin Ren	4650b23d75	feat(webui): add i18n support and locale switcher	2026-04-19 06:39:06 +00:00
Xubin Ren	9ed3031a42	feat(webui): add initial webui with websocket chat flow	2026-04-18 18:51:53 +00:00

49 Commits