nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-20 08:32:25 +00:00

Author	SHA1	Message	Date
Xubin Ren	1c2ea1aad2	feat(goal): /goal command & long-running tasks (long_task) * feat(long-task): add LongTaskTool for multi-step agent tasks Implements a meta-ReAct loop where long-running tasks are broken into sequential subagent steps, each starting fresh with the original goal and progress from the previous step. This prevents context drift when agents work on complex, multi-step tasks. - Extract build_tool_registry() from SubagentManager for reuse - Add run_step() for synchronous subagent execution (no bus announcement) - Add HandoffTool and CompleteTool as signal mechanisms via shared dict - Add LongTaskTool orchestrator with simplified prompt (8 iterations/step) - Register LongTaskTool in main agent loop - Add _extract_handoff_from_messages fallback for robustness * fix(long-task): add debug logging for step-level observability * feat(long-task): major overhaul with structured handoffs, validation, and observability - Structured HandoffState: HandoffTool now accepts files_created, files_modified, next_step_hint, and verification fields instead of a plain string. Progress is passed between steps as structured data. - Completion validation round: After complete() is called, a dedicated validator step runs to verify the claim against the original goal. If validation fails, the task continues rather than returning a false completion. - Dynamic prompt system: 3 Jinja2 templates (step_start, step_middle, step_final) selected based on step number. Final steps get tighter budget and stronger "wrap up" guidance. - Automatic file change tracking: Extracts write_file/edit_file events from tool_events and injects them into the next step's context if the subagent forgot to report them explicitly. - Budget tracking & adaptive strategy: Cumulative token usage is tracked across steps. Per-step tool budget drops from 8 to 4 in the last two steps to force handoff/completion. - Crash retry with graceful degradation: A step that crashes is retried once. 
Persistent crashes terminate the task and return partial progress. - Full observability hooks for future WebUI integration: - set_hooks() with on_step_start, on_step_complete, on_handoff, on_validation_started, on_validation_passed, on_validation_failed, on_task_complete, on_task_error, and catch-all on_event. - Readable state properties: current_step, total_steps, status, last_handoff, cumulative_usage, goal. - inject_correction() allows external code to send user corrections that are injected into the next step's prompt. - run_step() accepts optional max_iterations for dynamic budget control. All 27 long-task tests and 11 subagent tests pass. * test(long-task): add boundary tests and fix race conditions - Add 7 edge-case tests: validation crash resilience, hook exception safety, mid-run correction injection, FIFO correction ordering, explicit file changes overriding auto-detection, final budget for max_steps=1, and dynamic budget switching boundaries - Fix assertion in test_long_task_completes_after_multiple_handoffs to match exact prompt format - Remove asyncio timing hack from test_state_exposure - Add asyncio.sleep(0) yield in test_inject_correction_during_execution to prevent race between signal injection and step continuation - All 34 tests passing * fix(long-task): address code review findings - Declare _scopes = {"core"} explicitly to prevent recursive nesting in subagent scope - Document fragile coupling in _extract_file_changes: path extraction depends on write_file/edit_file detail format; add debug log for unexpected formats - Align final-template threshold (max_steps - 2) with budget switch threshold - Eliminate hasattr(self, "_state") in _reset_state by initializing in __init__ * fix(long-task): honor final signal and file tracking Co-authored-by: Cursor <cursoragent@cursor.com> * feat(long-task): improve prompt structure and agent contract - Expand LongTaskTool.description to instruct parent agent on goal construction, return value semantics, and 
how to handle results. - Expand CompleteTool.description to emphasize that the summary IS the final answer returned to the parent agent. - Prefix validated return value with an explicit "final answer" directive to stop parent agent from re-running work. - Redesign step_start.md: Step 1 is now explicitly for exploration, planning, and skeleton-building. complete() is discouraged. - Remove bulky payload debug logging from _emit(); add targeted info/warning/error logs at key state transitions instead. - Add signal_type to HandoffState for cleaner signal detection. * test(long-task): expect wrapped completion message after validation Align assertions with LongTaskTool final return shape on main. Co-authored-by: Cursor <cursoragent@cursor.com> * feat(webui): turn timing strip, latency, and session-switch restore - Agent loop: publish goal_status run/idle for WebSocket turns; attach wall-clock latency_ms on turn_end and persisted assistant metadata. - WebSocket channel: forward goal_status and latency fields to clients. - NanobotClient: track goal_status started_at per chat without requiring onChat; useNanobotStream restores run strip when returning to a chat. - Thread UI: composer/shell viewport hooks for run duration and latency; format helpers and i18n strings. - MessageBubble: drop trailing StreamCursor (layout artifact vs block markdown). - Builtin / tests: model command coverage, websocket and loop tests. Covers multi-session UX and round-trip timing visibility for the WebUI. Co-authored-by: Cursor <cursoragent@cursor.com> * fix: keep message-tool file attachments after canonical history hydrate - MessageTool records per-turn media paths delivered to the active chat. - nanobot.utils.session_attachments stages out-of-media-root files and merges into the last assistant message before save (loop stays a thin call). - WebUI MediaCell: use a signed URL as a real download link when present. 
Fixes attachments flashing then vanishing on turn_end when paths lived outside get_media_dir (e.g. workspace files). Co-authored-by: Cursor <cursoragent@cursor.com> * feat(webui): agent activity cluster, stable keys, LTR sheen labels - Group reasoning and tool traces in AgentActivityCluster with i18n summaries - Stabilize React list keys for activity clusters (first message id anchor) - Replace background-clip shimmer with overlay sheen for streaming labels - ThreadMessages/MessageList integration and locale strings Co-authored-by: Cursor <cursoragent@cursor.com> * fix(webui): render assistant reasoning with Markdown + deferred stream - Use MarkdownText for ReasoningBubble body (same GFM/KaTeX path as replies) - Apply muted/italic prose tokens so thinking stays visually subordinate - useDeferredValue while reasoningStreaming to ease parser work during deltas - Preload markdown chunk when trace opens; add regression test with preloaded renderer Co-authored-by: Cursor <cursoragent@cursor.com> * fix(webui): default-collapse agent activity cluster while Working Outer fold no longer auto-expands during isTurnStreaming; user opens to see traces. Header sheen and live summary unchanged. Co-authored-by: Cursor <cursoragent@cursor.com> * feat(long_task): cumulative run history, file union, and prompt tuning Inject cross-step summaries and merged file paths into middle/final step templates so chains do not lose early context. Strip the last run-history block when it duplicates Previous Progress to save tokens. Add optional cumulative_prompt_max_chars and cumulative_step_body_max_chars parameters with clamped defaults. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(webui): session switch keeps in-flight thread and replays buffered WS Save the prior chat message list to the per-chat cache in a layout effect when chatId changes (before stale writes could corrupt another chat). Skip one post-switch layout cache tick so we do not snapshot the wrong tab. 
Buffer inbound events per chat_id when no onChat subscriber is registered (e.g. user focused another session) and drain on resubscribe up to a cap, so streaming deltas are not lost while off-tab. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(webui): snap thread scroll to bottom on session open (no smooth glide) Use scroll-behavior auto on the viewport, instant programmatic scroll when following new messages and on scrollToBottomSignal. Keep smooth only for the explicit scroll-to-bottom button. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(webui): respect manual scroll-up after opening a session Track when the user leaves the bottom with a ref and skip ResizeObserver and deferred bottom snaps until they return or the conversation is reset. Remove the time-based force-bottom window that overrode atBottom. Multi-frame scrollToBottom honours the same guard unless force (scroll button). Co-authored-by: Cursor <cursoragent@cursor.com> * Publish long_task UI snapshots on outbound metadata - Add OUTBOUND_META_AGENT_UI (_agent_ui) for channel-agnostic structured state - LongTaskTool publishes {kind: long_task, data: snapshot} on the bus with _progress - WebSocket send forwards metadata as agent_ui for WebUI clients - Tests for bus payload, WS frame, and progress assertions - Fix loop progress tests: ignore _goal_status in streaming final filter and avoid brittle outbound[-1] ordering after goal status idle messages Co-authored-by: Cursor <cursoragent@cursor.com> * feat: WebUI long_task activity card and resilient history merge Add optional ui_summary to the long_task tool for one-line UI labels. Stream long_task agent_ui into a dedicated message row with timeline, markdown peek, and a right sheet for details. Merge canonical history after turn_end while re-inserting long_task rows before the final assistant reply. Collapse duplicate task_start/step_start steps in the timeline and extend i18n. 
Co-authored-by: Cursor <cursoragent@cursor.com> * refactor: align long_task with thread_goal and drop orchestrator UI - Persist sustained objectives via session metadata (long_task / complete_goal); no subagent wiring or tool-driven agent_ui payloads. - Remove WebUI long-task activity UI, types, and translations; history merge preserves trace replay only, with legacy long_task rows normalized to traces. - Drop long_task prompt templates and get_long_task_run_dir; add webui thread disk helper for gateway persistence tests. Co-authored-by: Cursor <cursoragent@cursor.com> * feat(agent): thread goal runtime context, tools, and skill - Add thread_goal_state helper and mirror active objectives into Runtime Context - Wire loop/context/memory/events as needed for goal metadata in turns - Expand long_task / complete_goal semantics (pivot/cancel/honest recap) - Add always-on thread-goal SKILL.md; align /goal command prompt - Tests for context builder and thread goal state - Remove unused webui ChatPane component Co-authored-by: Cursor <cursoragent@cursor.com> * feat(thread-goal): add websocket snapshot helper and publish goal updates from long_task Introduce thread_goal_ws_blob for bounded JSON snapshots, attach snapshots to websocket turn_end metadata in AgentLoop, and let long_task fan-out dedicated thread_goal frames on the websocket channel after persisting session metadata. Co-authored-by: Cursor <cursoragent@cursor.com> * feat(channels): websocket thread_goal frames, turn_end replay, and session API scrub for subagent inject Emit thread_goal events and optional thread_goal on turn_end; scrub persisted subagent announce blobs on GET /api/sessions/.../messages and shorten session list previews so WebUI does not surface full Task/Summarize scaffolding. 
Co-authored-by: Cursor <cursoragent@cursor.com> * feat(webui): merge ephemeral traces per user turn when reconciling canonical history Preserve disk/live trace rows inside the matching user–assistant segment instead of stacking every trace before the final assistant reply (fixes inflated tool counts after refresh or session switch). Co-authored-by: Cursor <cursoragent@cursor.com> * feat(webui): show assistant reply copy only on the last slice before the next user turn Avoid duplicate copy affordances on intermediate assistant bubbles that precede more agent activity in the same turn (tools or further assistant text). Co-authored-by: Cursor <cursoragent@cursor.com> * feat(webui): thread_goal stream plumbing, composer goal strip, sky glow, and client-side subagent scrub projection Track thread_goal and turn_goal snapshots in NanobotClient, hydrate React state from thread_goal frames and turn_end, surface objective/elapsed in the composer, add breathing sky halo CSS while goals are active, mirror server scrub logic on history hydration and webui_thread snapshots, and extend tests/client mocks. Co-authored-by: Cursor <cursoragent@cursor.com> * feat(channels): add Slack Socket Mode connect timeout with actionable timeout errors Abort hung websockets.connect handshakes after a bounded wait, log REST-vs-WSS guidance, surface RuntimeError to channel startup, and log successful WSS setup. Co-authored-by: Cursor <cursoragent@cursor.com> * webui: expand thread goal in composer bottom sheet Add ChevronUp control on the run/goal strip that opens a bottom Sheet with full ui_summary and objective. Inline preview logic in RunElapsedStrip, add i18n strings across locales, and a composer unit test. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(webui): widen dedupeToolCallsForUi input for session API typing fetchSessionMessages types tool_calls as unknown; accept unknown so tsc build passes when passing message.tool_calls through. 
Co-authored-by: Cursor <cursoragent@cursor.com> * refactor(agent): extract WebSocket turn run status to webui_turn_helpers * refactor(skills): rename thread-goal to long-task and document idempotent goals * feat(skills): rename sustained-goal skill to long-goal and tighten long_task guidance * chore: remove unused subagent/context/router helpers * feat(session): rename sustained goal to goal_state and align WS/WebUI - Move helpers from agent/thread_goal_state to session/goal_state: GOAL_STATE_KEY, goal_state_runtime_lines, goal_state_ws_blob, parse_goal_state. - Session metadata now uses "goal_state"; still read legacy "thread_goal"; long_task writes drop the legacy key after save. - WebSocket: event/field goal_state, _goal_state_sync; turn_end carries goal_state; accept legacy _thread_goal_sync/thread_goal inbound metadata for dispatch. - WebUI: GoalStateWsPayload, goalState hook/client props, i18n keys goalState. - Runtime Context copy uses "Goal (active):" instead of "Thread goal". feat(agent): stream Anthropic thinking deltas and fix stream idle timeout * refactor(webui): transcript jsonl as sole timeline source * fix(agent): reject mismatched WS message chat_id and stream reasoning deltas * feat(webui): hydrate sustained goal and run timer after websocket subscribe * chore(webui,websocket): remove unused fetch helpers and legacy thread_goal WS paths * Raise default max_tokens and context window in agent schema. Align AgentDefaults and ModelPresetConfig with typical Claude-scale usage (32k completion budget, 256k context window) and update migration tests. Co-authored-by: Cursor <cursoragent@cursor.com> * feat(gateway): bootstrap prefers in-memory model; clarify websocket naming * fix(websocket): websocket _handle_message passes is_dm; refresh /status test expectations --------- Co-authored-by: chengyongru <2755839590@qq.com> Co-authored-by: chengyongru <chengyongru.ai@gmail.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-16 01:14:11 +08:00
Xubin Ren	567e95dee6	fix(cli): stop spinner before resumed answer deltas Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 09:18:59 +00:00
Xubin Ren	53831e1611	fix(cli): clear thinking spinner before trace output Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 09:15:53 +00:00
Xubin Ren	3fab736262	fix(cli): keep trace output under assistant header Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 09:13:16 +00:00
Xubin Ren	01fa362c03	Merge origin/main into feat/show-reasoning Resolves conflicts after main landed the state-machine turn refactor and the test_runner.py 9-file split: - nanobot/agent/loop.py: take main's `_state_build`/`_persist_user_message_early` flow; restore the `reasoning: bool` parameter on `_build_bus_progress_callback` so the loop hook can mark progress as reasoning-channel without coupling to the answer stream. - nanobot/cli/stream.py: keep main's configurable `bot_name`/`bot_icon` header while preserving the PR's `transient=True` Live + `self._console` routing + `_renderable()` final-render path that fixed TUI duplication. - tests/agent/test_runner.py was deleted on main and split into 9 focused files; relocated all 6 reasoning tests into a new `test_runner_reasoning.py` matching the new layout, deduplicated the per-test `ReasoningHook` boilerplate through a shared `_RecordingHook` helper. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 05:07:14 +00:00
Xubin Ren	352aaf0627	refactor(reasoning): unify reasoning extraction across providers Reasoning surfacing was split across three branches in runner.py plus two separate streaming buffers (loop hook and runner progress stream), with three independent display-side gates in the CLI. This collapsed the policy into one source of truth and fixed two real bugs: - Structured `reasoning_content` was suppressed whenever the answer was streamed, because the runner gated emission on `streamed_content`. Providers don't stream `reasoning_content`; it only arrives on the final response, so the answer stream and the reasoning channel are independent. Added `streamed_reasoning` to `AgentHookContext` to track the right bit. - `channels.showReasoning` was subordinated to `sendProgress`. They are orthogonal — turning off progress streaming shouldn't silence reasoning. Reworked the CLI gates accordingly. Single-helper consolidation: - `extract_reasoning(reasoning_content, thinking_blocks, content)` returns `(reasoning_text, cleaned_content)` with a defined fallback order: dedicated field → Anthropic thinking_blocks → inline `<think>`/`<thought>` tags. Models that expose none of these short-circuit to `(None, content)` — zero overhead. - `IncrementalThinkExtractor` replaces the ad-hoc `emit_incremental_think` function and its hand-rolled "emitted cursor" state in both the loop hook and the runner progress stream. Also documented the new `showReasoning` channel option in docs/configuration.md and noted its independence from sendProgress. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 17:14:19 +00:00
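The fallback order described in the commit above (dedicated field, then Anthropic thinking blocks, then inline tags) can be sketched as follows. This is a minimal illustration, not the project's code: the function name `extract_reasoning_sketch` is hypothetical, and the shape of `thinking_blocks` (a list of dicts with a `"thinking"` key) is an assumption.

```python
import re

# Matches inline <think>...</think> or <thought>...</thought> spans.
_THINK_RE = re.compile(r"<(think|thought)>(.*?)</\1>", re.DOTALL)


def extract_reasoning_sketch(reasoning_content, thinking_blocks, content):
    """Return (reasoning_text, cleaned_content) using a fixed fallback order."""
    # 1. A dedicated reasoning field wins; the answer text is left untouched.
    if reasoning_content:
        return reasoning_content, content

    # 2. Anthropic-style thinking blocks (assumed shape: [{"thinking": "..."}]).
    if thinking_blocks:
        parts = [block.get("thinking", "") for block in thinking_blocks]
        text = "\n".join(part for part in parts if part)
        if text:
            return text, content

    # 3. Inline tags: pull them out and strip them from the visible answer.
    if content:
        found = [m.group(2).strip() for m in _THINK_RE.finditer(content)]
        if found:
            return "\n".join(found), _THINK_RE.sub("", content).strip()

    # Nothing to extract: short-circuit and return the content unchanged.
    return None, content
```

Returning the cleaned content alongside the reasoning text lets a caller route the two to separate display channels without parsing the answer a second time.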
Alfredo Arenas	dfb013659a	test(cli): add tests for configurable bot identity (#3650) Six tests covering: - AgentDefaults preserves 'nanobot' and the cat icon by default - camelCase config keys (botName/botIcon) bind to the new fields - Empty bot_icon is accepted (opt-out of the leading icon) - ThinkingSpinner uses bot_name in its status text - StreamRenderer header combines icon and name when icon is set - StreamRenderer header is just the name when icon is empty	2026-05-11 11:50:18 +08:00
Flinn Xie	3a27af0018	feat(cli): display model reasoning content during streaming Add show_reasoning config (default: False) to display model thinking/reasoning content in the TUI during streaming. Reasoning is emitted via a new emit_reasoning hook on AgentHook, gated by the channels config. Display uses ✻ prefix with dim italic styling. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 01:02:49 +08:00
Flinn Xie	d630ac90d1	fix(cli): prevent TUI content duplication via transient Live and renderer routing Route progress output through the Live's render hook to fix cursor misalignment that caused content duplication. The root cause was that progress/reasoning output used a separate Console instance, bypassing Rich Live's process_renderables hook. Also fixes pre-existing issue where multiple headers printed per agent turn. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 01:02:49 +08:00
chengyongru	733b34d685	refactor: address code review feedback on AgentLoop.from_config() - Accept optional `provider` kwarg in from_config() to avoid double instantiation in _run_gateway (which already builds provider_snapshot) - Restore try/except ValueError wrappers in serve() and agent() for clean error messages on provider creation failure - Update test: _FakeAgentLoop captures provider from kwargs, restore strong assertion (seen["provider"] is provider)	2026-05-09 15:30:48 +08:00
chengyongru	3202f58c41	refactor: introduce AgentLoop.from_config() to centralize loop assembly Extract duplicated bus/provider/loop initialization from CLI commands (serve, _run_gateway, agent) and Nanobot facade into a single AgentLoop.from_config() classmethod. - Remove _make_provider() from cli/commands.py and nanobot.py - Remove inline provider creation in all three CLI entry points - AgentLoop.from_config() creates MessageBus, calls make_provider(), and assembles AgentLoop with all standard config-derived parameters - Supports **extra overrides for callers that need custom args (e.g. cron_service, session_manager, provider_snapshot_loader) - Update tests to mock make_provider at nanobot.providers.factory and add from_config classmethod to _FakeAgentLoop fixtures This is PR 1/4 of the model-preset feature decomposition.	2026-05-09 15:30:48 +08:00
chengyongru	908f1246d8	fix(cli): sanitize surrogate code points before entering message bus On Windows, prompt_toolkit produces lone surrogate code points (e.g. `\ud83d\udc08`) for emoji input. These propagate through the message bus and crash at json.dumps() / file write time because surrogates cannot be encoded as UTF-8. Extract _sanitize_surrogates() that round-trips through UTF-16 to reconstruct paired surrogates into real characters (e.g. `\ud83d\udc08` → 🐈), replacing unpaired surrogates with U+FFFD. Apply it at the CLI input path and reuse in SafeFileHistory.	2026-05-09 01:03:34 +08:00
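The UTF-16 round trip described in the commit above can be sketched with Python's standard codecs. The function name below is hypothetical and this is only an illustration of the technique, not the project's `_sanitize_surrogates()`.

```python
def sanitize_surrogates_sketch(text: str) -> str:
    """Rejoin paired surrogates and replace unpaired ones with U+FFFD."""
    # "surrogatepass" lets lone surrogate code points through the encoder
    # as raw UTF-16 code units instead of raising UnicodeEncodeError.
    raw = text.encode("utf-16-le", errors="surrogatepass")
    # Decoding joins valid high/low pairs into one character; any
    # surrogate without a partner is replaced by U+FFFD.
    return raw.decode("utf-16-le", errors="replace")


if __name__ == "__main__":
    cleaned = sanitize_surrogates_sketch("\ud83d\udc08")
    # The result can now be encoded as UTF-8 without error.
    print(cleaned.encode("utf-8"))
```

Text that contains no surrogates passes through unchanged, so the helper is safe to apply to every input line.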
chengyongru	3ceabdecd5	feat(cli): support github-copilot in provider logout Logout previously claimed to support github-copilot in --help text but had no registered handler, so `provider logout github-copilot` failed with "Logout not implemented". Add the handler, sharing token deletion with the codex flow via `_delete_oauth_files`. Tighten handler-table types, fix the codex test fixture filename, and cover github-copilot plus the unknown provider path.	2026-05-04 12:10:06 +08:00
mikaku9944	387988b8e9	feat(cli): add provider logout command - Implement `nanobot provider logout <provider>` to clear OAuth credentials. - Add `_LOGOUT_HANDLERS` registration mechanism mirroring login. - Implement logout for `openai-codex` by deleting local `oauth-cli-kit` token and lock files. - Fall back gracefully when attempting to log out from providers lacking local credentials or implementations. - Fixes #2665	2026-05-04 12:10:06 +08:00
Xubin Ren	66682eb46f	test(cli): cover retry-wait interactive routing Keep provider retry wait messages on the interactive progress path so they do not fall through as assistant responses. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-03 22:59:08 +08:00
Xubin Ren	861fbb0dde	fix(provider): correct LongCat OpenAI base URL Use the SDK-ready /v1 base so LongCat chat completions hit the documented endpoint. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-02 01:52:04 +08:00
moranfong	051037ff08	feat(provider): add LongCat via OpenAI-compatible backend	2026-05-02 01:52:04 +08:00
Xubin Ren	2b886ffd1f	fix(command): expose history in chat command menus Made-with: Cursor	2026-04-27 18:23:35 +08:00
Xubin Ren	8ed10ac7df	test(command): keep history tests lint-clean Made-with: Cursor	2026-04-27 18:23:35 +08:00
Leo fu	599e25dfbf	feat(command): add /history command to show recent session messages Adds /history [n] to display the last N user/assistant messages from the current session (default 10, max 50). - Tool and system messages are filtered out for readability - Long messages are truncated to 200 characters with an ellipsis - Multimodal content (image blocks) is collapsed to its text parts - Invalid count argument returns a usage hint - /history n uses prefix routing; /history uses exact routing Also registers /history in build_help_text().	2026-04-27 18:23:35 +08:00
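The filtering and truncation rules listed in the commit above can be sketched as one function. The function name, the message dict shape (`role` / `content`, with multimodal content as a list of typed blocks), and the clamping of out-of-range counts are assumptions for illustration; the commit itself says an invalid count returns a usage hint.

```python
def recent_history_sketch(messages, count=10, max_count=50, max_chars=200):
    """Format the last `count` user/assistant messages for display."""
    # Assumption: clamp the requested count into [1, max_count].
    count = max(1, min(count, max_count))

    # Drop tool and system messages for readability.
    visible = [m for m in messages if m.get("role") in ("user", "assistant")]

    lines = []
    for message in visible[-count:]:
        content = message.get("content") or ""
        if isinstance(content, list):
            # Collapse multimodal content to its text parts only.
            content = " ".join(
                block.get("text", "")
                for block in content
                if isinstance(block, dict) and block.get("type") == "text"
            )
        if len(content) > max_chars:
            # Truncate long messages and mark the cut with an ellipsis.
            content = content[:max_chars] + "…"
        lines.append(f"{message['role']}: {content}")
    return lines
```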
Xubin Ren	1ef41052da	fix(cron): rephrase fire-time prompt so agent delivers a natural reminder The old prompt framed cron firing as a "task triggered" status report, which led the agent to reply with things like "Done ✅ 已提醒 U0AV8BJPV8D 喝水" — exposing the user id and reading like a system log instead of a friendly reminder. Reword it to instruct the agent to speak directly to the user and forbid status-style language. Made-with: Cursor	2026-04-27 12:45:00 +08:00
Xubin Ren	f670da6c70	refactor(providers): move provider snapshot creation into factory	2026-04-26 14:05:13 +00:00
Xubin Ren	65b0ae81af	Merge origin/main into webui-settings Made-with: Cursor	2026-04-26 13:05:32 +00:00
Xubin Ren	799db33517	fix(heartbeat): record proactive deliveries in channel sessions Route heartbeat, cron, and message-tool deliveries through one gateway helper so user-visible proactive messages are available when the channel replies. Made-with: Cursor	2026-04-26 20:08:21 +08:00
Peixian Gong	dd26b4407d	fix(providers): make GitHub Copilot backend work with GPT-5/o-series models Calling GitHub Copilot with `gpt-5.` / `o` models (e.g. `github_copilot/gpt-5.4`, `github_copilot/gpt-5.4-mini`) failed with a chain of misleading errors: 1. `Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead.` 2. `model "gpt-5.4-mini" is not accessible via the /chat/completions endpoint` (`unsupported_api_for_model`). 3. `The requested model is not supported.` (`model_not_supported`) even after routing to /responses. Root causes (each one masked the next): * The `github_copilot` ProviderSpec did not opt into `supports_max_completion_tokens`, so `_build_kwargs` always sent the legacy `max_tokens` parameter that GPT-5/o-series reject. * `_should_use_responses_api` was hard-gated to `spec.name == "openai"` plus a direct-OpenAI base URL, so the GitHub Copilot backend always went through /chat/completions even for models the Copilot gateway exposes only via /responses (e.g. `gpt-5.4-mini`). * When /responses did fail on github_copilot, the existing "compatibility marker" heuristic silently fell back to /chat/completions — which can never succeed for these models — so the real upstream error was hidden. * `_build_responses_body` did not honour `spec.strip_model_prefix`, so the request body sent `model="github_copilot/gpt-5.4-mini"` (with the routing prefix), which the Copilot gateway rejects with `model_not_supported`. (`_build_kwargs` already stripped it; this branch was missed.) Fix: * registry.py: set `supports_max_completion_tokens=True` on the `github_copilot` spec so requests use `max_completion_tokens`. * openai_compat_provider.py: - `_should_use_responses_api` now also allows the `github_copilot` spec, and skips the direct-OpenAI base check for it (the Copilot gateway is its own base URL). 
- `_build_responses_body` now strips the model routing prefix when `spec.strip_model_prefix` is set, matching `_build_kwargs`. - `chat` / `chat_stream` no longer fall back from /responses to /chat/completions on the `github_copilot` spec: the fallback cannot succeed for GPT-5/o-series and would mask the real gateway error. Tests: * tests/cli/test_commands.py: switched the `test_github_copilot_provider_refreshes_client_api_key_before_chat` fixture model from `gpt-5.1` to `gpt-4` so it continues to exercise the /chat/completions code path it was designed for (gpt-5.1 now correctly routes to /responses on github_copilot). * `pytest tests/providers/ tests/cli/test_commands.py` — 314 passed. * Verified end-to-end against the live Copilot gateway with both `github_copilot/gpt-5.4` and `github_copilot/gpt-5.4-mini`. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-04-22 14:28:19 +08:00
hussein1362	512bf59b3c	fix(session): fsync sessions on graceful shutdown to prevent data loss On filesystems with write-back caching (rclone VFS, NFS, FUSE mounts) the OS page cache may buffer recent session writes. If the process is killed before the cache flushes, the most recent conversation turns are silently lost — causing the agent to "forget" recent context and respond to stale history on the next startup. Changes: - session/manager.py: add fsync=True option to save() that flushes the file and its parent directory to durable storage. Add flush_all() that re-saves every cached session with fsync. Default save() behavior is unchanged (no fsync) to avoid performance regression in normal operation. - cli/commands.py: call agent.sessions.flush_all() in the gateway shutdown finally block, after stopping heartbeat/cron/channels. - tests/session/test_session_fsync.py: 8 tests covering fsync flag behavior, flush_all with empty/multiple/errored sessions, and data survival across simulated process restart. - tests/cli/test_commands.py: add sessions attribute to _FakeAgentLoop so the gateway health endpoint test passes with the new shutdown flush.	2026-04-22 13:19:53 +08:00
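A durable write of the kind described in the commit above (flush the file, then its parent directory) can be sketched with `os.fsync`. The function name is hypothetical and this is not the project's `save()`; the directory sync is attempted on a best-effort basis because some platforms cannot open a directory this way.

```python
import os


def durable_write_sketch(path: str, data: str) -> None:
    """Write `data` to `path` and push it through the OS cache to storage."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write(data)
        handle.flush()             # Python buffer -> OS page cache
        os.fsync(handle.fileno())  # OS page cache -> storage device

    # Sync the parent directory so the directory entry is durable as well.
    parent = os.path.dirname(os.path.abspath(path))
    try:
        dir_fd = os.open(parent, os.O_RDONLY)
    except OSError:
        return  # e.g. platforms where directories cannot be opened
    try:
        os.fsync(dir_fd)
    except OSError:
        pass  # some filesystems reject fsync on a directory
    finally:
        os.close(dir_fd)
```

Calling this only on shutdown, as the commit does with `flush_all()`, keeps the extra sync cost out of the normal save path.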
Xubin Ren	ef8bbab7b3	test(cli): lock _render_interactive_ansi force_terminal to isatty Made-with: Cursor	2026-04-22 13:12:29 +08:00
chengyongru	79821a571f	fix: suppress intermediate progress output in cron jobs Cron jobs now pass on_progress=_silent to process_direct, matching the heartbeat pattern. Previously, tool hints and streaming deltas were published to the user channel via bus during execution, but the final response could be rejected by evaluate_response — leaving users with confusing partial output and no conclusion. Closes #3319	2026-04-20 11:43:54 +08:00
Xubin Ren	b3049f7323	fix(webui): stabilize empty session history state	2026-04-19 13:38:47 +00:00
Xubin Ren	46e11a68a7	test: speed up cron and restart timing tests Replace fixed sleep-based waits with condition polling in cron tests and mock the restart delay in CLI restart tests to reduce suite runtime without changing behavior.	2026-04-19 12:35:57 +00:00
Alfredo Arenas	2d0442976e	test(cli): update _make_console tests for isatty-based fix (#3265) The old test `test_make_console_uses_force_terminal` hardcoded `force_terminal is True`, which contradicts the fix: we now defer to sys.stdout.isatty() so piped / non-TTY output gets plain text instead of ANSI escape codes. Split into two tests covering both branches: - test_make_console_force_terminal_when_stdout_is_tty: TTY path (force_terminal=True, rich output) - test_make_console_force_terminal_false_when_stdout_is_not_tty: non-TTY path (force_terminal=False, plain text) — regression guard for the bug reported in #3265 Co-authored with Claude Opus 4.7	2026-04-19 04:19:59 +08:00
Xubin Ren	384bad17b4	Merge origin/main into fix/config-default-api-base Made-with: Cursor	2026-04-18 20:08:21 +00:00
chengyongru	e1fdca7d40	fix(status): correct context percentage calculation and sync consolidator - Pass resolved self.context_window_tokens to Consolidator instead of raw parameter that could be None, preventing consolidation failures - Calculate percentage against input budget (ctx - max_completion - 1024) instead of raw context window, consistent with Consolidator/snip formulas - Pass actual max_completion_tokens from provider to build_status_content - Cap percentage display at 999 to prevent runaway values - Add tests for budget-based percentage and cap behavior	2026-04-16 20:30:39 +08:00
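The budget-based percentage from the commit above can be sketched as a small calculation. The function and parameter names are hypothetical, and returning the cap when the budget is not positive is an assumption; only the formula (context window minus max completion minus 1024) and the cap of 999 come from the commit message.

```python
def context_percent_sketch(used_tokens, context_window, max_completion,
                           reserve=1024, cap=999):
    """Percentage of the input budget consumed, capped for display."""
    # Input budget: what remains of the window after reserving room
    # for the completion and a fixed safety margin.
    budget = context_window - max_completion - reserve
    if budget <= 0:
        return cap  # assumption: treat a non-positive budget as saturated
    return min(cap, used_tokens * 100 // budget)
```

With a 65536-token window and an 8192-token completion budget, the input budget is 56320 tokens, so 28160 used tokens display as 50 percent rather than the roughly 43 percent a raw-window calculation would show.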
Xubin Ren	2b8e90d8fd	test(config): cover LM Studio nullable api key	2026-04-16 02:49:54 +08:00
Xubin Ren	25ded8e747	test: cover active task count in status Lock the /status task counter to the actual stop scope by asserting it sums unfinished dispatch tasks with running subagents for the current session. Made-with: Cursor	2026-04-15 01:49:42 +08:00
aiguozhi123456	634f4b45c1	feat: show active task count in /status output	2026-04-15 01:49:42 +08:00
Xubin Ren	e4b3f9bd28	security(gateway): keep health endpoint local by default Bind the gateway health listener to localhost by default and reduce the probe response to a minimal status payload so accidental public exposure leaks less information. Made-with: Cursor	2026-04-14 07:19:38 +00:00
Xubin Ren	4999e2f734	Merge origin/main into feat/health-endpoint Keep the gateway health endpoint patch current with the latest gateway runtime changes, and lock the new HTTP routes in with CLI regression coverage and README guidance. Made-with: Cursor	2026-04-14 06:32:31 +00:00
moranfong	0750d1f182	fix(config): return provider default api base in config resolution	2026-04-13 23:42:58 +08:00
Leo fu	42624f5bf3	test: update expected token display to match consistent 1000 divisor The test fixtures use 65536 as context_window_tokens. With the divisor corrected from 1024 to 1000, the display changes from 64k to 65k.	2026-04-09 10:40:20 +08:00
Xubin Ren	075bdd5c3c	refactor: move SafeFileHistory to module level + add regression tests - Promote _SafeFileHistory to module-level SafeFileHistory for testability - Add 5 regression tests: surrogates, normal text, emoji, mixed CJK, multi-surrogates Made-with: Cursor	2026-04-07 13:57:34 +08:00
Xubin Ren	c9d4b7b905	Merge remote-tracking branch 'origin/main' into pr-2449 Made-with: Cursor # Conflicts: # nanobot/utils/evaluator.py	2026-04-06 06:30:11 +00:00
Ben Lenarts	202938ae73	feat: support ${VAR} env var interpolation in config secrets Allow config.json to reference environment variables via ${VAR_NAME} syntax. Variables are resolved at runtime by resolve_config_env_vars(), keeping the raw templates in the Pydantic model so save_config() preserves them. This lets secrets live in a separate env file (e.g. loaded by systemd EnvironmentFile=) instead of plain text in config.json.	2026-04-06 13:43:26 +08:00
Jiajun Xie	f86f226c17	fix(cli): prevent spinner ANSI escape codes from being printed verbatim Fixes #2591 The "nanobot is thinking..." spinner was printing ANSI escape codes literally in some terminals, causing garbled output like: ?[2K?[32m⠧?[0m ?[2mnanobot is thinking...?[0m Root causes: 1. Console created without force_terminal=True, so Rich couldn't reliably detect terminal capabilities 2. Spinner continued running during user input prompt, conflicting with prompt_toolkit Changes: - Set force_terminal=True in _make_console() for proper ANSI handling - Add stop_for_input() method to StreamRenderer - Call stop_for_input() before reading user input in interactive mode - Add tests for the new functionality	2026-04-05 16:50:49 +08:00
Xubin Ren	30ea048f19	Merge remote-tracking branch 'origin/main' into pr-2717-review	2026-04-04 04:42:52 +00:00
imfondof	896d578677	fix(restart): show restart completion with elapsed time across channels	2026-04-04 02:21:42 +08:00
imfondof	ba7c07ccf2	fix(restart): send completion notice after channel is ready and unify runtime keys	2026-04-04 02:21:42 +08:00
chengyongru	b9616674f0	feat(agent): two-stage memory system with Dream consolidation Replace single-stage MemoryConsolidator with a two-stage architecture: - Consolidator: lightweight token-budget triggered summarization, appends to HISTORY.md with cursor-based tracking - Dream: cron-scheduled two-phase processor that analyzes HISTORY.md and updates SOUL.md, USER.md, MEMORY.md via AgentRunner with edit_file tools for surgical, fault-tolerant updates New files: MemoryStore (pure file I/O), Dream class, DreamConfig, /dream and /dream-log commands. 89 tests covering all components.	2026-04-02 22:42:25 +08:00
chengyongru	da08dee144	feat(provider): show cache hit rate in /status (#2645)	2026-04-02 12:51:45 +08:00
RongLei	c5f0997381	fix: refresh copilot token before requests Address PR review feedback by avoiding an async method reference as the OpenAI client api_key. Initialize the client with a placeholder key, refresh the Copilot token before each chat/chat_stream call, and update the runtime client api_key before dispatch. Add a regression test that verifies the client api_key is refreshed to a real string before chat requests. Generated with GitHub Copilot, GPT-5.4.	2026-04-02 03:46:40 +08:00
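
Commit e1fdca7d40 above describes computing the /status context percentage against the input budget (ctx - max_completion - 1024) and capping the displayed value at 999. A minimal sketch of that arithmetic follows. The function name `context_percentage`, its parameter names, the use of floor division, and the handling of a non-positive budget are assumptions made for illustration; they are not taken from the nanobot source.

```python
def context_percentage(
    used_tokens: int,
    context_window_tokens: int,
    max_completion_tokens: int,
) -> int:
    """Return the share of the input budget in use, as a whole percent.

    Hypothetical helper: only the budget formula and the 999 cap come
    from the commit message; everything else is an assumption.
    """
    # Input budget as stated in the commit: ctx - max_completion - 1024.
    budget = context_window_tokens - max_completion_tokens - 1024
    if budget <= 0:
        # Assumption: report 0 when no input budget is left to divide by.
        return 0
    # Floor division keeps the result an integer (assumed rounding mode).
    percent = used_tokens * 100 // budget
    # Cap the display at 999 to prevent runaway values.
    return min(percent, 999)


if __name__ == "__main__":
    # A 65536-token window with 4096 reserved for completion leaves
    # a budget of 60416 tokens.
    print(context_percentage(30_000, 65_536, 4_096))
```

Measuring against the budget rather than the raw window means the percentage reaches 100 at the point where input no longer fits, which is the behaviour the commit says matches the Consolidator/snip formulas.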


55 Commits
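
Commit 202938ae73 in the log above adds ${VAR} interpolation for config secrets, resolved at runtime while the raw templates are kept so they can be saved back unchanged. The sketch below shows one way such a resolution step could work on plain dictionaries. The function `interpolate_env`, the regular expression, and the choice to raise `KeyError` for an unset variable are all hypothetical; this is not the project's `resolve_config_env_vars()` implementation.

```python
from __future__ import annotations

import os
import re
from typing import Any, Mapping

# Matches ${VAR_NAME}; the allowed name characters are an assumption.
_VAR_PATTERN = re.compile(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}")


def interpolate_env(value: Any, env: Mapping[str, str] | None = None) -> Any:
    """Return a copy of `value` with ${VAR} references replaced.

    Hypothetical helper. Dicts and lists are rebuilt rather than mutated,
    so the caller's raw templates stay intact.
    """
    env = os.environ if env is None else env

    if isinstance(value, str):
        def _substitute(match: re.Match[str]) -> str:
            name = match.group(1)
            if name not in env:
                # Assumption: an unset variable is treated as an error.
                raise KeyError(f"environment variable {name!r} is not set")
            return env[name]

        return _VAR_PATTERN.sub(_substitute, value)
    if isinstance(value, dict):
        return {key: interpolate_env(item, env) for key, item in value.items()}
    if isinstance(value, list):
        return [interpolate_env(item, env) for item in value]
    # Numbers, booleans and None pass through unchanged.
    return value


if __name__ == "__main__":
    raw = {"providers": {"example": {"api_key": "${EXAMPLE_API_KEY}"}}}
    resolved = interpolate_env(raw, {"EXAMPLE_API_KEY": "s3cret"})
    print(resolved["providers"]["example"]["api_key"])
    print(raw["providers"]["example"]["api_key"])
```

Because the resolved values live in a separate copy, writing the original structure back to disk preserves the ${VAR} templates, which is the property the commit message describes for save_config().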