nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-19 16:12:30 +00:00

Author	SHA1	Message	Date
Kaloyan Tenchov	bb788cdb7d	feat(image-generation): add Gemini provider support Adds GeminiImageGenerationClient covering both Imagen 4 (:predict) and Gemini Flash (:generateContent), wires the gemini ProviderConfig through the SDK, API server, and gateway entry points, and updates the image-generation docs and skill. Errors from the Gemini endpoints are logged and surface with the HTTP status and parsed message instead of an empty string. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 15:28:58 +08:00
yaotutu	557f4e6ae9	feat: add MiniMax image generation provider support Add MiniMaxImageGenerationClient with support for: - Text-to-image generation via MiniMax image-01 model - Reference image support (subject_reference) - Aspect ratio selection - Proper error handling aligned with existing providers Wire up MiniMax provider config in ImageGenerationTool, gateway, serve, and Nanobot class.	2026-05-18 15:28:51 +08:00
chengyongru	d4ade8f680	feat(cli): add Model Preset wizard to onboard Extract the [M] Model Presets interactive CRUD screen from PR #3696 and adapt it to the current main branch schema (fallback_models instead of fallback_presets). Adds preset cache, field handlers for model_preset/provider/fallback_models, and 9 new tests.	2026-05-18 15:13:41 +08:00
chengyongru	28d0f8560e	fix(webui): preserve single newlines in markdown rendering Add remark-breaks plugin so that single newlines in assistant messages (such as /help output) render as line breaks instead of being collapsed into a single paragraph by standard markdown behavior.	2026-05-18 15:12:27 +08:00
Xubin Ren	ba38f90832	Merge PR #3877 : feat(webui+agent): optimize streaming, activity rendering, and runtime sync feat(webui+agent): optimize streaming, activity rendering, and runtime sync	2026-05-18 02:04:36 +08:00
Xubin Ren	eb3aed359f	Refine file edit progress gating	2026-05-18 01:59:55 +08:00
Xubin Ren	4445fcc8b9	refactor(cli): localize reasoning buffer state	2026-05-18 01:34:08 +08:00
liyazhou	b67205f5aa	fix(cli): buffer reasoning tokens to avoid one-token-per-line display	2026-05-18 01:34:08 +08:00
Xubin Ren	de8761f25a	fix(test): add gateway llm runtime fake	2026-05-18 01:19:45 +08:00
Xubin Ren	8708ccea86	Merge branch 'main' of https://github.com/HKUDS/nanobot into codex/webui-performance	2026-05-18 01:18:28 +08:00
Xubin Ren	eb0ff3ad1d	fix(memory): refresh session before empty guard	2026-05-18 01:16:47 +08:00
chengyongru	c58a360b25	fix(test): seed get_or_create mock for session-refresh guard compatibility	2026-05-18 01:16:47 +08:00
chengyongru	5bb94edc99	refactor(autocompact): delegate _archive to Consolidator.compact_idle_session Replace AutoCompact._archive() direct session mutation with delegation to Consolidator.compact_idle_session(). Remove _split_unconsolidated() method since that logic now lives inside compact_idle_session. All session mutation for idle compaction now goes through the Consolidator's lock, eliminating the race condition between background token consolidation and idle TTL compaction. Changes: - autocompact.py: rewrite _archive() to call compact_idle_session, remove _split_unconsolidated(), clean up unused imports - test_autocompact_unit.py: replace TestArchive/TestSplitUnconsolidated with TestArchiveDelegates that verifies delegation behavior - test_auto_compact.py: convert all consolidator.archive mocks to consolidator.compact_idle_session mocks via _make_fake_compact helper	2026-05-18 01:16:47 +08:00
chengyongru	888d54790d	fix(memory): add session-refresh guard to maybe_consolidate_by_tokens When background consolidation runs with a stale session reference (captured before AutoCompact replaced the session via compact_idle_session), it could operate on outdated data. Now, after acquiring the per-session lock, the method refreshes its session reference from SessionManager.get_or_create(). If the session was replaced, it swaps in the fresh reference before doing any consolidation work. This prevents a race where AutoCompact truncates an idle session while a background maybe_consolidate_by_tokens call is in flight with the old session object.	2026-05-18 01:16:47 +08:00
chengyongru	48d35bd2d9	feat(consolidator): add compact_idle_session method with lock-protected truncation Add Consolidator.compact_idle_session(session_key, max_suffix=8) that performs hard-truncation of idle sessions under the per-session consolidation lock. This is the single lock-protected path for AutoCompact to use instead of modifying session state directly, fixing the race condition between AutoCompact and Consolidator. Behavior: - Acquires per-session consolidation lock - Invalidates cache and reloads fresh from disk - Splits unconsolidated tail into archive prefix and retained suffix - Archives prefix via LLM (with raw_archive fallback on failure) - Persists _last_summary in session metadata on success - Returns summary text, None on LLM failure, or '' if nothing to archive Tests: 6 new tests covering prefix archival, empty session timestamp refresh, (nothing) summary exclusion, LLM failure fallback, last_consolidated offset, and lock acquisition verification.	2026-05-18 01:16:47 +08:00
Xubin Ren	fce1550814	fix(webui): refresh bootstrap token before expiry	2026-05-18 00:53:36 +08:00
voidborne-d	bf8a6e35fd	docs(deployment): match docker run gateway example to docker-compose.yml (refs #3873 ) The `docker run` example for `gateway` in `docs/deployment.md` had drifted from the canonical configuration in `docker-compose.yml`: - It omitted the security flags that `docker-compose.yml` already declares (`cap_drop: ALL` + `cap_add: SYS_ADMIN` + unconfined apparmor/seccomp). These are required whenever `tools.exec.sandbox: "bwrap"` is enabled, because bwrap needs CAP_SYS_ADMIN for user namespaces; without them bwrap exits with `clone3: Operation not permitted` and exec tools silently fail. - It omitted `-p 8765:8765`, even though both the bundled `docker-compose.yml` and `Dockerfile` (`EXPOSE 18790 8765`) already expose the WebSocket channel / WebUI port; users following the docs would get a reachable gateway health endpoint but an unreachable WebUI. This change keeps the two paths in sync so anyone reading deployment.md and using `docker run` directly gets the same security posture and port surface as the Compose path. Also adds a short `!IMPORTANT` note documenting that `gateway.host` and `channels.websocket.host` default to `127.0.0.1` (set in `nanobot/config/schema.py:GatewayConfig`). Docker `-p` cannot forward to the container's loopback interface, so the user must set both binds to `0.0.0.0` in `config.json` for the published ports to actually be reachable. This is the symptom reported as items 2 + 3 of #3873; items 1 + 4 of that issue are already resolved on `main` (`Dockerfile` line 49 already exposes both ports, and README.md lines 218-220 already reflect that the WebUI ships in the wheel). Docs only, no code changes. Signed-off-by: voidborne-d <258577966+voidborne-d@users.noreply.github.com>	2026-05-18 00:45:49 +08:00
Xubin Ren	f017e209da	docs(configuration): align Docker env-file example	2026-05-18 00:45:34 +08:00
olgagaga	5a34504b76	docs(configuration): expand "Environment Variables for Secrets" section - Note that any string field supports ${VAR_NAME} and resolved values are never written back to disk. - Document the failure mode for unset variables. - Add MCP (stdio env + HTTP headers) and web-search examples. - Add Docker, direnv, and secret-manager (1Password / pass / Bitwarden) delivery patterns alongside the existing systemd example. - Replace plaintext apiKey values in tools.web.search examples (Brave, Tavily, Jina, Kagi, Olostep) with ${PROVIDER_API_KEY} placeholders so the docs stop modelling the anti-pattern. - Cross-link from the Security section. Refs: HKUDS/nanobot#2172	2026-05-18 00:45:34 +08:00
Xubin Ren	af26ed0041	fix(heartbeat): remove unused runtime import	2026-05-18 00:40:31 +08:00
Xubin Ren	112f40ad67	fix(agent): refresh llm runtime for background tasks	2026-05-18 00:35:12 +08:00
Xubin Ren	2f323e24c1	fix(webui): polish session titles and status	2026-05-17 23:52:50 +08:00
Xubin Ren	361f31c0e4	fix(webui): use portal file reference tooltips	2026-05-17 23:52:29 +08:00
Xubin Ren	945f208d38	feat(webui): render file edit activity	2026-05-17 23:52:14 +08:00
Xubin Ren	c8bb04a8fe	feat(webui): persist agent activity events	2026-05-17 23:51:52 +08:00
Xubin Ren	4b5de66c58	Polish WebUI streaming and provider settings	2026-05-17 17:41:33 +08:00
Xubin Ren	9340567f2d	Fix duplicate reasoning display	2026-05-17 17:11:38 +08:00
Xubin Ren	e5be4dac7a	Optimize WebUI streaming and long history rendering Batch stream deltas, window long transcripts, lazy-load syntax highlighting, and refine activity/composer interactions. Add title refresh retries plus tests for streaming, windowing, code blocks, and live activity behavior.	2026-05-17 17:04:57 +08:00
Xubin Ren	175b58e259	fix(docker): document bundled webui port	2026-05-17 15:51:04 +08:00
huanglei.214	3bf8de047a	fix docker build	2026-05-17 15:51:04 +08:00
chengyongru	400f822601	fix(providers): recognize Chinese rate-limit marker '访问量过大' as transient error	2026-05-17 14:25:20 +08:00
Xubin Ren	9fb9d7afcb	docs: update README with v0.2.0 release details, including new features and improvements	2026-05-16 15:22:32 +00:00
Xubin Ren	c018c3fb6a	chore(release): bundle webui into wheel and prep 0.2.0 v0.2.0	2026-05-16 13:38:11 +00:00
olgagaga	0ca0fe2221	fix(providers): wire MiMo thinking control on gateway providers (#3845 ) The xiaomi_mimo ProviderSpec carries thinking_style="thinking_type", but gateway providers (OpenRouter etc.) route MiMo under their own spec which has no thinking_style. As a result, `reasoning_effort="none"` was silently ignored: `{"thinking": {"type": "disabled"}}` was never injected and responses still contained reasoning_content. Mirror the Kimi pattern that already handles the same problem: add an explicit _MIMO_THINKING_MODELS allowlist (mimo-v2.5-pro, mimo-v2.5, mimo-v2-pro, mimo-v2-omni — per Xiaomi docs), an _is_mimo_thinking_model helper that strips publisher prefixes ("xiaomi/mimo-v2.5-pro" matches), and a sibling branch in _build_kwargs that injects the thinking payload by model name. mimo-v2-flash is intentionally excluded — it has no thinking mode. Also include MiMo in the explicit_thinking predicate so the reasoning_content backfill (#3554, #3584) covers the gateway path consistently with the direct path. Tests cover the gateway disable/enable signals, bare-slug fallback, flash exclusion, and a non-MiMo sanity check.	2026-05-16 20:46:34 +08:00
chengyongru	8a819dda1e	fix(agent): remove duplicate runtime context injection in mid-turn drain _drain_pending injected a full runtime context block (including goal state) into every injected user message, but the initial message already carries runtime context via build_messages(). This caused goal state to appear multiple times in the LLM context window within a single turn, wasting tokens (up to 4000 chars per duplicate). Now _drain_pending only passes the raw user content without runtime context. The initial turn message remains the sole carrier.	2026-05-16 20:46:08 +08:00
chengyongru	45eacc3a98	docs: update CLAUDE.md to reflect current codebase state - Update channels list: add WeCom, DingTalk, Email, MoChat, MS Teams - Update providers: add Bedrock, Codex, Responses API, image generation, transcription - Update tools: add long_task/sustained goals, image generation, sandbox backends - Update session: add goal_state.py for sustained goal tracking - Add missing subsystems: API Server, Command Router, Heartbeat, Pairing, Skills, Security	2026-05-16 20:45:52 +08:00
Xubin Ren	387724c355	test(agent): add tests to ensure goal state does not leak across sessions	2026-05-16 11:14:56 +00:00
ykstart	f97b960433	fix(exec): refine format command deny pattern to allow URL parameters The previous regex r"(?:^\|[;&\|]\s*)format\b" incorrectly blocked commands containing URL parameters like &format=json. Added negative lookahead (?!=) so format= (URL param key=value) is allowed while standalone format commands (e.g. ;format, &format, \|format) remain blocked. Added test cases for both blocking and allowing scenarios.	2026-05-16 18:52:42 +08:00
Xubin Ren	e87c07c368	fix(agent): prevent outer wall-clock timeout for streaming requests	2026-05-16 10:12:57 +00:00
Xubin Ren	06a1bef9fe	fix(goal): reduce pre-long_task overthinking	2026-05-16 09:57:44 +00:00
Xubin Ren	e804f2fddb	fix(agent): align LLM wall timeout with sustained goals for main + subagents Centralize runner_wall_llm_timeout_s in session goal_state metadata helpers so spawned subagents inherit the same policy as AgentLoop without coupling to long_task. Pass optional resolver into SubagentManager and add tests. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-16 16:33:49 +08:00
Xubin Ren	cf09a8d691	refactor(webui): disable React StrictMode and enhance Markdown rendering	2026-05-16 08:33:15 +00:00
Xubin Ren	2144af7cd0	fix(agent): disable LLM wall-clock timeout during sustained goals	2026-05-16 05:27:40 +00:00
Xubin Ren	90632469f6	fix(webui): rename goal-related terminology and enhance UI components	2026-05-16 04:42:58 +00:00
olgagaga	e14c0310ad	docs(contributing): warn that `ruff format` predates the codebase The Development Setup block instructs new contributors to run `ruff format nanobot/`, but the tree predates the formatter and many lines exceed the configured 100-char limit (E501 is ignored). Running the command as documented produces an ~80-file unrelated diff that buries real changes. Document this and recommend formatting only the files actually touched.	2026-05-16 12:25:28 +08:00
Xubin Ren	2e31002e6e	refactor(long_task): streamline goal instructions and enhance documentation	2026-05-16 04:25:09 +00:00
Xubin Ren	897eedaaa7	chore(ci): update Python version in CI workflow to focus on supported runtimes 3.13 and 3.14	2026-05-16 04:15:58 +00:00
yanalialiuk	18072856ec	feat: add Atomic Chat as OpenAI-compatible local provider Register atomic_chat in the provider registry with default base URL http://localhost:1337/v1, schema field, docs, and config tests.	2026-05-16 12:14:33 +08:00
Xubin Ren	9ccef018c2	feat(telegram): add new slash commands and update regex for command handling	2026-05-15 17:55:52 +00:00
Xubin Ren	0f96ab7e70	fix(webui): drop App markdown warmup; keep preloadMarkdownText export Startup no longer triggers preloadMarkdownText (#3746). Restore the named export so MessageBubble can still warm the lazy markdown chunk when the reasoning panel opens (compatible with current main). Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-16 01:42:42 +08:00

1 2 3 4 5 ...

2525 Commits