nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-21 09:02:32 +00:00

Author	SHA1	Message	Date
Kaloyan Tenchov	3874b3acf4	fix(signal): normalize composite sender_ids in is_allowed too The base BaseChannel.is_allowed() does a literal ``sender_id in allow_from`` check, but Signal's sender_id is a pipe-joined composite of phone/UUID parts. After splitting an allowlist entry like ``+phone\|uuid`` into two separate entries, the per-DM gate accepted it but the base gate still denied because the composite sender string wasn't literally in the list. Override is_allowed on SignalChannel to delegate to _sender_matches_allowlist, which already splits both sides on ``\|`` and normalizes each part. _sender_matches_allowlist itself now also splits allowlist entries on ``\|`` so legacy composite entries keep working too. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	01725bab11	test(signal): cover markdown adjacency, nesting, and malformed input The existing markdown suite was strong on UTF-16 offsets and chunk redistribution but had no coverage for nested or adjacent styles, no test that an unmatched opener round-trips as plain text, and no test for the blockquote/inline-code interaction. Add six cases including the documented contiguous-BOLD output for `# wrap me`, which Signal renders as one visual span. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	a786e3d225	test(signal): consolidate channel-capture setup into one factory Two test classes (TestHandleDataMessageDM, TestHandleDataMessageGroup) plus three TestCommandHandling tests each repeated the same handful of lines: build a channel, mock _handle_message to record kwargs, replace _start_typing with a no-op, paper over the assignment with type: ignore. Hoist the pattern into _make_channel_with_capture and call it from all five sites. Drops 30+ lines of duplication and 7 type: ignore comments. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	626f262121	test(signal): cover SSE receive loop and the empty-phone start guard Previously the SSE loop and the empty-phone-number short-circuit in start() had zero coverage. Both now have tests: a fake httpx stream feeds canned SSE lines, exercising the valid-frame, invalid-JSON, non-200, and no-http-client paths; start() with an empty phone number is asserted to return without entering the HTTP loop. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	7caf492ae2	refactor(signal): split _handle_data_message into policy and assembly helpers The receive-path handler was ~165 lines deep into nested DM/group policy checks, buffer mutations, mention stripping, attachment downloads, and final bus forwarding. Pull the policy gate out into _check_inbound_policy (returns (allow, chat_id), still appends to the group buffer once allowed) and the text+media construction into _assemble_inbound_content. The top-level method now reads as orchestration only. Add TestCheckInboundPolicy that exercises the helper directly across the DM/group policy permutations, including the buffer side effect, so the new seam is locked in. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	9aa2ab1657	feat(signal): make signal-cli attachments directory configurable The inbound attachment loop hardcoded ~/.local/share/signal-cli/attachments as the source path. That is the daemon's default on Linux but not on macOS or Windows, and breaks if the daemon was launched with XDG_DATA_HOME set. Add SignalConfig.attachments_dir as an optional override. When unset the behavior is unchanged; when set the value is run through Path.expanduser() so ~ is honored. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	971b774282	refactor(signal): wrap top-level receive handler with _safe_handle Replace the inline try/except at the end of _handle_receive_notification with a small async context manager that swallows the exception, logs self.logger.error with the offending payload's repr (bounded to 200 chars), and attaches the traceback via logger.opt(exception=True). The previous log line only carried `e`, so diagnosing a bad envelope from production logs required correlating timestamps. The wrapper is generic so future receive/dispatch sites can adopt it; for now only this site uses it. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	1377759705	fix(signal): normalize identifiers when matching DM allowlist The DM allowlist check split sender_id on '\|' and looked for raw membership in the allow_from list. Senders carry their phone number with a leading '+' but admins routinely write allowlist entries without it (or vice versa), and UUID/ACI matches were case-sensitive. Both forms now flow through _normalize_signal_id, so an entry like 19995550001 matches a sender +19995550001 and a UUID matches case-insensitively. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	d56bafa6d0	refactor(signal): hygiene cleanups around constants, typing, and config - Hoist the cell-strip patterns to module level so they match the rest of the module's regex style and aren't reparsed on every call. - Type the markdown transform callback and the mention id walker so the inline Callable signature is no longer an untyped Any. - Add _HTTP_TIMEOUT_SECONDS alongside the other class-level tunables. - Reject group_message_buffer_size <= 0 in a Pydantic field_validator rather than silently disabling the buffer at write time. - Mark SignalConfig.allow_from as a computed_field so it shows up in model_dump() instead of being invisible to serialization. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	6ec6c9bb83	fix(signal): redistribute textStyle ranges across split message chunks split_message can break a long Signal payload into multiple JSON-RPC sends, but the previous code attached the full textStyle list only to chunk 0. Style ranges in later chunks were dropped, and ranges whose offsets pointed past chunk 0's end were sent as invalid metadata against chunk 0. Add _partition_styles, which rebases each range against the chunk it lives in (in UTF-16 code units, matching the markdown converter) and splits boundary-spanning ranges across the chunks they touch. Whitespace trimmed by split_message's lstrip is skipped so offsets stay aligned. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	8a2a5eecdd	fix(signal): emit textStyle offsets in UTF-16 code units Signal's BodyRange (via signal-cli's textStyle) interprets start/length as UTF-16 code units, but the Phase-3 assembly used Python's len(), which counts code points. A single non-BMP character (e.g. an emoji) earlier in a message shifted every subsequent styled span left by one unit, dropping the last letter of bold/italic words. Track a running UTF-16 offset in the assembly loop and add regression tests covering emojis, supplementary CJK, ZWJ sequences, and a multi-section message that mirrors the reported failure. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	08154b4374	fix(signal): drop duplicate self in unconfigured-account log call Addresses review feedback on HKUDS/nanobot#3852: self.self.logger.error would crash if the phone_number guard ever fired.	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	880097acd5	feat(signal): add Signal channel support Integrates signal-cli daemon via HTTP JSON-RPC as a nanobot channel. Supports DMs and group chats with open/allowlist access policies, markdown→Signal text style conversion, typing indicators, attachment handling, group message context buffering, and automatic reconnect with exponential backoff. Includes unit tests for channel lifecycle, message routing, mention detection, markdown conversion, and message splitting. Originally based on https://github.com/HKUDS/nanobot/pull/601.	2026-05-20 22:57:49 +08:00
chengyongru	e02615c93d	Merge branch 'main' into nightly	2026-05-18 18:05:29 +08:00
Kaloyan Tenchov	e9259e680e	feat(image-generation): add Gemini provider support Adds GeminiImageGenerationClient covering both Imagen 4 (:predict) and Gemini Flash (:generateContent), wires the gemini ProviderConfig through the SDK, API server, and gateway entry points, and updates the image-generation docs and skill. Errors from the Gemini endpoints are logged and surface with the HTTP status and parsed message instead of an empty string. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 15:25:03 +08:00
yaotutu	a5b85a3d6b	feat: add MiniMax image generation provider support Add MiniMaxImageGenerationClient with support for: - Text-to-image generation via MiniMax image-01 model - Reference image support (subject_reference) - Aspect ratio selection - Proper error handling aligned with existing providers Wire up MiniMax provider config in ImageGenerationTool, gateway, serve, and Nanobot class.	2026-05-18 15:14:45 +08:00
chengyongru	d4ade8f680	feat(cli): add Model Preset wizard to onboard Extract the [M] Model Presets interactive CRUD screen from PR #3696 and adapt it to the current main branch schema (fallback_models instead of fallback_presets). Adds preset cache, field handlers for model_preset/provider/fallback_models, and 9 new tests.	2026-05-18 15:13:41 +08:00
chengyongru	28d0f8560e	fix(webui): preserve single newlines in markdown rendering Add remark-breaks plugin so that single newlines in assistant messages (such as /help output) render as line breaks instead of being collapsed into a single paragraph by standard markdown behavior.	2026-05-18 15:12:27 +08:00
Xubin Ren	ba38f90832	Merge PR #3877 : feat(webui+agent): optimize streaming, activity rendering, and runtime sync feat(webui+agent): optimize streaming, activity rendering, and runtime sync	2026-05-18 02:04:36 +08:00
Xubin Ren	eb3aed359f	Refine file edit progress gating	2026-05-18 01:59:55 +08:00
Xubin Ren	4445fcc8b9	refactor(cli): localize reasoning buffer state	2026-05-18 01:34:08 +08:00
liyazhou	b67205f5aa	fix(cli): buffer reasoning tokens to avoid one-token-per-line display	2026-05-18 01:34:08 +08:00
Xubin Ren	de8761f25a	fix(test): add gateway llm runtime fake	2026-05-18 01:19:45 +08:00
Xubin Ren	8708ccea86	Merge branch 'main' of https://github.com/HKUDS/nanobot into codex/webui-performance	2026-05-18 01:18:28 +08:00
Xubin Ren	eb0ff3ad1d	fix(memory): refresh session before empty guard	2026-05-18 01:16:47 +08:00
chengyongru	c58a360b25	fix(test): seed get_or_create mock for session-refresh guard compatibility	2026-05-18 01:16:47 +08:00
chengyongru	5bb94edc99	refactor(autocompact): delegate _archive to Consolidator.compact_idle_session Replace AutoCompact._archive() direct session mutation with delegation to Consolidator.compact_idle_session(). Remove _split_unconsolidated() method since that logic now lives inside compact_idle_session. All session mutation for idle compaction now goes through the Consolidator's lock, eliminating the race condition between background token consolidation and idle TTL compaction. Changes: - autocompact.py: rewrite _archive() to call compact_idle_session, remove _split_unconsolidated(), clean up unused imports - test_autocompact_unit.py: replace TestArchive/TestSplitUnconsolidated with TestArchiveDelegates that verifies delegation behavior - test_auto_compact.py: convert all consolidator.archive mocks to consolidator.compact_idle_session mocks via _make_fake_compact helper	2026-05-18 01:16:47 +08:00
chengyongru	888d54790d	fix(memory): add session-refresh guard to maybe_consolidate_by_tokens When background consolidation runs with a stale session reference (captured before AutoCompact replaced the session via compact_idle_session), it could operate on outdated data. Now, after acquiring the per-session lock, the method refreshes its session reference from SessionManager.get_or_create(). If the session was replaced, it swaps in the fresh reference before doing any consolidation work. This prevents a race where AutoCompact truncates an idle session while a background maybe_consolidate_by_tokens call is in flight with the old session object.	2026-05-18 01:16:47 +08:00
chengyongru	48d35bd2d9	feat(consolidator): add compact_idle_session method with lock-protected truncation Add Consolidator.compact_idle_session(session_key, max_suffix=8) that performs hard-truncation of idle sessions under the per-session consolidation lock. This is the single lock-protected path for AutoCompact to use instead of modifying session state directly, fixing the race condition between AutoCompact and Consolidator. Behavior: - Acquires per-session consolidation lock - Invalidates cache and reloads fresh from disk - Splits unconsolidated tail into archive prefix and retained suffix - Archives prefix via LLM (with raw_archive fallback on failure) - Persists _last_summary in session metadata on success - Returns summary text, None on LLM failure, or '' if nothing to archive Tests: 6 new tests covering prefix archival, empty session timestamp refresh, (nothing) summary exclusion, LLM failure fallback, last_consolidated offset, and lock acquisition verification.	2026-05-18 01:16:47 +08:00
Xubin Ren	fce1550814	fix(webui): refresh bootstrap token before expiry	2026-05-18 00:53:36 +08:00
voidborne-d	bf8a6e35fd	docs(deployment): match docker run gateway example to docker-compose.yml (refs #3873 ) The `docker run` example for `gateway` in `docs/deployment.md` had drifted from the canonical configuration in `docker-compose.yml`: - It omitted the security flags that `docker-compose.yml` already declares (`cap_drop: ALL` + `cap_add: SYS_ADMIN` + unconfined apparmor/seccomp). These are required whenever `tools.exec.sandbox: "bwrap"` is enabled, because bwrap needs CAP_SYS_ADMIN for user namespaces; without them bwrap exits with `clone3: Operation not permitted` and exec tools silently fail. - It omitted `-p 8765:8765`, even though both the bundled `docker-compose.yml` and `Dockerfile` (`EXPOSE 18790 8765`) already expose the WebSocket channel / WebUI port; users following the docs would get a reachable gateway health endpoint but an unreachable WebUI. This change keeps the two paths in sync so anyone reading deployment.md and using `docker run` directly gets the same security posture and port surface as the Compose path. Also adds a short `!IMPORTANT` note documenting that `gateway.host` and `channels.websocket.host` default to `127.0.0.1` (set in `nanobot/config/schema.py:GatewayConfig`). Docker `-p` cannot forward to the container's loopback interface, so the user must set both binds to `0.0.0.0` in `config.json` for the published ports to actually be reachable. This is the symptom reported as items 2 + 3 of #3873; items 1 + 4 of that issue are already resolved on `main` (`Dockerfile` line 49 already exposes both ports, and README.md lines 218-220 already reflect that the WebUI ships in the wheel). Docs only, no code changes. Signed-off-by: voidborne-d <258577966+voidborne-d@users.noreply.github.com>	2026-05-18 00:45:49 +08:00
Xubin Ren	f017e209da	docs(configuration): align Docker env-file example	2026-05-18 00:45:34 +08:00
olgagaga	5a34504b76	docs(configuration): expand "Environment Variables for Secrets" section - Note that any string field supports ${VAR_NAME} and resolved values are never written back to disk. - Document the failure mode for unset variables. - Add MCP (stdio env + HTTP headers) and web-search examples. - Add Docker, direnv, and secret-manager (1Password / pass / Bitwarden) delivery patterns alongside the existing systemd example. - Replace plaintext apiKey values in tools.web.search examples (Brave, Tavily, Jina, Kagi, Olostep) with ${PROVIDER_API_KEY} placeholders so the docs stop modelling the anti-pattern. - Cross-link from the Security section. Refs: HKUDS/nanobot#2172	2026-05-18 00:45:34 +08:00
Xubin Ren	af26ed0041	fix(heartbeat): remove unused runtime import	2026-05-18 00:40:31 +08:00
Xubin Ren	112f40ad67	fix(agent): refresh llm runtime for background tasks	2026-05-18 00:35:12 +08:00
Xubin Ren	2f323e24c1	fix(webui): polish session titles and status	2026-05-17 23:52:50 +08:00
Xubin Ren	361f31c0e4	fix(webui): use portal file reference tooltips	2026-05-17 23:52:29 +08:00
Xubin Ren	945f208d38	feat(webui): render file edit activity	2026-05-17 23:52:14 +08:00
Xubin Ren	c8bb04a8fe	feat(webui): persist agent activity events	2026-05-17 23:51:52 +08:00
Xubin Ren	4b5de66c58	Polish WebUI streaming and provider settings	2026-05-17 17:41:33 +08:00
Xubin Ren	9340567f2d	Fix duplicate reasoning display	2026-05-17 17:11:38 +08:00
Xubin Ren	e5be4dac7a	Optimize WebUI streaming and long history rendering Batch stream deltas, window long transcripts, lazy-load syntax highlighting, and refine activity/composer interactions. Add title refresh retries plus tests for streaming, windowing, code blocks, and live activity behavior.	2026-05-17 17:04:57 +08:00
Xubin Ren	175b58e259	fix(docker): document bundled webui port	2026-05-17 15:51:04 +08:00
huanglei.214	3bf8de047a	fix docker build	2026-05-17 15:51:04 +08:00
chengyongru	400f822601	fix(providers): recognize Chinese rate-limit marker '访问量过大' as transient error	2026-05-17 14:25:20 +08:00
Xubin Ren	9fb9d7afcb	docs: update README with v0.2.0 release details, including new features and improvements	2026-05-16 15:22:32 +00:00
chengyongru	82c323c2d9	fix(providers): recognize Chinese rate-limit marker '访问量过大' as transient error	2026-05-16 22:06:54 +08:00
Xubin Ren	c018c3fb6a	chore(release): bundle webui into wheel and prep 0.2.0 v0.2.0	2026-05-16 13:38:11 +00:00
olgagaga	0ca0fe2221	fix(providers): wire MiMo thinking control on gateway providers (#3845 ) The xiaomi_mimo ProviderSpec carries thinking_style="thinking_type", but gateway providers (OpenRouter etc.) route MiMo under their own spec which has no thinking_style. As a result, `reasoning_effort="none"` was silently ignored: `{"thinking": {"type": "disabled"}}` was never injected and responses still contained reasoning_content. Mirror the Kimi pattern that already handles the same problem: add an explicit _MIMO_THINKING_MODELS allowlist (mimo-v2.5-pro, mimo-v2.5, mimo-v2-pro, mimo-v2-omni — per Xiaomi docs), an _is_mimo_thinking_model helper that strips publisher prefixes ("xiaomi/mimo-v2.5-pro" matches), and a sibling branch in _build_kwargs that injects the thinking payload by model name. mimo-v2-flash is intentionally excluded — it has no thinking mode. Also include MiMo in the explicit_thinking predicate so the reasoning_content backfill (#3554, #3584) covers the gateway path consistently with the direct path. Tests cover the gateway disable/enable signals, bare-slug fallback, flash exclusion, and a non-MiMo sanity check.	2026-05-16 20:46:34 +08:00
chengyongru	8a819dda1e	fix(agent): remove duplicate runtime context injection in mid-turn drain _drain_pending injected a full runtime context block (including goal state) into every injected user message, but the initial message already carries runtime context via build_messages(). This caused goal state to appear multiple times in the LLM context window within a single turn, wasting tokens (up to 4000 chars per duplicate). Now _drain_pending only passes the raw user content without runtime context. The initial turn message remains the sole carrier.	2026-05-16 20:46:08 +08:00

1 2 3 4 5 ...

2540 Commits