nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-21 00:52:34 +00:00

Author	SHA1	Message	Date
chengyongru	f44ee6cb27	fix(signal): bypass base is_allowed for policy-approved messages Override _handle_message to publish directly to the bus for messages that have already passed _check_inbound_policy. The denied DM pairing path calls super()._handle_message() to issue pairing codes via the base class. This avoids cross-policy leakage where e.g. group open policy would cause is_allowed to incorrectly allow denied DM senders. Also includes: - SSE: strip one optional leading space after 'data:' per spec - Convert 20+ f-string log calls to loguru lazy formatting - Add end-to-end tests for DM/group routing through the full chain - Add cross-policy test (dm allowlist + group open) for pairing - Add Signal channel documentation to docs/chat-apps.md	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	5be3df1d6f	fix(signal): consult pairing store in is_allowed BaseChannel.is_allowed ORs is_approved (the pairing store) into the allow decision; the signal override dropped that step and only looked at config.allow_from. With the new DM-pairing flow in place, an approved-via-pairing sender's next message would have failed the allow check and triggered another pairing code in a loop. OR in a normalized check against the pairing store: walk each part of the pipe-joined sender_id through _normalize_signal_id and call is_approved for each variant, so an approval stored under one form (phone with/without "+", UUID/ACI) still matches when the next inbound uses a different form. Mirrors how slack.py:643 handles it. Also tightens the empty-allowlist warning to only fire when nothing else granted access, since pairing-store hits are now a valid path. Not part of the original review, but Comments 2 and 3 turn this latent gap into a broken round-trip — included so the pairing UX actually works.	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	79c23787f6	fix(signal): join multi-line SSE data with newline per spec Per the SSE spec, multiple data: lines within a single event must be joined with \n before parsing. signal-cli emits single-line JSON so this was latent, but the joining was wrong. Addresses review comment on PR #3852.	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	b647aa5f47	fix(signal): route denied DMs through _handle_message for pairing code Previously _check_inbound_policy returned (False, chat_id) for DMs that failed the allowlist and the caller dropped them — so unapproved DM senders never saw a pairing code. Mirror Slack: when the policy gate denies a DM but dm.enabled is true, still call _handle_message(content="", is_dm=True) so BaseChannel can issue the pairing reply. Group denials stay a hard drop. Combined with the previous is_dm forwarding, unapproved DM senders now receive a pairing code through the standard flow. Addresses review comment on PR #3852.	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	a9a8bdcef6	fix(signal): pass is_dm to _handle_message so DM pairing flow runs BaseChannel._handle_message uses is_dm to decide whether to issue a pairing code when is_allowed rejects the sender. Without it the base class treats every denied message as a group message and silently drops it. Forward is_dm=not is_group_message so unapproved DM users get a pairing code through the standard flow. This change only takes effect once denied DMs actually reach _handle_message (next commit); on its own it is a no-op since the policy gate still short-circuits before this call. Addresses review comment on PR #3852.	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	2d81cc0ae1	fix(signal): raise on signal-cli error response so send is retriable _send_http_request collapses every exception path into a {"error": ...} dict, so the if "error" in response branch inside send() is the only place where send failures surface. Logging-only there meant the ChannelManager retry mechanism never fired. Raise RuntimeError so the base-class retry path is exercised; the outer try/except already re-raises into the caller. Addresses review comment on PR #3852.	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	aed6b6967c	Cleanup	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	3874b3acf4	fix(signal): normalize composite sender_ids in is_allowed too The base BaseChannel.is_allowed() does a literal ``sender_id in allow_from`` check, but Signal's sender_id is a pipe-joined composite of phone/UUID parts. After splitting an allowlist entry like ``+phone\|uuid`` into two separate entries, the per-DM gate accepted it but the base gate still denied because the composite sender string wasn't literally in the list. Override is_allowed on SignalChannel to delegate to _sender_matches_allowlist, which already splits both sides on ``\|`` and normalizes each part. _sender_matches_allowlist itself now also splits allowlist entries on ``\|`` so legacy composite entries keep working too. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	01725bab11	test(signal): cover markdown adjacency, nesting, and malformed input The existing markdown suite was strong on UTF-16 offsets and chunk redistribution but had no coverage for nested or adjacent styles, no test that an unmatched opener round-trips as plain text, and no test for the blockquote/inline-code interaction. Add six cases including the documented contiguous-BOLD output for `# wrap me`, which Signal renders as one visual span. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	a786e3d225	test(signal): consolidate channel-capture setup into one factory Two test classes (TestHandleDataMessageDM, TestHandleDataMessageGroup) plus three TestCommandHandling tests each repeated the same handful of lines: build a channel, mock _handle_message to record kwargs, replace _start_typing with a no-op, paper over the assignment with type: ignore. Hoist the pattern into _make_channel_with_capture and call it from all five sites. Drops 30+ lines of duplication and 7 type: ignore comments. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	626f262121	test(signal): cover SSE receive loop and the empty-phone start guard Previously the SSE loop and the empty-phone-number short-circuit in start() had zero coverage. Both now have tests: a fake httpx stream feeds canned SSE lines, exercising the valid-frame, invalid-JSON, non-200, and no-http-client paths; start() with an empty phone number is asserted to return without entering the HTTP loop. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	7caf492ae2	refactor(signal): split _handle_data_message into policy and assembly helpers The receive-path handler was ~165 lines deep into nested DM/group policy checks, buffer mutations, mention stripping, attachment downloads, and final bus forwarding. Pull the policy gate out into _check_inbound_policy (returns (allow, chat_id), still appends to the group buffer once allowed) and the text+media construction into _assemble_inbound_content. The top-level method now reads as orchestration only. Add TestCheckInboundPolicy that exercises the helper directly across the DM/group policy permutations, including the buffer side effect, so the new seam is locked in. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	9aa2ab1657	feat(signal): make signal-cli attachments directory configurable The inbound attachment loop hardcoded ~/.local/share/signal-cli/attachments as the source path. That is the daemon's default on Linux but not on macOS or Windows, and breaks if the daemon was launched with XDG_DATA_HOME set. Add SignalConfig.attachments_dir as an optional override. When unset the behavior is unchanged; when set the value is run through Path.expanduser() so ~ is honored. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	971b774282	refactor(signal): wrap top-level receive handler with _safe_handle Replace the inline try/except at the end of _handle_receive_notification with a small async context manager that swallows the exception, logs self.logger.error with the offending payload's repr (bounded to 200 chars), and attaches the traceback via logger.opt(exception=True). The previous log line only carried `e`, so diagnosing a bad envelope from production logs required correlating timestamps. The wrapper is generic so future receive/dispatch sites can adopt it; for now only this site uses it. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	1377759705	fix(signal): normalize identifiers when matching DM allowlist The DM allowlist check split sender_id on '\|' and looked for raw membership in the allow_from list. Senders carry their phone number with a leading '+' but admins routinely write allowlist entries without it (or vice versa), and UUID/ACI matches were case-sensitive. Both forms now flow through _normalize_signal_id, so an entry like 19995550001 matches a sender +19995550001 and a UUID matches case-insensitively. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	d56bafa6d0	refactor(signal): hygiene cleanups around constants, typing, and config - Hoist the cell-strip patterns to module level so they match the rest of the module's regex style and aren't reparsed on every call. - Type the markdown transform callback and the mention id walker so the inline Callable signature is no longer an untyped Any. - Add _HTTP_TIMEOUT_SECONDS alongside the other class-level tunables. - Reject group_message_buffer_size <= 0 in a Pydantic field_validator rather than silently disabling the buffer at write time. - Mark SignalConfig.allow_from as a computed_field so it shows up in model_dump() instead of being invisible to serialization. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	6ec6c9bb83	fix(signal): redistribute textStyle ranges across split message chunks split_message can break a long Signal payload into multiple JSON-RPC sends, but the previous code attached the full textStyle list only to chunk 0. Style ranges in later chunks were dropped, and ranges whose offsets pointed past chunk 0's end were sent as invalid metadata against chunk 0. Add _partition_styles, which rebases each range against the chunk it lives in (in UTF-16 code units, matching the markdown converter) and splits boundary-spanning ranges across the chunks they touch. Whitespace trimmed by split_message's lstrip is skipped so offsets stay aligned. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	8a2a5eecdd	fix(signal): emit textStyle offsets in UTF-16 code units Signal's BodyRange (via signal-cli's textStyle) interprets start/length as UTF-16 code units, but the Phase-3 assembly used Python's len(), which counts code points. A single non-BMP character (e.g. an emoji) earlier in a message shifted every subsequent styled span left by one unit, dropping the last letter of bold/italic words. Track a running UTF-16 offset in the assembly loop and add regression tests covering emojis, supplementary CJK, ZWJ sequences, and a multi-section message that mirrors the reported failure. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	08154b4374	fix(signal): drop duplicate self in unconfigured-account log call Addresses review feedback on HKUDS/nanobot#3852: self.self.logger.error would crash if the phone_number guard ever fired.	2026-05-20 22:57:49 +08:00
Kaloyan Tenchov	880097acd5	feat(signal): add Signal channel support Integrates signal-cli daemon via HTTP JSON-RPC as a nanobot channel. Supports DMs and group chats with open/allowlist access policies, markdown→Signal text style conversion, typing indicators, attachment handling, group message context buffering, and automatic reconnect with exponential backoff. Includes unit tests for channel lifecycle, message routing, mention detection, markdown conversion, and message splitting. Originally based on https://github.com/HKUDS/nanobot/pull/601.	2026-05-20 22:57:49 +08:00
chengyongru	e02615c93d	Merge branch 'main' into nightly	2026-05-18 18:05:29 +08:00
Kaloyan Tenchov	e9259e680e	feat(image-generation): add Gemini provider support Adds GeminiImageGenerationClient covering both Imagen 4 (:predict) and Gemini Flash (:generateContent), wires the gemini ProviderConfig through the SDK, API server, and gateway entry points, and updates the image-generation docs and skill. Errors from the Gemini endpoints are logged and surface with the HTTP status and parsed message instead of an empty string. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-18 15:25:03 +08:00
yaotutu	a5b85a3d6b	feat: add MiniMax image generation provider support Add MiniMaxImageGenerationClient with support for: - Text-to-image generation via MiniMax image-01 model - Reference image support (subject_reference) - Aspect ratio selection - Proper error handling aligned with existing providers Wire up MiniMax provider config in ImageGenerationTool, gateway, serve, and Nanobot class.	2026-05-18 15:14:45 +08:00
chengyongru	d4ade8f680	feat(cli): add Model Preset wizard to onboard Extract the [M] Model Presets interactive CRUD screen from PR #3696 and adapt it to the current main branch schema (fallback_models instead of fallback_presets). Adds preset cache, field handlers for model_preset/provider/fallback_models, and 9 new tests.	2026-05-18 15:13:41 +08:00
chengyongru	28d0f8560e	fix(webui): preserve single newlines in markdown rendering Add remark-breaks plugin so that single newlines in assistant messages (such as /help output) render as line breaks instead of being collapsed into a single paragraph by standard markdown behavior.	2026-05-18 15:12:27 +08:00
Xubin Ren	ba38f90832	Merge PR #3877 : feat(webui+agent): optimize streaming, activity rendering, and runtime sync feat(webui+agent): optimize streaming, activity rendering, and runtime sync	2026-05-18 02:04:36 +08:00
Xubin Ren	eb3aed359f	Refine file edit progress gating	2026-05-18 01:59:55 +08:00
Xubin Ren	4445fcc8b9	refactor(cli): localize reasoning buffer state	2026-05-18 01:34:08 +08:00
liyazhou	b67205f5aa	fix(cli): buffer reasoning tokens to avoid one-token-per-line display	2026-05-18 01:34:08 +08:00
Xubin Ren	de8761f25a	fix(test): add gateway llm runtime fake	2026-05-18 01:19:45 +08:00
Xubin Ren	8708ccea86	Merge branch 'main' of https://github.com/HKUDS/nanobot into codex/webui-performance	2026-05-18 01:18:28 +08:00
Xubin Ren	eb0ff3ad1d	fix(memory): refresh session before empty guard	2026-05-18 01:16:47 +08:00
chengyongru	c58a360b25	fix(test): seed get_or_create mock for session-refresh guard compatibility	2026-05-18 01:16:47 +08:00
chengyongru	5bb94edc99	refactor(autocompact): delegate _archive to Consolidator.compact_idle_session Replace AutoCompact._archive() direct session mutation with delegation to Consolidator.compact_idle_session(). Remove _split_unconsolidated() method since that logic now lives inside compact_idle_session. All session mutation for idle compaction now goes through the Consolidator's lock, eliminating the race condition between background token consolidation and idle TTL compaction. Changes: - autocompact.py: rewrite _archive() to call compact_idle_session, remove _split_unconsolidated(), clean up unused imports - test_autocompact_unit.py: replace TestArchive/TestSplitUnconsolidated with TestArchiveDelegates that verifies delegation behavior - test_auto_compact.py: convert all consolidator.archive mocks to consolidator.compact_idle_session mocks via _make_fake_compact helper	2026-05-18 01:16:47 +08:00
chengyongru	888d54790d	fix(memory): add session-refresh guard to maybe_consolidate_by_tokens When background consolidation runs with a stale session reference (captured before AutoCompact replaced the session via compact_idle_session), it could operate on outdated data. Now, after acquiring the per-session lock, the method refreshes its session reference from SessionManager.get_or_create(). If the session was replaced, it swaps in the fresh reference before doing any consolidation work. This prevents a race where AutoCompact truncates an idle session while a background maybe_consolidate_by_tokens call is in flight with the old session object.	2026-05-18 01:16:47 +08:00
chengyongru	48d35bd2d9	feat(consolidator): add compact_idle_session method with lock-protected truncation Add Consolidator.compact_idle_session(session_key, max_suffix=8) that performs hard-truncation of idle sessions under the per-session consolidation lock. This is the single lock-protected path for AutoCompact to use instead of modifying session state directly, fixing the race condition between AutoCompact and Consolidator. Behavior: - Acquires per-session consolidation lock - Invalidates cache and reloads fresh from disk - Splits unconsolidated tail into archive prefix and retained suffix - Archives prefix via LLM (with raw_archive fallback on failure) - Persists _last_summary in session metadata on success - Returns summary text, None on LLM failure, or '' if nothing to archive Tests: 6 new tests covering prefix archival, empty session timestamp refresh, (nothing) summary exclusion, LLM failure fallback, last_consolidated offset, and lock acquisition verification.	2026-05-18 01:16:47 +08:00
Xubin Ren	fce1550814	fix(webui): refresh bootstrap token before expiry	2026-05-18 00:53:36 +08:00
voidborne-d	bf8a6e35fd	docs(deployment): match docker run gateway example to docker-compose.yml (refs #3873 ) The `docker run` example for `gateway` in `docs/deployment.md` had drifted from the canonical configuration in `docker-compose.yml`: - It omitted the security flags that `docker-compose.yml` already declares (`cap_drop: ALL` + `cap_add: SYS_ADMIN` + unconfined apparmor/seccomp). These are required whenever `tools.exec.sandbox: "bwrap"` is enabled, because bwrap needs CAP_SYS_ADMIN for user namespaces; without them bwrap exits with `clone3: Operation not permitted` and exec tools silently fail. - It omitted `-p 8765:8765`, even though both the bundled `docker-compose.yml` and `Dockerfile` (`EXPOSE 18790 8765`) already expose the WebSocket channel / WebUI port; users following the docs would get a reachable gateway health endpoint but an unreachable WebUI. This change keeps the two paths in sync so anyone reading deployment.md and using `docker run` directly gets the same security posture and port surface as the Compose path. Also adds a short `!IMPORTANT` note documenting that `gateway.host` and `channels.websocket.host` default to `127.0.0.1` (set in `nanobot/config/schema.py:GatewayConfig`). Docker `-p` cannot forward to the container's loopback interface, so the user must set both binds to `0.0.0.0` in `config.json` for the published ports to actually be reachable. This is the symptom reported as items 2 + 3 of #3873; items 1 + 4 of that issue are already resolved on `main` (`Dockerfile` line 49 already exposes both ports, and README.md lines 218-220 already reflect that the WebUI ships in the wheel). Docs only, no code changes. Signed-off-by: voidborne-d <258577966+voidborne-d@users.noreply.github.com>	2026-05-18 00:45:49 +08:00
Xubin Ren	f017e209da	docs(configuration): align Docker env-file example	2026-05-18 00:45:34 +08:00
olgagaga	5a34504b76	docs(configuration): expand "Environment Variables for Secrets" section - Note that any string field supports ${VAR_NAME} and resolved values are never written back to disk. - Document the failure mode for unset variables. - Add MCP (stdio env + HTTP headers) and web-search examples. - Add Docker, direnv, and secret-manager (1Password / pass / Bitwarden) delivery patterns alongside the existing systemd example. - Replace plaintext apiKey values in tools.web.search examples (Brave, Tavily, Jina, Kagi, Olostep) with ${PROVIDER_API_KEY} placeholders so the docs stop modelling the anti-pattern. - Cross-link from the Security section. Refs: HKUDS/nanobot#2172	2026-05-18 00:45:34 +08:00
Xubin Ren	af26ed0041	fix(heartbeat): remove unused runtime import	2026-05-18 00:40:31 +08:00
Xubin Ren	112f40ad67	fix(agent): refresh llm runtime for background tasks	2026-05-18 00:35:12 +08:00
Xubin Ren	2f323e24c1	fix(webui): polish session titles and status	2026-05-17 23:52:50 +08:00
Xubin Ren	361f31c0e4	fix(webui): use portal file reference tooltips	2026-05-17 23:52:29 +08:00
Xubin Ren	945f208d38	feat(webui): render file edit activity	2026-05-17 23:52:14 +08:00
Xubin Ren	c8bb04a8fe	feat(webui): persist agent activity events	2026-05-17 23:51:52 +08:00
Xubin Ren	4b5de66c58	Polish WebUI streaming and provider settings	2026-05-17 17:41:33 +08:00
Xubin Ren	9340567f2d	Fix duplicate reasoning display	2026-05-17 17:11:38 +08:00
Xubin Ren	e5be4dac7a	Optimize WebUI streaming and long history rendering Batch stream deltas, window long transcripts, lazy-load syntax highlighting, and refine activity/composer interactions. Add title refresh retries plus tests for streaming, windowing, code blocks, and live activity behavior.	2026-05-17 17:04:57 +08:00
Xubin Ren	175b58e259	fix(docker): document bundled webui port	2026-05-17 15:51:04 +08:00

1 2 3 4 5 ...

2547 Commits