nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-22 17:42:24 +00:00

Author	SHA1	Message	Date
Xubin Ren	4f895e6307	refactor(providers): centralize gateway reasoning control	2026-05-21 14:41:50 +08:00
olgagaga	0cd2f626c0	fix(providers): inject OpenRouter `reasoning.effort` for thinking models Follow-up to #3851: that PR added `extra_body.thinking={type: disabled}` for MiMo via OpenRouter, but OR doesn't forward provider-specific thinking shapes to upstream — it strips unknown extra_body fields and uses its own unified `reasoning` parameter. So MiMo via OR kept thinking despite the injection (reproduced by @ClearPlume on #3851 with identical kwargs but provider switched from openrouter → xiaomi_mimo). For known thinking-capable models (Kimi, MiMo) routed via the openrouter spec, also inject `extra_body.reasoning = {effort: <effort>}` in OR's documented enum ("none"\|"minimal"\|"low"\|"medium"\|"high"\|"xhigh"). OR translates this to the upstream model's native shape. Existing tests updated to expect both fields on the OR path. The direct xiaomi_mimo and moonshot paths are unchanged (the new branch is gated on spec.name == "openrouter"). Flash and non-MiMo models on OR continue to receive no injection.	2026-05-21 14:41:50 +08:00
chengyongru	e2b51fa5dc	fix(weixin): prevent silent message drops from poll exceptions and expired tokens - Remove suppress(Exception) from poll loop and message processing; add logger.exception so inbound errors are visible. - Check both ret and errcode on send to avoid silent drops when iLink returns ret != 0 with errcode == 0. - Proactively refresh context_token via getconfig before sending if the cached token is older than 60s. This prevents message loss on long agent turns and cron pushes without relying on complex retry logic. Refs: openclaw/openclaw#61174, NousResearch/hermes-agent#21011	2026-05-21 13:41:05 +08:00
Xubin Ren	3d3ebf1110	test(provider): cover duplicate streaming tool call ids	2026-05-21 12:28:24 +08:00
chengyongru	886e7e43d5	fix(signal): bypass base is_allowed for policy-approved messages Override _handle_message to publish directly to the bus for messages that have already passed _check_inbound_policy. The denied DM pairing path calls super()._handle_message() to issue pairing codes via the base class. This avoids cross-policy leakage where e.g. group open policy would cause is_allowed to incorrectly allow denied DM senders. Also includes: - SSE: strip one optional leading space after 'data:' per spec - Convert 20+ f-string log calls to loguru lazy formatting - Add end-to-end tests for DM/group routing through the full chain - Add cross-policy test (dm allowlist + group open) for pairing - Add Signal channel documentation to docs/chat-apps.md	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	b3d0d24a52	fix(signal): consult pairing store in is_allowed BaseChannel.is_allowed ORs is_approved (the pairing store) into the allow decision; the signal override dropped that step and only looked at config.allow_from. With the new DM-pairing flow in place, an approved-via-pairing sender's next message would have failed the allow check and triggered another pairing code in a loop. OR in a normalized check against the pairing store: walk each part of the pipe-joined sender_id through _normalize_signal_id and call is_approved for each variant, so an approval stored under one form (phone with/without "+", UUID/ACI) still matches when the next inbound uses a different form. Mirrors how slack.py:643 handles it. Also tightens the empty-allowlist warning to only fire when nothing else granted access, since pairing-store hits are now a valid path. Not part of the original review, but Comments 2 and 3 turn this latent gap into a broken round-trip — included so the pairing UX actually works.	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	dc33247671	fix(signal): route denied DMs through _handle_message for pairing code Previously _check_inbound_policy returned (False, chat_id) for DMs that failed the allowlist and the caller dropped them — so unapproved DM senders never saw a pairing code. Mirror Slack: when the policy gate denies a DM but dm.enabled is true, still call _handle_message(content="", is_dm=True) so BaseChannel can issue the pairing reply. Group denials stay a hard drop. Combined with the previous is_dm forwarding, unapproved DM senders now receive a pairing code through the standard flow. Addresses review comment on PR #3852.	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	d653f23aba	fix(signal): raise on signal-cli error response so send is retriable _send_http_request collapses every exception path into a {"error": ...} dict, so the if "error" in response branch inside send() is the only place where send failures surface. Logging-only there meant the ChannelManager retry mechanism never fired. Raise RuntimeError so the base-class retry path is exercised; the outer try/except already re-raises into the caller. Addresses review comment on PR #3852.	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	96767ca179	Cleanup	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	b300ea495f	fix(signal): normalize composite sender_ids in is_allowed too The base BaseChannel.is_allowed() does a literal ``sender_id in allow_from`` check, but Signal's sender_id is a pipe-joined composite of phone/UUID parts. After splitting an allowlist entry like ``+phone\|uuid`` into two separate entries, the per-DM gate accepted it but the base gate still denied because the composite sender string wasn't literally in the list. Override is_allowed on SignalChannel to delegate to _sender_matches_allowlist, which already splits both sides on ``\|`` and normalizes each part. _sender_matches_allowlist itself now also splits allowlist entries on ``\|`` so legacy composite entries keep working too. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	632f41e418	test(signal): cover markdown adjacency, nesting, and malformed input The existing markdown suite was strong on UTF-16 offsets and chunk redistribution but had no coverage for nested or adjacent styles, no test that an unmatched opener round-trips as plain text, and no test for the blockquote/inline-code interaction. Add six cases including the documented contiguous-BOLD output for `# wrap me`, which Signal renders as one visual span. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	9c486b90d5	test(signal): consolidate channel-capture setup into one factory Two test classes (TestHandleDataMessageDM, TestHandleDataMessageGroup) plus three TestCommandHandling tests each repeated the same handful of lines: build a channel, mock _handle_message to record kwargs, replace _start_typing with a no-op, paper over the assignment with type: ignore. Hoist the pattern into _make_channel_with_capture and call it from all five sites. Drops 30+ lines of duplication and 7 type: ignore comments. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	590ac99c8a	test(signal): cover SSE receive loop and the empty-phone start guard Previously the SSE loop and the empty-phone-number short-circuit in start() had zero coverage. Both now have tests: a fake httpx stream feeds canned SSE lines, exercising the valid-frame, invalid-JSON, non-200, and no-http-client paths; start() with an empty phone number is asserted to return without entering the HTTP loop. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	7733a7840e	refactor(signal): split _handle_data_message into policy and assembly helpers The receive-path handler was ~165 lines deep into nested DM/group policy checks, buffer mutations, mention stripping, attachment downloads, and final bus forwarding. Pull the policy gate out into _check_inbound_policy (returns (allow, chat_id), still appends to the group buffer once allowed) and the text+media construction into _assemble_inbound_content. The top-level method now reads as orchestration only. Add TestCheckInboundPolicy that exercises the helper directly across the DM/group policy permutations, including the buffer side effect, so the new seam is locked in. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	83aed43682	feat(signal): make signal-cli attachments directory configurable The inbound attachment loop hardcoded ~/.local/share/signal-cli/attachments as the source path. That is the daemon's default on Linux but not on macOS or Windows, and breaks if the daemon was launched with XDG_DATA_HOME set. Add SignalConfig.attachments_dir as an optional override. When unset the behavior is unchanged; when set the value is run through Path.expanduser() so ~ is honored. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	882d4139d7	fix(signal): normalize identifiers when matching DM allowlist The DM allowlist check split sender_id on '\|' and looked for raw membership in the allow_from list. Senders carry their phone number with a leading '+' but admins routinely write allowlist entries without it (or vice versa), and UUID/ACI matches were case-sensitive. Both forms now flow through _normalize_signal_id, so an entry like 19995550001 matches a sender +19995550001 and a UUID matches case-insensitively. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	ca72f6b6c9	refactor(signal): hygiene cleanups around constants, typing, and config - Hoist the cell-strip patterns to module level so they match the rest of the module's regex style and aren't reparsed on every call. - Type the markdown transform callback and the mention id walker so the inline Callable signature is no longer an untyped Any. - Add _HTTP_TIMEOUT_SECONDS alongside the other class-level tunables. - Reject group_message_buffer_size <= 0 in a Pydantic field_validator rather than silently disabling the buffer at write time. - Mark SignalConfig.allow_from as a computed_field so it shows up in model_dump() instead of being invisible to serialization. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	96eb3b7194	fix(signal): redistribute textStyle ranges across split message chunks split_message can break a long Signal payload into multiple JSON-RPC sends, but the previous code attached the full textStyle list only to chunk 0. Style ranges in later chunks were dropped, and ranges whose offsets pointed past chunk 0's end were sent as invalid metadata against chunk 0. Add _partition_styles, which rebases each range against the chunk it lives in (in UTF-16 code units, matching the markdown converter) and splits boundary-spanning ranges across the chunks they touch. Whitespace trimmed by split_message's lstrip is skipped so offsets stay aligned. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	8f6b7611a2	fix(signal): emit textStyle offsets in UTF-16 code units Signal's BodyRange (via signal-cli's textStyle) interprets start/length as UTF-16 code units, but the Phase-3 assembly used Python's len(), which counts code points. A single non-BMP character (e.g. an emoji) earlier in a message shifted every subsequent styled span left by one unit, dropping the last letter of bold/italic words. Track a running UTF-16 offset in the assembly loop and add regression tests covering emojis, supplementary CJK, ZWJ sequences, and a multi-section message that mirrors the reported failure. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:00:36 +08:00
Kaloyan Tenchov	8ec1025193	feat(signal): add Signal channel support Integrates signal-cli daemon via HTTP JSON-RPC as a nanobot channel. Supports DMs and group chats with open/allowlist access policies, markdown→Signal text style conversion, typing indicators, attachment handling, group message context buffering, and automatic reconnect with exponential backoff. Includes unit tests for channel lifecycle, message routing, mention detection, markdown conversion, and message splitting. Originally based on https://github.com/HKUDS/nanobot/pull/601.	2026-05-21 01:00:36 +08:00
Xubin Ren	eae51333ad	fix(providers): point Skywork at APIFree agent endpoint	2026-05-20 12:33:03 +08:00
moran	61ae869610	feat(providers): add APIFree support Add APIFree as a built-in OpenAI-compatible provider. APIFree offers agent-optimised models such as skywork-ai/skyclaw-v1 through an OpenAI-compatible API at https://api.apifree.ai/agent/v1. Changes: - Register apifree provider in the provider registry - Add config schema field - Add documentation with configuration example - Add provider tests, websocket channel tests, and webui tests - Add provider icon in settings UI	2026-05-20 12:33:03 +08:00
Xubin Ren	3eebe08dba	fix(exec): detach stdin for shell commands	2026-05-20 12:07:17 +08:00
Xubin Ren	38a5f09f02	refactor: preserve cold-start lazy boundaries	2026-05-20 12:02:23 +08:00
chengyongru	af9f8d54b8	perf: optimize gateway cold start from ~6.9s to ~460ms (#3918 ) Channel lazy load: discover_enabled() only imports enabled channel modules instead of all 18 modules with heavy SDKs (telegram, discord, slack, etc). discover_all() now delegates to discover_enabled(). Lazy OpenAI client: defer AsyncOpenAI() + httpx construction to _ensure_client() with asyncio.Lock double-checked locking. openai and httpx imports moved from module-level into _ensure_client(). Minor: lazy Nanobot/RunResult and CronService exports via __getattr__. Benchmark: 6910ms → 460ms (-93.3%)	2026-05-20 12:02:23 +08:00
Xubin Ren	1391aa3d57	fix(tests): make settings workspace path portable	2026-05-20 02:20:44 +08:00
Xubin Ren	e00220bdb6	feat(providers): add Skywork provider support	2026-05-20 02:20:44 +08:00
moran	2d302a006e	feat(image-generation): add StepFun provider support and StepPlan docs - Add StepFunImageGenerationClient with step-image-edit-2 / step-1x-medium support - Map aspect ratios to StepFun size strings (WxH order) - Add style_reference for step-1x-medium reference-image generation - Register in image gen provider registry (auto-discovered by nanobot.py) - Add 7 unit tests: payload, default size, explicit size, style_reference (1x/non-1x), missing key, no-images - Add StepFun section to docs/image-generation.md with provider config - Add StepPlan (订阅制) subsection with apiBase override example	2026-05-20 00:08:38 +08:00
Xubin Ren	cda1de863e	Merge remote-tracking branch 'origin/main' into codex/review-pr-3894 # Conflicts: # tests/utils/test_webui_transcript.py	2026-05-19 23:19:33 +08:00
Xubin Ren	57d5276da1	feat(webui): upgrade settings and sidebar controls (#3906 ) * feat(settings): expand settings api payload * feat(webui): build app-style settings center * feat(webui): add centered chat search dialog * fix(webui): shorten chat search label * fix(webui): center dialog entrance animation * fix(webui): simplify chat search results * fix(webui): tighten mobile settings navigation * feat(webui): persist sidebar state * feat(webui): add sidebar organization controls * refactor(webui): organize backend helpers * refactor(webui): remove utils compatibility shims * refactor(session): move shared webui helpers out of webui package * feat(webui): add image generation settings * style(webui): refine settings overview layout * fix(webui): localize settings zh-CN copy * style(webui): add settings status indicators * feat(webui): show sidebar run indicators * fix(webui): persist sidebar run indicators * fix(webui): highlight settings pending status * fix(webui): align settings test with provider update * fix(utils): preserve legacy webui helper imports	2026-05-19 22:42:38 +08:00
Xubin Ren	44b7bba9bd	fix(image-generation): align media delivery and mime handling	2026-05-19 15:35:19 +08:00
chengyongru	d7a73093a8	refactor: remove dead image media attachment code - Remove generated_image_paths_from_messages() and _extract_text_payload() from artifacts.py (no runtime callers) - Remove session_attachments.py entirely (merge_turn_media_into_last_assistant and stage_media_paths_for_session_replay had no runtime callers) - Remove test_session_media_persist.py and the orphaned test in test_artifacts.py	2026-05-19 15:35:19 +08:00
chengyongru	fc1c8ea770	fix(image-generation): let LLM deliver images via message tool instead of runtime media attachment The runtime media-attachment mechanism was broken for streaming channels (e.g. WebSocket): the _streamed flag caused _send_once to skip the final OutboundMessage that carried generated media, so images were never delivered. Rather than adding complex coordination between streaming and media delivery, delegate image delivery to the LLM: after generate_image returns artifact paths, the next_step prompt now instructs the LLM to call the message tool with the paths in the media parameter. This works uniformly across all channels, streaming or not. Remove generated_media from TurnContext, _assemble_outbound, and _state_save. Update prompts in identity.md, SKILL.md, message tool description, and artifacts.py to reflect the new flow.	2026-05-19 15:35:19 +08:00
chengyongru	c588d56a77	refactor(image-generation): introduce provider registry to eliminate manual wiring Adds ImageGenerationProvider ABC with shared __init__, _http_post(), and _require_images(). Introduces _IMAGE_GEN_PROVIDERS registry with register/get/image_gen_provider_configs() helpers. Four existing providers (OpenRouter, AIHubMix, Gemini, MiniMax) now inherit from the base class and self-register. Adding a new provider only requires writing one class + one registration line. Eliminates if/else chains in the tool dispatch and hardcoded provider config dicts in commands.py (3 sites) and nanobot.py (1 site). Fixes the agent CLI command missing image_generation_provider_configs entirely. Also simplifies test monkeypatch targets to patch the registry lookup.	2026-05-19 15:35:19 +08:00
Kaloyan Tenchov	7367741ac1	feat(image-generation): add Gemini provider support Adds GeminiImageGenerationClient covering both Imagen 4 (:predict) and Gemini Flash (:generateContent), wires the gemini ProviderConfig through the SDK, API server, and gateway entry points, and updates the image-generation docs and skill. Errors from the Gemini endpoints are logged and surface with the HTTP status and parsed message instead of an empty string. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-19 15:35:19 +08:00
Xubin Ren	0a5606b409	fix webui tool trace dedupe	2026-05-19 13:12:19 +08:00
Xubin Ren	c4293a7835	feat(providers): add Ant Ling support	2026-05-18 22:13:52 +08:00
Xubin Ren	40c1d83b32	fix(ci): update live file edit test expectations	2026-05-18 22:01:33 +08:00
Xubin Ren	7e2dbdef7d	feat(webui): stream live file edit events	2026-05-18 22:01:33 +08:00
chengyongru	d4ade8f680	feat(cli): add Model Preset wizard to onboard Extract the [M] Model Presets interactive CRUD screen from PR #3696 and adapt it to the current main branch schema (fallback_models instead of fallback_presets). Adds preset cache, field handlers for model_preset/provider/fallback_models, and 9 new tests.	2026-05-18 15:13:41 +08:00
Xubin Ren	ba38f90832	Merge PR #3877 : feat(webui+agent): optimize streaming, activity rendering, and runtime sync feat(webui+agent): optimize streaming, activity rendering, and runtime sync	2026-05-18 02:04:36 +08:00
Xubin Ren	eb3aed359f	Refine file edit progress gating	2026-05-18 01:59:55 +08:00
Xubin Ren	4445fcc8b9	refactor(cli): localize reasoning buffer state	2026-05-18 01:34:08 +08:00
Xubin Ren	de8761f25a	fix(test): add gateway llm runtime fake	2026-05-18 01:19:45 +08:00
Xubin Ren	8708ccea86	Merge branch 'main' of https://github.com/HKUDS/nanobot into codex/webui-performance	2026-05-18 01:18:28 +08:00
Xubin Ren	eb0ff3ad1d	fix(memory): refresh session before empty guard	2026-05-18 01:16:47 +08:00
chengyongru	c58a360b25	fix(test): seed get_or_create mock for session-refresh guard compatibility	2026-05-18 01:16:47 +08:00
chengyongru	5bb94edc99	refactor(autocompact): delegate _archive to Consolidator.compact_idle_session Replace AutoCompact._archive() direct session mutation with delegation to Consolidator.compact_idle_session(). Remove _split_unconsolidated() method since that logic now lives inside compact_idle_session. All session mutation for idle compaction now goes through the Consolidator's lock, eliminating the race condition between background token consolidation and idle TTL compaction. Changes: - autocompact.py: rewrite _archive() to call compact_idle_session, remove _split_unconsolidated(), clean up unused imports - test_autocompact_unit.py: replace TestArchive/TestSplitUnconsolidated with TestArchiveDelegates that verifies delegation behavior - test_auto_compact.py: convert all consolidator.archive mocks to consolidator.compact_idle_session mocks via _make_fake_compact helper	2026-05-18 01:16:47 +08:00
chengyongru	888d54790d	fix(memory): add session-refresh guard to maybe_consolidate_by_tokens When background consolidation runs with a stale session reference (captured before AutoCompact replaced the session via compact_idle_session), it could operate on outdated data. Now, after acquiring the per-session lock, the method refreshes its session reference from SessionManager.get_or_create(). If the session was replaced, it swaps in the fresh reference before doing any consolidation work. This prevents a race where AutoCompact truncates an idle session while a background maybe_consolidate_by_tokens call is in flight with the old session object.	2026-05-18 01:16:47 +08:00
chengyongru	48d35bd2d9	feat(consolidator): add compact_idle_session method with lock-protected truncation Add Consolidator.compact_idle_session(session_key, max_suffix=8) that performs hard-truncation of idle sessions under the per-session consolidation lock. This is the single lock-protected path for AutoCompact to use instead of modifying session state directly, fixing the race condition between AutoCompact and Consolidator. Behavior: - Acquires per-session consolidation lock - Invalidates cache and reloads fresh from disk - Splits unconsolidated tail into archive prefix and retained suffix - Archives prefix via LLM (with raw_archive fallback on failure) - Persists _last_summary in session metadata on success - Returns summary text, None on LLM failure, or '' if nothing to archive Tests: 6 new tests covering prefix archival, empty session timestamp refresh, (nothing) summary exclusion, LLM failure fallback, last_consolidated offset, and lock acquisition verification.	2026-05-18 01:16:47 +08:00

1 2 3 4 5 ...

898 Commits