nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-06-13 14:23:58 +00:00

Author	SHA1	Message	Date
outlook84	a4a2c55120	feat(telegram): add webhook support and ordered message queue Introduce webhook mode for the Telegram channel and implement a session-based message reordering mechanism. Key changes: - Update `python-telegram-bot` dependency to include the `webhooks` extra. - Add `TelegramConfig` fields for webhook configuration, with validation rules for public HTTPS URLs and Telegram's secret token. - Implement `_enqueue_ordered_update` and `_drain_ordered_updates` in `TelegramChannel` to stage incoming messages and commands behind a short per-session reorder window, ensuring sequential delivery based on message and update IDs. - Configure `start_webhook` in `TelegramChannel.start()` when webhook mode is enabled. - Add unit tests for webhook config validations, webhook startup, and message reordering. - Document webhook configuration and reverse proxy details in `docs/chat-apps.md`.	2026-05-26 16:14:51 +08:00
moran	179acfe104	feat(providers): add Step Plan support Document how to use StepFun's Step Plan subscription endpoint with the existing `stepfun` provider by overriding `apiBase`, following the same pattern as the `zhipu` provider's coding plan documentation. - Base URL: `https://api.stepfun.com/step_plan/v1` (dedicated endpoint) - API Key: same `STEPFUN_API_KEY` as the regular `stepfun` provider - Models: `step-3.5-flash`, `step-3.5-flash-2603`, `step-router-v1` Changes: - `docs/configuration.md` — provider tip, and config example showing `apiBase` override on the existing `stepfun` provider Test: 488/488 provider tests passed.	2026-05-25 18:57:36 +08:00
outlook84	c433d60681	feat: Enhance OpenAI provider configuration with extraBody support and apiType validation	2026-05-25 01:23:36 +08:00
outlook84	d472595417	feat: Add OpenAI API type configuration and update provider settings	2026-05-25 01:23:36 +08:00
Xubin Ren	ec99232208	docs: fix Xiaomi MiMo token plan env key	2026-05-23 22:56:24 +08:00
honjiaxuan	43a1784c5f	docs: use xiaomi_mimo provider for MiMo token plan Replace standalone 'Token Plan' section with general Xiaomi MiMo section using the built-in xiaomi_mimo provider. Token plan becomes a note within the section, since it's just an apiBase override. Key changes: - Use xiaomi_mimo provider (auto-matches via 'mimo' keyword in model name) - Drop redundant provider field (auto-detected) - Add token plan tip to provider tips block - Restructure as general Xiaomi MiMo section with token plan as note	2026-05-23 22:56:24 +08:00
Xubin Ren	3d3ef586e7	docs(config): clarify exec timeout and transcription apiBase	2026-05-23 17:32:59 +08:00
Xubin Ren	5937236f9d	test(image-generation): tighten zhipu provider coverage	2026-05-23 17:06:36 +08:00
Jiajun Xie	3e6f9907fe	feat: Add Zhipu (智谱) image generation provider	2026-05-23 17:06:36 +08:00
Xubin Ren	f5534bcaa0	Merge origin/main into fix-ollama-image-generation	2026-05-22 21:15:42 +08:00
Xubin Ren	8281cd1946	test(providers): cover Novita gateway fallback	2026-05-21 16:16:32 +08:00
Alex-wuhu	e5476573f4	test(providers): align Novita provider coverage	2026-05-21 16:16:32 +08:00
Alex-wuhu	0d1d23b5fb	feat: add Novita AI provider	2026-05-21 16:16:32 +08:00
Haisam Abbas	84603f4cf2	Add Ollama image generation support	2026-05-21 12:06:08 +05:00
chengyongru	886e7e43d5	fix(signal): bypass base is_allowed for policy-approved messages Override _handle_message to publish directly to the bus for messages that have already passed _check_inbound_policy. The denied DM pairing path calls super()._handle_message() to issue pairing codes via the base class. This avoids cross-policy leakage where e.g. group open policy would cause is_allowed to incorrectly allow denied DM senders. Also includes: - SSE: strip one optional leading space after 'data:' per spec - Convert 20+ f-string log calls to loguru lazy formatting - Add end-to-end tests for DM/group routing through the full chain - Add cross-policy test (dm allowlist + group open) for pairing - Add Signal channel documentation to docs/chat-apps.md	2026-05-21 01:00:36 +08:00
Xubin Ren	eae51333ad	fix(providers): point Skywork at APIFree agent endpoint	2026-05-20 12:33:03 +08:00
moran	6194a9b919	docs(configuration): fix APIFree formatting — merge wrapped description into single line	2026-05-20 12:33:03 +08:00
moran	61ae869610	feat(providers): add APIFree support Add APIFree as a built-in OpenAI-compatible provider. APIFree offers agent-optimised models such as skywork-ai/skyclaw-v1 through an OpenAI-compatible API at https://api.apifree.ai/agent/v1. Changes: - Register apifree provider in the provider registry - Add config schema field - Add documentation with configuration example - Add provider tests, websocket channel tests, and webui tests - Add provider icon in settings UI	2026-05-20 12:33:03 +08:00
Xubin Ren	e00220bdb6	feat(providers): add Skywork provider support	2026-05-20 02:20:44 +08:00
moran	4dccee56a7	docs: translate StepPlan section from Chinese to English	2026-05-20 00:08:38 +08:00
moran	2d302a006e	feat(image-generation): add StepFun provider support and StepPlan docs - Add StepFunImageGenerationClient with step-image-edit-2 / step-1x-medium support - Map aspect ratios to StepFun size strings (WxH order) - Add style_reference for step-1x-medium reference-image generation - Register in image gen provider registry (auto-discovered by nanobot.py) - Add 7 unit tests: payload, default size, explicit size, style_reference (1x/non-1x), missing key, no-images - Add StepFun section to docs/image-generation.md with provider config - Add StepPlan (订阅制) subsection with apiBase override example	2026-05-20 00:08:38 +08:00
Xubin Ren	15dba8d080	Polish local provider docs	2026-05-19 22:15:09 +08:00
chengyongru	59548b0a04	docs(image-generation): collapse redundant Quick Setup examples Keep one minimal OpenRouter example and link to Provider Notes for AIHubMix, MiniMax, and Gemini configuration.	2026-05-19 15:35:19 +08:00
chengyongru	99e4d25d4c	docs(image-generation): add MiniMax to docs and skill Updates docs/image-generation.md and skills/image-generation/SKILL.md to include MiniMax configuration examples, supported aspect ratios, and troubleshooting references. Also updates the supported provider list to include minimax alongside openrouter, aihubmix, and gemini.	2026-05-19 15:35:19 +08:00
Kaloyan Tenchov	7367741ac1	feat(image-generation): add Gemini provider support Adds GeminiImageGenerationClient covering both Imagen 4 (:predict) and Gemini Flash (:generateContent), wires the gemini ProviderConfig through the SDK, API server, and gateway entry points, and updates the image-generation docs and skill. Errors from the Gemini endpoints are logged and surface with the HTTP status and parsed message instead of an empty string. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-19 15:35:19 +08:00
Xubin Ren	c4293a7835	feat(providers): add Ant Ling support	2026-05-18 22:13:52 +08:00
voidborne-d	bf8a6e35fd	docs(deployment): match docker run gateway example to docker-compose.yml (refs #3873 ) The `docker run` example for `gateway` in `docs/deployment.md` had drifted from the canonical configuration in `docker-compose.yml`: - It omitted the security flags that `docker-compose.yml` already declares (`cap_drop: ALL` + `cap_add: SYS_ADMIN` + unconfined apparmor/seccomp). These are required whenever `tools.exec.sandbox: "bwrap"` is enabled, because bwrap needs CAP_SYS_ADMIN for user namespaces; without them bwrap exits with `clone3: Operation not permitted` and exec tools silently fail. - It omitted `-p 8765:8765`, even though both the bundled `docker-compose.yml` and `Dockerfile` (`EXPOSE 18790 8765`) already expose the WebSocket channel / WebUI port; users following the docs would get a reachable gateway health endpoint but an unreachable WebUI. This change keeps the two paths in sync so anyone reading deployment.md and using `docker run` directly gets the same security posture and port surface as the Compose path. Also adds a short `!IMPORTANT` note documenting that `gateway.host` and `channels.websocket.host` default to `127.0.0.1` (set in `nanobot/config/schema.py:GatewayConfig`). Docker `-p` cannot forward to the container's loopback interface, so the user must set both binds to `0.0.0.0` in `config.json` for the published ports to actually be reachable. This is the symptom reported as items 2 + 3 of #3873; items 1 + 4 of that issue are already resolved on `main` (`Dockerfile` line 49 already exposes both ports, and README.md lines 218-220 already reflect that the WebUI ships in the wheel). Docs only, no code changes. Signed-off-by: voidborne-d <258577966+voidborne-d@users.noreply.github.com>	2026-05-18 00:45:49 +08:00
Xubin Ren	f017e209da	docs(configuration): align Docker env-file example	2026-05-18 00:45:34 +08:00
olgagaga	5a34504b76	docs(configuration): expand "Environment Variables for Secrets" section - Note that any string field supports ${VAR_NAME} and resolved values are never written back to disk. - Document the failure mode for unset variables. - Add MCP (stdio env + HTTP headers) and web-search examples. - Add Docker, direnv, and secret-manager (1Password / pass / Bitwarden) delivery patterns alongside the existing systemd example. - Replace plaintext apiKey values in tools.web.search examples (Brave, Tavily, Jina, Kagi, Olostep) with ${PROVIDER_API_KEY} placeholders so the docs stop modelling the anti-pattern. - Cross-link from the Security section. Refs: HKUDS/nanobot#2172	2026-05-18 00:45:34 +08:00
Xubin Ren	c018c3fb6a	chore(release): bundle webui into wheel and prep 0.2.0	2026-05-16 13:38:11 +00:00
yanalialiuk	18072856ec	feat: add Atomic Chat as OpenAI-compatible local provider Register atomic_chat in the provider registry with default base URL http://localhost:1337/v1, schema field, docs, and config tests.	2026-05-16 12:14:33 +08:00
chengyongru	2d64aa7dd8	docs(pairing): consolidate access control docs — MECE allowFrom + pairing	2026-05-15 15:46:44 +08:00
chengyongru	8aff3d6151	docs(pairing): add user-friendly pairing documentation	2026-05-15 15:46:44 +08:00
Xubin Ren	5efd67919b	feat(runner): support fallback candidates Resolve fallbackModels as preset references or explicit inline provider configs so failover uses complete model settings without exposing fallback logic to the agent loop. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 15:34:03 +00:00
Xubin Ren	43db848db0	Revert "feat(runner): support structured fallback models" This reverts commit 02b059a616dc6dc82ad15282102c7b27a5a34e40.	2026-05-13 14:11:08 +00:00
Xubin Ren	02b059a616	feat(runner): support structured fallback models Bind fallback model chains to the active model configuration so defaults and presets do not inherit or merge fallback behavior implicitly. Require explicit fallback providers while preserving per-fallback generation overrides and context-window safety. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 13:57:30 +00:00
Xubin Ren	9d50f1b933	feat: polish trace delivery and slash menu UX Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 08:47:34 +00:00
Xubin Ren	458b4ba235	feat(reasoning): stream reasoning content as a first-class channel Reasoning now flows as its own stream — symmetric to the answer's ``delta`` / ``stream_end`` pair — instead of being shipped as one oversized progress message. This lets WebUI render a live "Thinking…" bubble that updates in place, then auto-collapses when the stream closes. Other channels remain plugin no-ops by default. ## Protocol New metadata: ``_reasoning_delta`` (chunk) and ``_reasoning_end`` (close marker). ChannelManager routes both to the dedicated plugin hooks below; the legacy one-shot ``_reasoning`` is kept for back-compat and BaseChannel expands it into a single delta + end pair so plugins only ever implement the streaming primitives. WebSocket emits two new events: - ``reasoning_delta`` (event, chat_id, text, optional stream_id) - ``reasoning_end`` (event, chat_id, optional stream_id) ## BaseChannel surface - ``send_reasoning_delta(chat_id, delta, metadata)`` — no-op default - ``send_reasoning_end(chat_id, metadata)`` — no-op default - ``send_reasoning(msg)`` — back-compat wrapper, base impl forwards to the streaming primitives A channel adds reasoning support by overriding the two streaming primitives. Telegram / Slack / Discord / Feishu / WeChat / Matrix keep the base no-ops until their bubble UIs are adapted; reasoning silently drops at dispatch, never as a stray text message. ## AgentHook Adds ``emit_reasoning_end`` to the hook lifecycle. ``_LoopHook`` tracks whether a reasoning segment is open and closes it on: - the first answer delta arriving (so the UI locks the bubble before the answer renders below), - ``on_stream_end``, - one-shot ``reasoning_content`` / ``thinking_blocks`` after a single non-streaming response. ## WebUI - ``UIMessage.reasoning`` is now a single accumulated string with a companion ``reasoningStreaming`` flag. - ``useNanobotStream`` consumes ``reasoning_delta`` / ``reasoning_end``; legacy ``kind: "reasoning"`` is auto-translated to a delta + end. - New ``ReasoningBubble``: shimmer header + auto-expanded while streaming, collapses to a clickable "Thinking" pill once closed, respects ``prefers-reduced-motion``. - Answer deltas adopt the reasoning placeholder so the bubble and the answer share one assistant row. ## Tests - ``tests/channels/test_channel_manager_reasoning.py`` — manager routes delta + end, drops on channel opt-out, expands one-shot back-compat. - ``tests/channels/test_websocket_channel.py`` — new ``reasoning_delta`` / ``reasoning_end`` frames, empty-chunk safety, no-subscriber safety, back-compat expansion. - ``tests/agent/test_runner_reasoning.py`` — runner closes the segment on streaming answer start and after one-shot reasoning. - WebUI ``useNanobotStream`` + ``message-bubble`` cover the new protocol and the shimmer styling. ## Docs ``docs/configuration.md`` and ``docs/websocket.md`` document the new events and the plugin contract. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 07:13:43 +00:00
Xubin Ren	a6b059d379	refactor(reasoning): make channel plugins own reasoning rendering Reasoning was being shipped to every channel as a generic progress message with a `_reasoning: true` flag. Two problems with that: 1. Channels without a low-emphasis UI primitive (Telegram, Slack, Discord, Feishu...) would dump raw model thoughts as ordinary replies, polluting the conversation. 2. The agent loop double-gated by inspecting `channels_config`, which coupled the loop to display policy. Treat reasoning as its own plugin action — `BaseChannel.send_reasoning` defaults to a documented no-op; channels that have a fitting affordance override. ChannelManager routes `_reasoning` outbounds to that method only when the channel opts in via `show_reasoning` (camelCase alias `showReasoning` mirrors `sendProgress`). Plugins that don't override silently drop reasoning — "no fit, no leak" is the contract. Reference implementation lands for WebSocket / WebUI: a new `kind: "reasoning"` frame, parked on the active assistant bubble as a collapsible `Thinking` group above the answer. CLI keeps its existing direct path (it doesn't go through the bus). `ChannelsConfig.show_reasoning` flips to `true` by default — only adapted channels surface anything, others stay quiet. Loop net diff is -3 lines: the `channels_config.show_reasoning` check moves out, leaving emit_reasoning a one-liner that publishes and trusts the channel to decide. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 06:27:53 +00:00
Xubin Ren	01fa362c03	Merge origin/main into feat/show-reasoning Resolves conflicts after main landed the state-machine turn refactor and the test_runner.py 9-file split: - nanobot/agent/loop.py: take main's `_state_build`/`_persist_user_message_early` flow; restore the `reasoning: bool` parameter on `_build_bus_progress_callback` so the loop hook can mark progress as reasoning-channel without coupling to the answer stream. - nanobot/cli/stream.py: keep main's configurable `bot_name`/`bot_icon` header while preserving the PR's `transient=True` Live + `self._console` routing + `_renderable()` final-render path that fixed TUI duplication. - tests/agent/test_runner.py was deleted on main and split into 9 focused files; relocated all 6 reasoning tests into a new `test_runner_reasoning.py` matching the new layout, deduplicated the per-test `ReasoningHook` boilerplate through a shared `_RecordingHook` helper. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-13 05:07:14 +00:00
Xubin Ren	352aaf0627	refactor(reasoning): unify reasoning extraction across providers Reasoning surfacing was split across three branches in runner.py plus two separate streaming buffers (loop hook and runner progress stream), with three independent display-side gates in the CLI. This collapsed the policy into one source of truth and fixed two real bugs: - Structured `reasoning_content` was suppressed whenever the answer was streamed, because the runner gated emission on `streamed_content`. Providers don't stream `reasoning_content`; it only arrives on the final response, so the answer stream and the reasoning channel are independent. Added `streamed_reasoning` to `AgentHookContext` to track the right bit. - `channels.showReasoning` was subordinated to `sendProgress`. They are orthogonal — turning off progress streaming shouldn't silence reasoning. Reworked the CLI gates accordingly. Single-helper consolidation: - `extract_reasoning(reasoning_content, thinking_blocks, content)` returns `(reasoning_text, cleaned_content)` with a defined fallback order: dedicated field → Anthropic thinking_blocks → inline `<think>`/`<thought>` tags. Models that expose none of these short-circuit to `(None, content)` — zero overhead. - `IncrementalThinkExtractor` replaces the ad-hoc `emit_incremental_think` function and its hand-rolled "emitted cursor" state in both the loop hook and the runner progress stream. Also documented the new `showReasoning` channel option in docs/configuration.md and noted its independence from sendProgress. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 17:14:19 +00:00
Xubin Ren	35f64cd828	docs(config): document model presets Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-12 20:06:22 +08:00
chengyongru	49f85f5c23	docs(schema,config): clarify reasoning_effort semantics for MiMo thinking mode - Update AgentDefaults.reasoning_effort comment to document "none" (disable) and None (preserve provider default). - Add configuration.md tip explaining MiMo thinking mode behavior.	2026-05-11 14:38:28 +08:00
Xubin Ren	e936ed48bd	feat: add image generation tool and WebUI mode Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 20:06:23 +08:00
Tim O'Brien	67875d7a15	fix: wire toolHintMaxLength through AgentLoop constructors The config field was added but never passed from config to AgentLoop. The value was always falling back to the default (40) regardless of what was set in config.json. Now passes tool_hint_max_length through all AgentLoop() call sites: - nanobot/nanobot.py (main bot) - nanobot/cli/commands.py (CLI agent, dev, webui commands) Also adds documentation in docs/configuration.md.	2026-05-06 21:18:39 +08:00
chengyongru	c30e4d86f3	refactor(agent): simplify subagent concurrency with rejection over semaphore Replace the asyncio.Semaphore queueing approach with a simple count check in SpawnTool.execute(). When the concurrency limit is reached, the tool returns an error string so the agent can perceive the reason and adjust its behavior instead of silently queueing. - Remove max_concurrent_subagents parameter threading through AgentLoop, commands.py, and nanobot.py - SubagentManager reads the limit directly from AgentDefaults - SpawnTool checks get_running_count() before calling spawn() - Simplify tests to verify rejection behavior	2026-05-05 22:22:04 +08:00
Xubin Ren	861fbb0dde	fix(provider): correct LongCat OpenAI base URL Use the SDK-ready /v1 base so LongCat chat completions hit the documented endpoint. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-02 01:52:04 +08:00
moranfong	051037ff08	feat(provider): add LongCat via OpenAI-compatible backend	2026-05-02 01:52:04 +08:00
Xubin Ren	306958d6e6	add native Bedrock Converse provider Made-with: Cursor	2026-05-01 18:52:03 +08:00
hanyuanling	0b111a0e0c	fix(channels): support per-channel progress controls	2026-04-29 16:43:09 +08:00

1 2

96 Commits