nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-06-15 15:24:06 +00:00

Author	SHA1	Message	Date
chengyongru	b232a52794	fix: tighten cron session deletion UX	2026-06-12 14:51:02 +08:00
chengyongru	c00371c761	docs: clarify streamed timeout fallback behavior maintainer edit: update fallback docs and provider docstring to describe the new stream-stall timeout recovery exception.	2026-06-10 18:10:44 +08:00
chengyongru	dadb35af49	feat(exec): add path prepend config	2026-06-10 18:09:57 +08:00
Moran	9c492143b4	search: add Bocha web search provider	2026-06-10 15:51:15 +08:00
moran	7930058348	feat(asr): add StepFun ASR SSE transcription provider - Add StepFunTranscriptionProvider class in nanobot/providers/transcription.py - New _post_stepfun_asr_with_retry() function handling SSE stream parsing (transcript.text.delta → transcript.text.done event sequence) - Register 'stepfun' in transcription_registry.py with default model stepaudio-2.5-asr - Reuse existing stepfun provider config (apiBase can point to Plan endpoint) - Add 17 tests covering SSE parsing, retry contract, empty-text edge case, and registry integration - Update docs/configuration.md with stepfun ASR documentation StepFun ASR uses a dedicated SSE endpoint (/v1/audio/asr/sse) rather than the chat-completions or Whisper multipart formats used by other providers. Users on Step Plan can set apiBase to the Plan endpoint.	2026-06-10 15:50:38 +08:00
chengyongru	4a58b83acc	docs: make onboarding friendlier for beginners (#4177) * docs: make onboarding friendlier for beginners * docs: build clearer documentation paths Maintainer edit: turn the onboarding follow-up into a layered docs structure for first-time setup, provider selection, troubleshooting, CLI reference, and source-level architecture. This keeps quick start focused while giving advanced users precise reference paths. * docs: render architecture flow with mermaid Maintainer edit: replace the ASCII architecture sketch with a GitHub-rendered Mermaid flowchart so the core runtime path is easier to scan in the PR and README docs. * docs: recommend model presets for model config Maintainer edit: make named modelPresets the primary model configuration path and expand fallback preset examples so string fallbacks are clearly preset names, not raw model IDs. * docs: document api base urls and langfuse setup Maintainer edit: explain when users need apiBase/base URL in quick start and provider docs, and add Langfuse tracing setup with troubleshooting links. * docs: use python module pip consistently Maintainer edit: keep install commands tied to the active Python interpreter by using python -m pip in the Azure optional dependency notes too. * docs: add non-technical getting started path Maintainer edit: add a wizard-first guide for users without terminal or JSON background, including a text TUI menu example and links from the main docs entrypoints. * docs: avoid hard-wrapped prose in user docs Maintainer edit: unwrap ordinary prose across user-facing documentation while preserving markdown structure, code blocks, tables, lists, and prompt/template files. * docs: keep desktop list continuations nested Maintainer edit: preserve list nesting after unwrapping prose in the desktop WebUI sync guide. 
* docs: add one-command installer Maintainer edit: add auditable macOS/Linux and Windows install scripts that install nanobot-ai and start the onboarding wizard, then document the commands in the main onboarding entrypoints. * docs: add installer dry run mode Maintainer edit: add --dry-run to the one-command installer scripts so users can preview Python detection, install source, pip command, and wizard behavior without changing their environment. * docs: clean installer error output Maintainer edit: make PowerShell installer failures print a concise Error: message instead of Write-Error call-site details. * docs: add provider setup cookbook Maintainer edit: add pasteable provider recipes for common hosted, local, fallback, runtime switching, and Langfuse setups, then link the cookbook from onboarding and troubleshooting entrypoints. * docs: address review feedback * docs: clarify reader paths * docs: explain terminal basics for beginners * docs: clarify wizard navigation * docs: avoid duplicate onboarding steps * docs: add setup status check * docs: explain status output * docs: remove provider recommendation wording * docs: explain status diagnostics * docs: reduce hard-wrapped guidance * docs: migrate config examples to presets * docs: clarify python command fallbacks * docs: improve installer failure recovery * docs: expand install troubleshooting * docs: cover installer download failures * docs: put stable install paths first * docs: add bundled webui quick path * docs: clarify provider-neutral setup * docs: clarify gateway setup for chat surfaces * docs: improve docs navigation paths * docs: add configuration quick jump * docs: clarify provider secret variables * chore: request PR review acknowledgement Empty commit: please read the PR review comments and reply on the PR to confirm that you have received them. This commit intentionally changes no files; it exists only to notify the remote Codex run so it can end its active goal. 
* docs: add README start here guide * docs: avoid provider recommendation wording * docs: guide next steps after first reply * docs: explain merging JSON snippets * docs: add CLI command chooser * docs: add configuration task map * docs: add deployment readiness guide * docs: simplify WebUI entry paths * docs: add provider recipe chooser * docs: fix provider factual references Update OpenRouter and LongCat model examples, align Bedrock guidance, and make fallback snippets schema-valid. Also correct group policy wording and image-generation provider lists to match the current code. * fix: keep PowerShell installer from closing caller shell * docs: mention self-guided configuration	2026-06-10 00:36:22 +08:00
chengyongru	56ce18167e	docs: clarify email post-action expunge fallback maintainer edit: clarify that postActionExpunge only allows the broad EXPUNGE fallback when UID-scoped expunge is unavailable or fails.	2026-06-09 14:50:59 +08:00
Flávio Veloso Soares	6de8d7f52e	feat(email): add postActionExpunge option to gate broad IMAP expunge	2026-06-09 14:50:59 +08:00
Flávio Veloso Soares	ec5460d23e	feat(email): add configurable post-action handling	2026-06-09 14:50:59 +08:00
comadreja	f3eb2aa08b	feat(transcription): add AssemblyAI as transcription provider Add AssemblyAI as a third transcription provider option alongside OpenAI and Groq. AssemblyAI offers better accuracy for certain audio types (distant voices, noisy environments) and serves as a reliable fallback when other providers struggle. Changes: - Add AssemblyAITranscriptionProvider class in providers/transcription.py - Add 'assemblyai' option in base channel's transcribe_audio() - Per-channel configuration via transcriptionProvider in config Usage: Set transcriptionProvider: 'assemblyai' and provide an AssemblyAI API key via transcriptionApiKey in the channel config.	2026-06-09 05:33:18 +08:00
NanoBot	c20ecc52d7	feat(transcription): add Xiaomi MiMo ASR provider (mimo-v2.5-asr) Add support for Xiaomi MiMo ASR as a third transcription backend alongside Groq and OpenAI Whisper. Xiaomi ASR uses the /v1/chat/completions endpoint with base64-encoded audio input, rather than the standard Whisper multipart upload format. Co-Authored-By: 连 <lian@tangping.homes>	2026-06-09 04:29:09 +08:00
Ilia Breitburg	0eb3010e40	feat(transcription): configurable STT model + OpenRouter provider Add a `transcriptionModel` channel setting and an OpenRouter transcription backend so voice messages can be transcribed through OpenRouter's speech-to-text endpoint (e.g. nvidia/parakeet-tdt-0.6b-v3, openai/whisper-1), alongside the existing Groq/OpenAI Whisper providers. - schema: add channels.transcriptionModel (None = provider default) - providers/transcription: extract a shared POST/retry skeleton; add a JSON+base64 OpenRouterTranscriptionProvider; make the STT model a constructor param on all providers instead of hardcoding it - channels: route transcriptionProvider="openrouter" and thread the model through the manager to each channel - docs + tests Only dedicated STT models work on OpenRouter's transcription endpoint; chat LLMs (e.g. google/gemini-3.5-flash) are rejected there. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-09 04:01:37 +08:00
Xubin Ren	9c81280300	feat(transcription): add shared voice input support (#4232) * feat(webui): add voice transcription input * feat(webui): render ANSI output in code blocks * refactor(webui): isolate voice recorder logic * refactor(transcription): keep websocket ingress thin * refactor(transcription): resolve channel audio settings on demand * style(webui): neutralize voice waveform color * feat(webui): add voice input tooltip * feat(webui): add voice input keyboard shortcut * fix(webui): distinguish voice shortcut platforms * fix(webui): place voice button after model selector * refactor(webui): share voice hold recording helpers * fix(desktop): allow microphone voice input * fix(webui): stabilize token usage month labels * feat(webui): show voice input on settings overview * fix(webui): label voice capability as recognition * fix(webui): align capability overview status * refactor(webui): isolate transcription socket handling * fix(webui): soften silent voice waveform * refactor(audio): clarify transcription service location * docs(transcription): clarify audio and provider boundaries * fix(exec): reduce session output polling flake	2026-06-09 01:08:49 +08:00
chengyongru	a73924f77e	docs: document MCP SSRF allowlist behavior Maintainer edit: explain that HTTP/SSE MCP now uses the shared SSRF guard before connecting and before following redirects, so local or private HTTP MCP endpoints require an explicit tools.ssrfWhitelist entry.	2026-06-08 16:03:57 +08:00
Xubin Ren	73353785a0	docs(sdk): document Nanobot teardown	2026-06-06 15:35:28 +08:00
Xubin Ren	935a37182d	docs(command): document /skill command	2026-06-05 18:48:51 +08:00
chengyongru	d435cb0b21	fix: harden custom image provider compatibility Maintainer edit: preserve provider-specific size hints for custom image generation endpoints while keeping the default 1K mapping compatible. Clarify the custom provider contract in docs and cover response_format/size overrides in tests.	2026-06-05 15:56:03 +08:00
chengyongru	ae17a79bdf	fix: harden custom image generation config Maintainer edit: require providers.custom.apiBase before making custom image requests and allow unauthenticated local endpoints by omitting Authorization when no apiKey is configured.	2026-06-05 15:56:03 +08:00
axelray-dev	748b28da01	feat(image): support custom image generation provider Addresses #4132. Add CustomImageGenerationClient for any OpenAI-compatible image generation API (POST {apiBase}/images/generations). Uses the existing providers.custom config slot. No schema changes required. Tests: 54 passed, ruff clean. Signed-off-by: axelray-dev <110029405+axelray-dev@users.noreply.github.com>	2026-06-05 15:56:03 +08:00
Kunal Karmakar	ba3fa38e97	Add support for Azure AAD based Auth	2026-06-05 01:17:34 +08:00
chengyongru	d1a94dae8a	refactor(dream): replace two-phase Dream class with simple cron + process_direct (#3990) * refactor(dream): replace two-phase Dream class with simple cron + process_direct - Remove the heavyweight Dream class (AgentRunner-based two-phase system) from nanobot/agent/memory.py - Delete dream_phase1.md and dream_phase2.md templates - New dream.md template serves as the consolidation prompt - Cron callback uses agent.process_direct(prompt, session_key="dream") instead of agent.dream.run() - Always performs git auto_commit after execution - /dream command updated to use process_direct + git commit - DreamConfig kept for backward compatibility; deprecated fields (model_override, max_batch_size, max_iterations, annotate_line_ages) are ignored but accepted in config - interval_h remains configurable via agents.defaults.dream.interval_h - Update tests and webui settings to match new architecture * feat(loop): add ephemeral mode to process_direct, skip history writes for Dream When ephemeral=True, _state_save skips enforce_file_cap (which calls raw_archive -> append_history) and consolidator.maybe_consolidate_by_tokens. This prevents Dream sessions from creating a positive feedback loop where they process their own output. The session IS still saved to disk. * fix(loop): skip extra hooks for ephemeral sessions (Dream) * feat(dream): per-run timestamped sessions with rotation for WebUI * test(config): restore DreamConfig schedule and alias tests * fix(dream): include LLM response summary in git auto-commit message The old two-phase Dream class included the Phase 1 analysis in the git commit message body. The new single-phase version lost this. Restore it by extracting resp.content from the process_direct return value and appending it to the commit message in both the cron handler and the /dream command. 
* fix(test): accept ephemeral kwarg in test_openai_api fake_process * refactor(dream): merge dream_session.py into MemoryStore The standalone dream_session.py module only contained three small helpers that all revolve around MemoryStore concerns (session keys, commit messages, file pruning). Fold them into MemoryStore as @staticmethod to reduce indirection and avoid a 35-line module with no independent reason to exist. * fix(test): address code review — patch correct instance, use actual function - Fix test_ephemeral_skips_raw_archive to patch loop.context.memory instead of the fixture's separate MemoryStore instance - Fix TestDreamCommitMessage to call MemoryStore.build_dream_commit_message instead of reimplementing the logic inline - Move Dream helpers in memory.py above the Consolidator section comment to avoid misleading visual boundary * fix(dream): gate cursor advancement and restrict tools maintainer edit: Dream now processes backlog from the oldest unprocessed entries, only advances the cursor after a completed ephemeral run, and uses a restricted file-only tool registry for background consolidation. * fix(dream): skip idle compact for dream sessions Dream runs use internal dream:* sessions that are pruned by Dream retention. Exclude them from AutoCompact scheduling, archive execution, and summary injection so idle-session compaction cannot truncate Dream transcripts. * fix(dream): keep batched history isolated * feat(dream): tag archived memory for single-phase Dream --------- Co-authored-by: Xubin Ren <52506698+Re-bin@users.noreply.github.com>	2026-06-02 22:46:47 +08:00
LZDQ	b1a3053ceb	Channel napcat by Claude	2026-06-02 14:10:10 +08:00
Xubin Ren	edf34d857a	search: add Volcengine web search provider	2026-06-02 13:55:12 +08:00
chengyongru	851150fcd8	docs: document DingTalk group user isolation	2026-06-01 23:01:19 +08:00
Xubin Ren	f309982bb0	chore(release): update version to 0.2.1	2026-06-01 16:51:24 +08:00
chengyongru	15c2bd25b3	refactor(heartbeat): remove Completed section and tighten section gating - Remove ## Completed section from HEARTBEAT.md template; completed tasks should be deleted, not accumulated - Change in_active_section from tri-state (None/True/False) to bool (True/False) so stray text before any ## heading no longer triggers heartbeat - Add test cases for stray pre-heading text and ## Notes section - Update docs/chat-commands.md to reference ## Active Tasks	2026-05-31 15:15:37 +08:00
mytechdream	68712fc489	fix(matrix): handle SAS device verification	2026-05-31 01:00:14 +08:00
hanyuanling	ec4f9e9857	Add document extraction channel toggle	2026-05-29 15:31:03 +08:00
chengyongru	fe2af64e04	refactor(heartbeat): migrate heartbeat service to cron-based auto-registration Remove standalone nanobot/heartbeat/ service and replace it with an auto-registered system cron job on gateway startup. Key behaviors preserved: - HeartbeatConfig (enabled, interval_s, keep_recent_messages) remains in GatewayConfig for backward compatibility. - On startup, if enabled, a system cron job "heartbeat" is registered with schedule derived from interval_s. - HEARTBEAT.md is checked on each tick; empty/template-identical files skip to avoid wasting LLM calls. - Post-run evaluate_response and session history truncation (keep_recent_messages) are retained. - Delivery target selection, deliverable filtering, and preamble guidance are preserved. Files removed: - nanobot/heartbeat/__init__.py - nanobot/heartbeat/service.py - tests/heartbeat/* - tests/agent/test_heartbeat_service.py Templates and docs updated to reflect cron-based usage.	2026-05-28 20:20:28 +08:00
outlook84	a4a2c55120	feat(telegram): add webhook support and ordered message queue Introduce webhook mode for the Telegram channel and implement a session-based message reordering mechanism. Key changes: - Update `python-telegram-bot` dependency to include the `webhooks` extra. - Add `TelegramConfig` fields for webhook configuration, with validation rules for public HTTPS URLs and Telegram's secret token. - Implement `_enqueue_ordered_update` and `_drain_ordered_updates` in `TelegramChannel` to stage incoming messages and commands behind a short per-session reorder window, ensuring sequential delivery based on message and update IDs. - Configure `start_webhook` in `TelegramChannel.start()` when webhook mode is enabled. - Add unit tests for webhook config validations, webhook startup, and message reordering. - Document webhook configuration and reverse proxy details in `docs/chat-apps.md`.	2026-05-26 16:14:51 +08:00
moran	179acfe104	feat(providers): add Step Plan support Document how to use StepFun's Step Plan subscription endpoint with the existing `stepfun` provider by overriding `apiBase`, following the same pattern as the `zhipu` provider's coding plan documentation. - Base URL: `https://api.stepfun.com/step_plan/v1` (dedicated endpoint) - API Key: same `STEPFUN_API_KEY` as the regular `stepfun` provider - Models: `step-3.5-flash`, `step-3.5-flash-2603`, `step-router-v1` Changes: - `docs/configuration.md` — provider tip, and config example showing `apiBase` override on the existing `stepfun` provider Test: 488/488 provider tests passed.	2026-05-25 18:57:36 +08:00
outlook84	c433d60681	feat: Enhance OpenAI provider configuration with extraBody support and apiType validation	2026-05-25 01:23:36 +08:00
outlook84	d472595417	feat: Add OpenAI API type configuration and update provider settings	2026-05-25 01:23:36 +08:00
Xubin Ren	ec99232208	docs: fix Xiaomi MiMo token plan env key	2026-05-23 22:56:24 +08:00
honjiaxuan	43a1784c5f	docs: use xiaomi_mimo provider for MiMo token plan Replace standalone 'Token Plan' section with general Xiaomi MiMo section using the built-in xiaomi_mimo provider. Token plan becomes a note within the section, since it's just an apiBase override. Key changes: - Use xiaomi_mimo provider (auto-matches via 'mimo' keyword in model name) - Drop redundant provider field (auto-detected) - Add token plan tip to provider tips block - Restructure as general Xiaomi MiMo section with token plan as note	2026-05-23 22:56:24 +08:00
Xubin Ren	3d3ef586e7	docs(config): clarify exec timeout and transcription apiBase	2026-05-23 17:32:59 +08:00
Xubin Ren	5937236f9d	test(image-generation): tighten zhipu provider coverage	2026-05-23 17:06:36 +08:00
Jiajun Xie	3e6f9907fe	feat: Add Zhipu (智谱) image generation provider	2026-05-23 17:06:36 +08:00
Xubin Ren	f5534bcaa0	Merge origin/main into fix-ollama-image-generation	2026-05-22 21:15:42 +08:00
Xubin Ren	8281cd1946	test(providers): cover Novita gateway fallback	2026-05-21 16:16:32 +08:00
Alex-wuhu	e5476573f4	test(providers): align Novita provider coverage	2026-05-21 16:16:32 +08:00
Alex-wuhu	0d1d23b5fb	feat: add Novita AI provider	2026-05-21 16:16:32 +08:00
Haisam Abbas	84603f4cf2	Add Ollama image generation support	2026-05-21 12:06:08 +05:00
chengyongru	886e7e43d5	fix(signal): bypass base is_allowed for policy-approved messages Override _handle_message to publish directly to the bus for messages that have already passed _check_inbound_policy. The denied DM pairing path calls super()._handle_message() to issue pairing codes via the base class. This avoids cross-policy leakage where e.g. group open policy would cause is_allowed to incorrectly allow denied DM senders. Also includes: - SSE: strip one optional leading space after 'data:' per spec - Convert 20+ f-string log calls to loguru lazy formatting - Add end-to-end tests for DM/group routing through the full chain - Add cross-policy test (dm allowlist + group open) for pairing - Add Signal channel documentation to docs/chat-apps.md	2026-05-21 01:00:36 +08:00
Xubin Ren	eae51333ad	fix(providers): point Skywork at APIFree agent endpoint	2026-05-20 12:33:03 +08:00
moran	6194a9b919	docs(configuration): fix APIFree formatting — merge wrapped description into single line	2026-05-20 12:33:03 +08:00
moran	61ae869610	feat(providers): add APIFree support Add APIFree as a built-in OpenAI-compatible provider. APIFree offers agent-optimised models such as skywork-ai/skyclaw-v1 through an OpenAI-compatible API at https://api.apifree.ai/agent/v1. Changes: - Register apifree provider in the provider registry - Add config schema field - Add documentation with configuration example - Add provider tests, websocket channel tests, and webui tests - Add provider icon in settings UI	2026-05-20 12:33:03 +08:00
Xubin Ren	e00220bdb6	feat(providers): add Skywork provider support	2026-05-20 02:20:44 +08:00
moran	4dccee56a7	docs: translate StepPlan section from Chinese to English	2026-05-20 00:08:38 +08:00
moran	2d302a006e	feat(image-generation): add StepFun provider support and StepPlan docs - Add StepFunImageGenerationClient with step-image-edit-2 / step-1x-medium support - Map aspect ratios to StepFun size strings (WxH order) - Add style_reference for step-1x-medium reference-image generation - Register in image gen provider registry (auto-discovered by nanobot.py) - Add 7 unit tests: payload, default size, explicit size, style_reference (1x/non-1x), missing key, no-images - Add StepFun section to docs/image-generation.md with provider config - Add StepPlan (订阅制) subsection with apiBase override example	2026-05-20 00:08:38 +08:00

125 Commits
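
Several commits above touch Server-Sent Events handling: 886e7e43d5 strips one optional leading space after `data:` per the SSE spec, and 7930058348 parses a `transcript.text.delta` → `transcript.text.done` event sequence. The sketch below illustrates both ideas under stated assumptions; it is not nanobot's code. The helper names (`sse_field_value`, `collect_transcript`) and the JSON payload keys (`type`, `delta`, `text`) are hypothetical and chosen only for illustration.

```python
import json
from typing import Iterable, Optional


def sse_field_value(line: str, field: str = "data") -> Optional[str]:
    """Return the value of an SSE field line, or None if the line is another field."""
    prefix = field + ":"
    if not line.startswith(prefix):
        return None
    value = line[len(prefix):]
    # The SSE spec removes exactly one leading space, if present; any
    # further spaces are part of the value.
    if value.startswith(" "):
        value = value[1:]
    return value


def collect_transcript(lines: Iterable[str]) -> str:
    """Accumulate delta events until a done event arrives.

    Assumption: each data line carries a JSON object with hypothetical
    keys "type", "delta" and "text".
    """
    parts = []
    for line in lines:
        value = sse_field_value(line)
        if value is None:
            continue  # skip "event:", "id:", comments and blank lines
        event = json.loads(value)
        kind = event.get("type")
        if kind == "transcript.text.delta":
            parts.append(event.get("delta", ""))
        elif kind == "transcript.text.done":
            # Prefer the final text if the done event carries one.
            return event.get("text") or "".join(parts)
    return "".join(parts)


stream = [
    'data: {"type": "transcript.text.delta", "delta": "hello "}',
    'data:{"type": "transcript.text.delta", "delta": "world"}',
    'data: {"type": "transcript.text.done"}',
]
transcript = collect_transcript(stream)
```

Falling back to the joined deltas when the done event has no final text keeps the sketch tolerant of both payload styles; a real provider would follow whatever the endpoint documents.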