nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-20 08:32:25 +00:00

Author	SHA1	Message	Date
Celina Hanouti	407314a672	feat(providers): add Hugging Face inference provider	2026-04-28 14:24:32 +08:00
Subal	80ee4483f8	feat: make consolidation ratio configurable	2026-04-26 20:24:42 +08:00
chengyongru	f6a417e77d	fix(transcription): harden language parameter validation and tests - Add ISO-639 pattern validation (2-3 lowercase letters) to schema - Normalize empty language to None in provider constructors - Extract shared httpx mock stubs, parameterize provider tests - Add test for language=None omitting field from multipart body - Add test for Pydantic pattern validation rejecting invalid codes	2026-04-22 12:41:32 +08:00
k	123d69bfb7	fix: allow specifying transcription language	2026-04-22 12:41:32 +08:00
flobo3	1826ab44fa	feat(transcription): add language parameter for Groq Whisper STT	2026-04-22 12:41:32 +08:00
Xubin Ren	384bad17b4	Merge origin/main into fix/config-default-api-base Made-with: Cursor	2026-04-18 20:08:21 +00:00
Xubin Ren	cc5a666d5d	review(dream): harden line-age annotation per review feedback Follow-up to #3212, fully backward compatible: - Extract the 14-day staleness threshold as `_STALE_THRESHOLD_DAYS` module constant and pass it into the Phase 1 prompt template as `{{ stale_threshold_days }}`. The number lived in three places before (code threshold, prompt instruction, docstring); now there is one. - Add `DreamConfig.annotate_line_ages` (default True = current behavior) and propagate it through `Dream.__init__` and the gateway wiring in cli/commands.py. Gives users a knob to disable the feature without a code patch if an LLM reacts poorly to the `← Nd` suffix. - Harden `_annotate_with_ages` against dirty working trees: when HEAD blob line count disagrees with the working-tree content length, skip annotation entirely instead of assigning ages to the wrong lines. The previous `i >= len(ages)` guard only handled one direction of the mismatch. - Inline-comment the `max_iterations` 10→15 bump with a pointer to exp002 so future blame has context. - Add 4 regression tests: end-to-end `← 30d` reaches prompt, 14/15 threshold boundary, `annotate_line_ages=False` bypasses git entirely (verified via `assert_not_called`), length-mismatch defense, and template-var rendering. Made-with: Cursor	2026-04-17 13:45:38 +08:00
chengyongru	35f3084c03	feat(dream): per-line age annotations + dedup-aware prompt + max_iter=15 Three improvements to Dream's memory consolidation: 1. Per-line git-blame age annotations: MEMORY.md lines get `← Nd` suffixes (N>14) from dulwich annotate. SOUL.md/USER.md excluded as permanent. LLM uses content judgment, not just age, to decide what to prune. 2. Dedup-aware Phase 1 prompt: reframed as dual-task (extract facts + deduplicate existing files) with explicit redundancy patterns to scan for. Validated through 20 experiments (exp-002 prompt + max_iter=15 was best, averaging -1643 chars/5.4% compression per run). 3. Phase 1 analysis as commit body: dream git commits now include the full Phase 1 analysis for transparency via /dream-log. 4. max_iterations raised from 10 to 15: 30% improvement over 10 with no risk; 20 showed diminishing returns (exp-020: -701 vs exp-017: -1643).	2026-04-17 13:45:38 +08:00
Xubin Ren	90b7d940e8	refactor(config): nest MyTool settings under tools.my (with legacy-key migration)	2026-04-16 15:58:20 +00:00
chengyongru	b51da93cbb	feat(agent): add SelfTool for runtime self-inspection and configuration Add a built-in tool that lets the agent inspect and modify its own runtime state (model, iterations, context window, etc.). Key features: - inspect: view current config, usage stats, and subagent status - modify: adjust parameters at runtime (protected by type/range validation) - Subagent observability: inspect running subagent tasks (phase, iteration, tool events, errors) — subagents are no longer a black box - Watchdog corrects out-of-bounds values on each iteration - Enabled by default in read-only mode (self_modify: false) - All changes are in-memory only; restart restores defaults - Comprehensive test suite (90 tests) Includes a self-awareness skill (always-on) with progressive disclosure: SKILL.md for core rules, references/examples.md for detailed scenarios.	2026-04-16 23:44:26 +08:00
Soham Bhattacharya	41a1b0058d	Add support for nullable API keys and LM Studio	2026-04-16 02:49:54 +08:00
Aisht	d0a282e766	feat(provider): add MiniMax Anthropic endpoint for thinking mode - Add minimax_anthropic provider using Anthropic-compatible endpoint - Endpoint: https://api.minimax.io/anthropic - Supports reasoning_effort parameter for thinking mode (low/medium/high/adaptive) - Uses same MINIMAX_API_KEY as existing minimax provider	2026-04-16 01:00:45 +08:00
Xubin Ren	e4b3f9bd28	security(gateway): keep health endpoint local by default Bind the gateway health listener to localhost by default and reduce the probe response to a minimal status payload so accidental public exposure leaks less information. Made-with: Cursor	2026-04-14 07:19:38 +00:00
moranfong	0750d1f182	fix(config): return provider default api base in config resolution	2026-04-13 23:42:58 +08:00
Xubin Ren	09c238ca0f	Merge origin/main into pr-2959 Resolve the config plumbing conflicts and keep disabled skill filtering consistent for subagent prompts after syncing with main. Made-with: Cursor	2026-04-12 02:02:39 +00:00
Mike Terhar	d3aa209cf6	add kagi web search tool	2026-04-11 16:53:05 +08:00
Xubin Ren	84e840659a	refactor(config): rename auto compact config key Prefer the more user-friendly idleCompactAfterMinutes name for auto compact while keeping sessionTtlMinutes as a backward-compatible alias. Update tests and README to document the retained recent-context behavior and the new preferred key.	2026-04-11 15:56:41 +08:00
chengyongru	fb6dd111e1	feat(agent): auto compact — proactive session compression to reduce token cost and latency (#2982 ) When a user is idle for longer than a configured TTL, nanobot proactively compresses the session context into a summary. This reduces token cost and first-token latency when the user returns — instead of re-processing a long stale context with an expired KV cache, the model receives a compact summary and fresh input.	2026-04-11 15:56:41 +08:00
chenyahui	0e6331b66d	feat(exec): support allowed_env_keys to pass specified env vars to subprocess Add allowed_env_keys config field to selectively forward host environment variables (e.g. GOPATH, JAVA_HOME) into the sandboxed subprocess environment, while keeping the default allow-list unchanged.	2026-04-09 23:35:44 +08:00
chenyahui	e9c4fe6824	feat(skills): add disabled_skills config to exclude skills from loading Introduce a disabled_skills option in the config schema that allows users to specify a list of skill names to be excluded. The setting is threaded from config through Nanobot -> AgentLoop -> ContextBuilder -> SkillsLoader. Disabled skills are filtered out from list_skills, get_always_skills, and build_skills_summary. Four new test cases cover the filtering behavior.	2026-04-09 14:11:47 +08:00
whs	743e73da3f	feat(session): add unified_session config to share one session across all channels	2026-04-09 11:09:25 +08:00
Xubin Ren	1e8a6663ca	test(anthropic): add regression tests for thinking modes incl. adaptive Also update schema comment to mention 'adaptive' as a valid value. Made-with: Cursor	2026-04-07 22:53:43 +08:00
Xubin Ren	35dde8a30e	refactor: unify voice transcription config across all channels - Move transcriptionProvider to global channels config (not per-channel) - ChannelManager auto-resolves API key from matching provider config - BaseChannel gets transcription_provider attribute, no more getattr hack - Remove redundant transcription fields from WhatsAppConfig - Update README: document transcriptionProvider, update provider table Made-with: Cursor	2026-04-06 06:07:30 +00:00
Xubin Ren	a8707ca8f6	Merge origin/main into feat/best_skill_and_hook (resolve 4 conflicts) Made-with: Cursor	2026-04-05 18:53:17 +00:00
hoaresky	6bd2950b99	Fix: add asyncio timeout guard for DuckDuckGo search DDGS's internal `timeout=10` relies on `requests` read-timeout semantics, which only measure the gap between bytes — not total wall-clock time. When the underlying HTTP connection enters CLOSE-WAIT or the server dribbles data slowly, this timeout never fires, causing `ddgs.text` to hang indefinitely via `asyncio.to_thread`. Since `asyncio.to_thread` cannot cancel the underlying OS thread, the agent's session lock is never released, blocking all subsequent messages on the same session (observed: 8+ hours of unresponsiveness). Fix: - Add `timeout` field to `WebSearchConfig` (default: 30s, configurable via config.json or NANOBOT_TOOLS__WEB__SEARCH__TIMEOUT env var) - Wrap `asyncio.to_thread` with `asyncio.wait_for` to enforce a hard wall-clock deadline Closes #2804 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-06 02:21:51 +08:00
Jiajun	7e1ae3eab4	feat(provider): add Qianfan provider support (#2699 )	2026-04-05 16:52:37 +08:00
04cb	5f08d61d8f	fix(security): add ssrfWhitelist config to unblock Tailscale/CGNAT (#2669 )	2026-04-04 19:43:18 +08:00
Xubin Ren	0a3a60a7a4	refactor(memory): simplify Dream config naming and rename gitstore module	2026-04-04 10:01:45 +00:00
Xubin Ren	30ea048f19	Merge remote-tracking branch 'origin/main' into pr-2717-review	2026-04-04 04:42:52 +00:00
Xubin Ren	652377bee9	Merge origin/main into feat/web-disable-flag Made-with: Cursor	2026-04-03 18:41:43 +00:00
Lingao Meng	cf6c979339	feat(provider): add Xiaomi MiMo LLM support Register xiaomi_mimo as an OpenAI-compatible provider with its API base URL, add xiaomi_mimo to the provider config schema, and document it in README. Signed-off-by: Lingao Meng <menglingao@xiaomi.com>	2026-04-03 14:42:57 +08:00
chengyongru	b9616674f0	feat(agent): two-stage memory system with Dream consolidation Replace single-stage MemoryConsolidator with a two-stage architecture: - Consolidator: lightweight token-budget triggered summarization, appends to HISTORY.md with cursor-based tracking - Dream: cron-scheduled two-phase processor that analyzes HISTORY.md and updates SOUL.md, USER.md, MEMORY.md via AgentRunner with edit_file tools for surgical, fault-tolerant updates New files: MemoryStore (pure file I/O), Dream class, DreamConfig, /dream and /dream-log commands. 89 tests covering all components.	2026-04-02 22:42:25 +08:00
Xubin Ren	fbedf7ad77	feat: harden agent runtime for long-running tasks	2026-04-01 19:12:49 +00:00
Shiniese	7f1dca3186	feat: unify web tool config under WebToolsConfig + add web tool toggle controls - Rename WebSearchConfig references to the new WebToolsConfig root struct that wraps both search config and global proxy settings - Add 'enable' flag to WebToolsConfig to allow fully disabling all web-related tools (WebSearch, WebFetch) at runtime - Update AgentLoop and SubagentManager to receive the full web config object instead of separate web_search_config/web_proxy parameters - Update CLI command initialization to pass the consolidated web config struct instead of split fields - Change default web search provider from brave to duckduckgo for better out-of-the-box usability (no API key required)	2026-03-30 16:22:11 +08:00
Xubin Ren	5635907e33	feat(api): load serve settings from config Read serve host, port, and timeout from config by default, keep CLI flags higher priority, and bind the API to localhost by default for safer local usage.	2026-03-29 15:32:33 +00:00
longyongshen	813de554c9	feat(provider): add Step Fun (阶跃星辰) provider support Made-with: Cursor	2026-03-25 22:43:47 +08:00
Xubin Ren	f0f0bf02d7	refactor(channel): centralize retry around explicit send failures Make channel delivery failures raise consistently so retry policy lives in ChannelManager rather than being split across individual channels. Tighten Telegram stream finalization, clarify sendMaxRetries semantics, and align the docs with the behavior the system actually guarantees.	2026-03-25 22:37:11 +08:00
chengyongru	5e9fa28ff2	feat(channel): add message send retry mechanism with exponential backoff - Add send_max_retries config option (default: 3, range: 0-10) - Implement _send_with_retry in ChannelManager with 1s/2s/4s backoff - Propagate CancelledError for graceful shutdown - Fix telegram send_delta to raise exceptions for Manager retry - Add comprehensive tests for retry logic - Document channel settings in README	2026-03-25 22:37:11 +08:00
Xubin Ren	13d6c0ae52	feat(config): add configurable timezone for runtime context Add agent-level timezone configuration with a UTC default, propagate it into runtime context and heartbeat prompts, and document valid IANA timezone usage in the README.	2026-03-25 22:07:14 +08:00
Xubin Ren	3dfdab704e	refactor: replace litellm with native openai + anthropic SDKs - Remove litellm dependency entirely (supply chain risk mitigation) - Add AnthropicProvider (native SDK) and OpenAICompatProvider (unified) - Merge CustomProvider into OpenAICompatProvider, delete custom_provider.py - Add ProviderSpec.backend field for declarative provider routing - Remove _resolve_model, find_gateway, find_by_model (dead heuristics) - Pass resolved spec directly into provider — zero internal lookups - Stub out litellm-dependent model database (cli/models.py) - Add anthropic>=0.45.0 to dependencies, remove litellm - 593 tests passed, net -1034 lines	2026-03-25 01:58:48 +08:00
Xubin Ren	14763a6ad1	fix(provider): accept canonical and alias provider names consistently	2026-03-24 03:03:59 +00:00
Xubin Ren	2056061765	refine heartbeat session retention boundaries	2026-03-24 00:33:43 +08:00
Desmond Sow	f64ae3b900	feat(provider): add OpenVINO Model Server provider (#2193 ) add OpenVINO Model Server provider	2026-03-23 11:07:46 +08:00
Matt von Rohr	7878340031	feat(providers): add Mistral AI provider Register Mistral as a first-class provider with LiteLLM routing, MISTRAL_API_KEY env var, and https://api.mistral.ai/v1 default base. Includes schema field, registry entry, and tests.	2026-03-23 11:07:46 +08:00
Xubin Ren	bd621df57f	feat: add streaming channel support with automatic fallback Provider layer: add chat_stream / chat_stream_with_retry to all providers (base fallback, litellm, custom, azure, codex). Refactor shared kwargs building in each provider. Channel layer: BaseChannel gains send_delta (no-op) and supports_streaming (checks config + method override). ChannelManager routes _stream_delta / _stream_end to send_delta, skips _streamed final messages. AgentLoop._dispatch builds bus-backed on_stream/on_stream_end callbacks when _wants_stream metadata is set. Non-streaming path unchanged. CLI: clean up spinner ANSI workarounds, simplify commands.py flow. Made-with: Cursor	2026-03-23 10:20:41 +08:00
Xubin Ren	32f4e60145	refactor(providers): hide oauth-only providers from config setup Exclude openai_codex alongside github_copilot from generated config, filter OAuth-only providers out of the onboarding wizard, and clarify in README that OAuth login stores session state outside config. Also unify the GitHub Copilot login command spelling and add regression tests. Made-with: Cursor	2026-03-21 03:20:59 +08:00
Harvey Mackie	e029d52e70	chore: remove redundant github_copilot field from config.json	2026-03-21 03:20:59 +08:00
Xubin Ren	1c39a4d311	refactor(tools): keep exec enable without configurable deny patterns Made-with: Cursor	2026-03-20 17:46:08 +00:00
Xubin Ren	3825ed8595	merge origin/main into pr-1824 - wire tools.exec.enable and deny_patterns into the current AgentLoop - preserve the current WebSearchTool config-based registration path - treat deny_patterns=[] as an explicit override instead of falling back to the default blacklist - add regression coverage for disabled exec registration and custom deny patterns Made-with: Cursor	2026-03-20 17:21:42 +00:00
Xubin Ren	f44c4f9e3c	refactor: remove deprecated memory_window, harden wizard display	2026-03-20 18:46:13 +08:00

1 2 3 4

190 Commits