nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-20 00:22:31 +00:00

Author	SHA1	Message	Date
Xubin Ren	e936ed48bd	feat: add image generation tool and WebUI mode Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-08 20:06:23 +08:00
Tim O'Brien	daa4a25c9b	feat(config): add toolHintMaxLength to control tool hint truncation Add toolHintMaxLength to config (default: 40, range: 20-500). Controls how many characters of tool hints are shown in progress updates (e.g. '$ cd …/project && npm test'). Set to 120+ to see full commands instead of truncated hints: ```json { "agents": { "defaults": { "toolHintMaxLength": 120 } } } ``` - Thread max_length through format_tool_hints → _fmt_known/_fmt_mcp/_fmt_fallback - Make path abbreviation in _abbreviate_command proportional to max_length - Add TestToolHintMaxLength test class with 5 tests - All 41 existing tests pass	2026-05-06 21:18:39 +08:00
chengyongru	05e0106592	refactor(logging): preserve tracebacks and add channel context - Preserve tracebacks: logger.error in except blocks → logger.exception - Channel context: BaseChannel injects self.logger = logger.bind(channel=name) - Third-party bridge: redirect_lib_logging() replaces ad-hoc stdlib-to-loguru bridges - Log levels: network timeouts downgraded from ERROR → WARNING - Fix --verbose flag to actually work with loguru (set handler to DEBUG)	2026-05-06 21:17:45 +08:00
chengyongru	c30e4d86f3	refactor(agent): simplify subagent concurrency with rejection over semaphore Replace the asyncio.Semaphore queueing approach with a simple count check in SpawnTool.execute(). When the concurrency limit is reached, the tool returns an error string so the agent can perceive the reason and adjust its behavior instead of silently queueing. - Remove max_concurrent_subagents parameter threading through AgentLoop, commands.py, and nanobot.py - SubagentManager reads the limit directly from AgentDefaults - SpawnTool checks get_running_count() before calling spawn() - Simplify tests to verify rejection behavior	2026-05-05 22:22:04 +08:00
chengyongru	5853d5dfda	fix: allow_patterns take priority over deny_patterns in ExecTool (#3594) * fix: allow_patterns take priority over deny_patterns in ExecTool Previously deny_patterns were checked first with no bypass, meaning allow_patterns could never exempt commands from the built-in deny list. This made it impossible to whitelist destructive commands for specific directories (e.g. build/cleanup tasks). Changes: - shell.py: check allow_patterns first; if matched, skip deny check - shell.py: deny_patterns now appends to built-in list (not replaces) - schema.py: add allow_patterns/deny_patterns to ExecToolConfig - loop.py/subagent.py: pass allow_patterns/deny_patterns to ExecTool - Add test_exec_allow_patterns.py covering priority semantics * fix: separate deny pattern errors from workspace violation detection The deny pattern error message "Command blocked by safety guard" was included in _WORKSPACE_BLOCK_MARKERS, causing deny_pattern blocks to be misclassified as fatal workspace violations. This meant LLMs had no chance to retry with a different command — the turn was aborted immediately. Changes: - shell.py: deny/allowlist error messages now use distinct phrasing ("blocked by deny pattern filter" / "blocked by allowlist filter") - runner.py: remove "blocked by safety guard" from _WORKSPACE_BLOCK_MARKERS so deny_pattern errors are treated as normal tool errors (LLM can retry) instead of fatal violations - workspace path errors still use "blocked by safety guard" and remain fatal as intended * fix: update test assertions to match new deny pattern error message * fix: indentation error in test file * fix: restore SSRF fatal classification and tidy exec pattern plumbing Address review feedback on the deny/allow_patterns rework: - runner.py: re-add "internal/private url detected" to _WORKSPACE_BLOCK_MARKERS. The earlier marker removal also stripped fatal classification from SSRF / internal-URL rejections (whose message still says "blocked by safety guard"), turning a hard security boundary into something the LLM could retry. - loop.py / subagent.py: drop `or None` between ExecToolConfig and ExecTool. The schema default is an empty list and ExecTool already normalizes None back to [], so the indirection was a no-op. - shell.py: extract `explicitly_allowed` flag in _guard_command so allow_patterns are scanned once instead of twice and the control flow no longer relies on a no-op `pass + else` branch. - tests/agent/test_runner.py: add a regression test asserting that the SSRF block message is treated as fatal, while deny/allowlist filter messages are deliberately non-fatal. * fix: remove unused exec allow-pattern test import Keep the new ExecTool allow-pattern coverage clean under ruff. Co-authored-by: Cursor <cursoragent@cursor.com> --------- Co-authored-by: Xubin Ren <xubinrencs@gmail.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-03 00:27:17 +08:00
moranfong	051037ff08	feat(provider): add LongCat via OpenAI-compatible backend	2026-05-02 01:52:04 +08:00
Xubin Ren	306958d6e6	add native Bedrock Converse provider Made-with: Cursor	2026-05-01 18:52:03 +08:00
chengyongru	28f9bbff31	feat(web_search): add olostep provider Adds Olostep (https://www.olostep.com) as an optional web_search backend using the official olostep Python SDK (client.answers.create()). Changes: - pyproject.toml: adds olostep>=0.1.0 optional dependency - schema.py: adds olostep to provider comment in WebSearchConfig - web.py: adds _search_olostep() with lazy import and provider branching - docs/configuration.md: documents Olostep setup under web search config - tests: unit tests for the new provider Backward compatible: existing users see no behavior change unless they opt into provider: "olostep". No hard dependency at runtime path. Co-authored-by: umerkay <umerkk164@gmail.com>	2026-04-28 19:09:38 +08:00
hussein1362	415e617398	feat(providers): add extra_body config for OpenAI-compatible endpoints Add an `extra_body` field to `ProviderConfig` that merges arbitrary key-value pairs into every OpenAI-compatible request body. This is the escape hatch for provider-specific features that nanobot does not have first-class fields for. Real-world use cases this unblocks via config alone (no code changes): - vLLM/TGI `chat_template_kwargs` (e.g. `enable_thinking: false`) - vLLM guided decoding (`guided_json`, `guided_regex`) - Local model sampling params (`repetition_penalty`, `top_k`, `min_p`) - Any future provider-specific param without a new PR each time The config extra_body is applied last via recursive deep-merge, so it can extend or override provider-specific defaults (e.g. thinking params) without clobbering sibling keys set by internal logic. Changes: - Add `extra_body: dict[str, Any] \| None` to `ProviderConfig` - Pass it through `factory.py` to `OpenAICompatProvider.__init__` - Deep-merge into `_build_kwargs` after all internal extra_body entries - Add `_deep_merge` helper (recursive dict merge, does not mutate inputs) - 21 tests: deep-merge semantics, provider init, _build_kwargs integration, thinking coexistence, real-world patterns (guided_json, repetition_penalty), and schema validation	2026-04-28 15:56:13 +08:00
Xubin Ren	18432c313f	Merge origin/main into web-tools Made-with: Cursor	2026-04-28 07:17:05 +00:00
Celina Hanouti	2b455b1e14	feat(providers): add Hugging Face inference provider	2026-04-28 14:55:28 +08:00
Xubin Ren	ad4802600e	refactor(config): make max messages default explicit Use 120 as the config-level default and normalize zero back to that limit so session replay always receives an explicit message cap. Made-with: Cursor	2026-04-28 14:54:32 +08:00
hussein1362	d45ffcf519	feat(config): wire max_messages into session history replay The max_messages config field in AgentDefaults was accepted by the schema but never threaded through to the actual get_history() calls in the agent loop. Both call sites in _process_message hardcoded the default, so sessions with slow or local models accumulated unbounded history that inflated prompt tokens and caused LLM timeouts. Changes: - Add max_messages field to AgentDefaults (default 0 = use built-in constant, any positive value caps history replay) - Store the value on AgentLoop and pass it to get_history() when non-zero - Wire the config through all three AgentLoop construction sites in commands.py (gateway, API server, CLI chat) - 14 focused tests covering schema validation, init storage, history slicing, boundary alignment, integration wiring, and the zero/default path	2026-04-28 14:54:32 +08:00
Subal	80ee4483f8	feat: make consolidation ratio configurable	2026-04-26 20:24:42 +08:00
Xubin Ren	c1e7aa5504	refactor(config): resolve env vars via in-place Pydantic walk Replace the dump→resolve→model_validate roundtrip with a recursive walk that substitutes ${VAR} in string values directly on BaseModel / __pydantic_extra__ / dict / list nodes. Identity is preserved on any subtree with no references, so the original Config instance is returned unchanged when nothing needs resolving. Side effects: - exclude=True fields (e.g. DreamConfig.cron) now survive even when other fields in the same config contain ${VAR} references, closing the edge case left open by the previous fast-path-only fix. - _has_env_refs is dropped (the walker short-circuits naturally). - Added a regression test pairing cron with a resolved providers.groq api_key to lock the coexistence case. Made-with: Cursor	2026-04-22 22:31:40 +08:00
Saimon Ventura	c9a21d96d8	fix(config): preserve excluded fields in resolve_config_env_vars `resolve_config_env_vars` unconditionally dumped the config via `model_dump(mode="json")` and revalidated it, which silently dropped any field declared with `exclude=True` (e.g. `DreamConfig.cron` — introduced by the Dream rename refactor in #2717). Result: `agents.defaults.dream.cron` was never honored at runtime — the gateway always fell back to the default `every 2h` schedule even when `cron` was set in config.json. Fix: skip the roundtrip entirely when the config has no `${VAR}` references. Env-var interpolation still works unchanged when refs exist; the legacy `cron` override now survives the common case of fully-resolved config. Regression test covers the bug path.	2026-04-22 22:31:40 +08:00
Mizarka	3d40e159ae	feat(web-tools): add option to disable fetching via Jina Reader A new configuration block has been added for the web fetch tool, which allows forcing the tool to use the local readability-lxml mode. Combined with the previous option to modify the user agent, allows bypassing most Cloudflare captchas and JS proof-of-work. Assisted-by: Jo'Zahir:Qwen3.6-35B-A3B	2026-04-22 09:28:30 +00:00
Mizarka	ec2f0ccfdb	feat(web-tools): add configurable User-Agent Assisted-by: Jo'Zahir:Qwen3.6-35B-A3B	2026-04-22 09:11:57 +00:00
chengyongru	f6a417e77d	fix(transcription): harden language parameter validation and tests - Add ISO-639 pattern validation (2-3 lowercase letters) to schema - Normalize empty language to None in provider constructors - Extract shared httpx mock stubs, parameterize provider tests - Add test for language=None omitting field from multipart body - Add test for Pydantic pattern validation rejecting invalid codes	2026-04-22 12:41:32 +08:00
k	123d69bfb7	fix: allow specifying transcription language	2026-04-22 12:41:32 +08:00
flobo3	1826ab44fa	feat(transcription): add language parameter for Groq Whisper STT	2026-04-22 12:41:32 +08:00
Xubin Ren	384bad17b4	Merge origin/main into fix/config-default-api-base Made-with: Cursor	2026-04-18 20:08:21 +00:00
Xubin Ren	cc5a666d5d	review(dream): harden line-age annotation per review feedback Follow-up to #3212, fully backward compatible: - Extract the 14-day staleness threshold as `_STALE_THRESHOLD_DAYS` module constant and pass it into the Phase 1 prompt template as `{{ stale_threshold_days }}`. The number lived in three places before (code threshold, prompt instruction, docstring); now there is one. - Add `DreamConfig.annotate_line_ages` (default True = current behavior) and propagate it through `Dream.__init__` and the gateway wiring in cli/commands.py. Gives users a knob to disable the feature without a code patch if an LLM reacts poorly to the `← Nd` suffix. - Harden `_annotate_with_ages` against dirty working trees: when HEAD blob line count disagrees with the working-tree content length, skip annotation entirely instead of assigning ages to the wrong lines. The previous `i >= len(ages)` guard only handled one direction of the mismatch. - Inline-comment the `max_iterations` 10→15 bump with a pointer to exp002 so future blame has context. - Add 4 regression tests: end-to-end `← 30d` reaches prompt, 14/15 threshold boundary, `annotate_line_ages=False` bypasses git entirely (verified via `assert_not_called`), length-mismatch defense, and template-var rendering. Made-with: Cursor	2026-04-17 13:45:38 +08:00
chengyongru	35f3084c03	feat(dream): per-line age annotations + dedup-aware prompt + max_iter=15 Four improvements to Dream's memory consolidation: 1. Per-line git-blame age annotations: MEMORY.md lines get `← Nd` suffixes (N>14) from dulwich annotate. SOUL.md/USER.md excluded as permanent. LLM uses content judgment, not just age, to decide what to prune. 2. Dedup-aware Phase 1 prompt: reframed as dual-task (extract facts + deduplicate existing files) with explicit redundancy patterns to scan for. Validated through 20 experiments (exp-002 prompt + max_iter=15 was best, averaging -1643 chars/5.4% compression per run). 3. Phase 1 analysis as commit body: dream git commits now include the full Phase 1 analysis for transparency via /dream-log. 4. max_iterations raised from 10 to 15: 30% improvement over 10 with no risk; 20 showed diminishing returns (exp-020: -701 vs exp-017: -1643).	2026-04-17 13:45:38 +08:00
Xubin Ren	90b7d940e8	refactor(config): nest MyTool settings under tools.my (with legacy-key migration)	2026-04-16 15:58:20 +00:00
chengyongru	b51da93cbb	feat(agent): add SelfTool for runtime self-inspection and configuration Add a built-in tool that lets the agent inspect and modify its own runtime state (model, iterations, context window, etc.). Key features: - inspect: view current config, usage stats, and subagent status - modify: adjust parameters at runtime (protected by type/range validation) - Subagent observability: inspect running subagent tasks (phase, iteration, tool events, errors) — subagents are no longer a black box - Watchdog corrects out-of-bounds values on each iteration - Enabled by default in read-only mode (self_modify: false) - All changes are in-memory only; restart restores defaults - Comprehensive test suite (90 tests) Includes a self-awareness skill (always-on) with progressive disclosure: SKILL.md for core rules, references/examples.md for detailed scenarios.	2026-04-16 23:44:26 +08:00
Soham Bhattacharya	41a1b0058d	Add support for nullable API keys and LM Studio	2026-04-16 02:49:54 +08:00
Aisht	d0a282e766	feat(provider): add MiniMax Anthropic endpoint for thinking mode - Add minimax_anthropic provider using Anthropic-compatible endpoint - Endpoint: https://api.minimax.io/anthropic - Supports reasoning_effort parameter for thinking mode (low/medium/high/adaptive) - Uses same MINIMAX_API_KEY as existing minimax provider	2026-04-16 01:00:45 +08:00
Xubin Ren	e4b3f9bd28	security(gateway): keep health endpoint local by default Bind the gateway health listener to localhost by default and reduce the probe response to a minimal status payload so accidental public exposure leaks less information. Made-with: Cursor	2026-04-14 07:19:38 +00:00
moranfong	0750d1f182	fix(config): return provider default api base in config resolution	2026-04-13 23:42:58 +08:00
Xubin Ren	09c238ca0f	Merge origin/main into pr-2959 Resolve the config plumbing conflicts and keep disabled skill filtering consistent for subagent prompts after syncing with main. Made-with: Cursor	2026-04-12 02:02:39 +00:00
Mike Terhar	d3aa209cf6	add kagi web search tool	2026-04-11 16:53:05 +08:00
Xubin Ren	84e840659a	refactor(config): rename auto compact config key Prefer the more user-friendly idleCompactAfterMinutes name for auto compact while keeping sessionTtlMinutes as a backward-compatible alias. Update tests and README to document the retained recent-context behavior and the new preferred key.	2026-04-11 15:56:41 +08:00
chengyongru	fb6dd111e1	feat(agent): auto compact — proactive session compression to reduce token cost and latency (#2982) When a user is idle for longer than a configured TTL, nanobot proactively compresses the session context into a summary. This reduces token cost and first-token latency when the user returns — instead of re-processing a long stale context with an expired KV cache, the model receives a compact summary and fresh input.	2026-04-11 15:56:41 +08:00
chenyahui	0e6331b66d	feat(exec): support allowed_env_keys to pass specified env vars to subprocess Add allowed_env_keys config field to selectively forward host environment variables (e.g. GOPATH, JAVA_HOME) into the sandboxed subprocess environment, while keeping the default allow-list unchanged.	2026-04-09 23:35:44 +08:00
chenyahui	e9c4fe6824	feat(skills): add disabled_skills config to exclude skills from loading Introduce a disabled_skills option in the config schema that allows users to specify a list of skill names to be excluded. The setting is threaded from config through Nanobot -> AgentLoop -> ContextBuilder -> SkillsLoader. Disabled skills are filtered out from list_skills, get_always_skills, and build_skills_summary. Four new test cases cover the filtering behavior.	2026-04-09 14:11:47 +08:00
whs	743e73da3f	feat(session): add unified_session config to share one session across all channels	2026-04-09 11:09:25 +08:00
Xubin Ren	1e8a6663ca	test(anthropic): add regression tests for thinking modes incl. adaptive Also update schema comment to mention 'adaptive' as a valid value. Made-with: Cursor	2026-04-07 22:53:43 +08:00
Xubin Ren	35dde8a30e	refactor: unify voice transcription config across all channels - Move transcriptionProvider to global channels config (not per-channel) - ChannelManager auto-resolves API key from matching provider config - BaseChannel gets transcription_provider attribute, no more getattr hack - Remove redundant transcription fields from WhatsAppConfig - Update README: document transcriptionProvider, update provider table Made-with: Cursor	2026-04-06 06:07:30 +00:00
Ben Lenarts	202938ae73	feat: support ${VAR} env var interpolation in config secrets Allow config.json to reference environment variables via ${VAR_NAME} syntax. Variables are resolved at runtime by resolve_config_env_vars(), keeping the raw templates in the Pydantic model so save_config() preserves them. This lets secrets live in a separate env file (e.g. loaded by systemd EnvironmentFile=) instead of plain text in config.json.	2026-04-06 13:43:26 +08:00
Xubin Ren	a8707ca8f6	Merge origin/main into feat/best_skill_and_hook (resolve 4 conflicts) Made-with: Cursor	2026-04-05 18:53:17 +00:00
hoaresky	6bd2950b99	Fix: add asyncio timeout guard for DuckDuckGo search DDGS's internal `timeout=10` relies on `requests` read-timeout semantics, which only measure the gap between bytes — not total wall-clock time. When the underlying HTTP connection enters CLOSE-WAIT or the server dribbles data slowly, this timeout never fires, causing `ddgs.text` to hang indefinitely via `asyncio.to_thread`. Since `asyncio.to_thread` cannot cancel the underlying OS thread, the agent's session lock is never released, blocking all subsequent messages on the same session (observed: 8+ hours of unresponsiveness). Fix: - Add `timeout` field to `WebSearchConfig` (default: 30s, configurable via config.json or NANOBOT_TOOLS__WEB__SEARCH__TIMEOUT env var) - Wrap `asyncio.to_thread` with `asyncio.wait_for` to enforce a hard wall-clock deadline Closes #2804 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-06 02:21:51 +08:00
Jiajun	7e1ae3eab4	feat(provider): add Qianfan provider support (#2699)	2026-04-05 16:52:37 +08:00
Xubin Ren	9ef5b1e145	fix: reset ssrf whitelist on config reload and document config refresh	2026-04-04 19:43:18 +08:00
04cb	5f08d61d8f	fix(security): add ssrfWhitelist config to unblock Tailscale/CGNAT (#2669)	2026-04-04 19:43:18 +08:00
Xubin Ren	0a3a60a7a4	refactor(memory): simplify Dream config naming and rename gitstore module	2026-04-04 10:01:45 +00:00
Xubin Ren	30ea048f19	Merge remote-tracking branch 'origin/main' into pr-2717-review	2026-04-04 04:42:52 +00:00
Xubin Ren	652377bee9	Merge origin/main into feat/web-disable-flag Made-with: Cursor	2026-04-03 18:41:43 +00:00
imfondof	896d578677	fix(restart): show restart completion with elapsed time across channels	2026-04-04 02:21:42 +08:00
imfondof	ba7c07ccf2	fix(restart): send completion notice after channel is ready and unify runtime keys	2026-04-04 02:21:42 +08:00
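
Commit 415e617398 describes `extra_body` as being applied last through a recursive deep-merge that does not mutate its inputs. The following is a minimal sketch of that idea only, not the repository's `_deep_merge`: the function name `deep_merge`, the sample keys other than `enable_thinking` and `repetition_penalty`, and all sample values are assumptions made for illustration.

```python
from copy import deepcopy
from typing import Any


def deep_merge(base: dict[str, Any], override: dict[str, Any]) -> dict[str, Any]:
    """Return a new dict with `override` merged recursively into `base`.

    Hypothetical helper: nested dicts are merged key by key, any other
    value in `override` replaces the value in `base`. Inputs are copied,
    so neither argument is modified.
    """
    merged = deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


# Assumed internal defaults built before the config is applied.
internal = {
    "chat_template_kwargs": {"enable_thinking": True, "add_generation_prompt": True},
    "top_k": 20,
}
# Assumed user-supplied extra_body from config.
from_config = {
    "chat_template_kwargs": {"enable_thinking": False},
    "repetition_penalty": 1.1,
}

merged = deep_merge(internal, from_config)
print(merged)
```

In this sketch the config value overrides `enable_thinking` while the sibling key `add_generation_prompt` survives, which is the "without clobbering sibling keys" behavior the commit message describes.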

219 Commits
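
Commit 5853d5dfda states that allow_patterns are checked first and, when matched, skip the deny check, and that user deny_patterns are appended to the built-in list. The sketch below illustrates only that ordering under stated assumptions: `guard_command`, `BUILTIN_DENY`, and the sample patterns are hypothetical and are not taken from `shell.py`; the allowlist-filter rejection path mentioned in the commit is not modeled.

```python
import re
from typing import Optional

# Hypothetical built-in deny entry, used only for this illustration.
BUILTIN_DENY = [r"\brm\s+-rf\b"]


def guard_command(
    command: str,
    allow_patterns: Optional[list[str]] = None,
    deny_patterns: Optional[list[str]] = None,
) -> Optional[str]:
    """Return an error string if the command is blocked, else None."""
    allow = allow_patterns or []
    # User-supplied deny patterns extend the built-in list instead of replacing it.
    deny = BUILTIN_DENY + (deny_patterns or [])

    # Scan the allow list once; a match exempts the command from the deny check.
    explicitly_allowed = any(re.search(p, command) for p in allow)
    if not explicitly_allowed:
        for pattern in deny:
            if re.search(pattern, command):
                return "Error: command blocked by deny pattern filter"
    return None


print(guard_command("rm -rf build/"))
print(guard_command("rm -rf build/", allow_patterns=[r"^rm -rf build/"]))
```

Returning an error string instead of raising mirrors the commit's intent that deny-pattern rejections be ordinary tool errors the LLM can react to.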