nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-04-22 11:00:19 +00:00

Author	SHA1	Message	Date
Xubin Ren	1b211c7d3a	Merge branch 'main' into nanobot-webui Made-with: Cursor	2026-04-18 19:17:16 +00:00
Xubin Ren	9ed3031a42	feat(webui): add initial webui with websocket chat flow	2026-04-18 18:51:53 +00:00
chengyongru	5818569e8f	feat(wizard): auto-detect Literal fields as select menus Literal["standard", "persistent"] fields are now rendered as select dropdowns instead of free-text input. This makes provider_retry_mode and any future Literal fields self-documenting in the wizard.	2026-04-18 21:56:10 +08:00
chengyongru	ebb5179cab	feat(wizard): add Channel Common, API Server menus and field constraint validation - Add [H] Channel Common menu to configure send_progress, send_tool_hints, send_max_retries, and transcription_provider - Add [I] API Server menu to configure host, port, timeout - Add real-time Pydantic field constraint validation (ge/gt/le/lt/min_length/max_length) with constraint hints shown in field display (e.g. "Send Max Retries (0-10)") - Add _pause() to View Configuration Summary to prevent immediate screen clear - Fix _format_value dict branch to handle BaseModel instances without crashing	2026-04-18 21:56:10 +08:00
chengyongru	34e8f97b1f	refactor(templates): separate identity and SOUL responsibilities Move all behavioral instructions out of identity.md into SOUL.md so that each file has a single clear purpose: - identity.md: capability facts only (runtime, workspace, format hints, tool guidance, untrusted content warning) - SOUL.md: behavioral rules (name, personality, execution rules) The "Act, don't narrate" rule is refined into layered behavior: act immediately on single-step tasks, plan first for multi-step tasks. This eliminates the contradiction where identity said "never end with a plan" but user SOUL.md said "always plan first".	2026-04-18 21:55:56 +08:00
Xubin Ren	6bfb75ed03	feat(websocket): multiplex multiple chat_ids over a single connection	2026-04-18 16:49:12 +08:00
Xubin Ren	70a1279b86	test: pin retry-wait callback routing so internal heartbeats stay off channels Add two focused regression tests for the retry-wait leak this PR fixes: - tests/agent/test_runner.py::test_runner_binds_on_retry_wait_to_retry_callback_not_progress locks in that `AgentRunSpec.retry_wait_callback` (not `progress_callback`) is what `_build_request_kwargs` forwards to the provider as `on_retry_wait`. - tests/channels/test_channel_manager_delta_coalescing.py::TestRetryWaitFiltering runs `_dispatch_outbound` end-to-end and asserts that `_retry_wait: True` messages never reach channel send. Both tests fail on origin/main and pass with this PR's fix applied. Made-with: Cursor	2026-04-18 13:50:05 +08:00
Xubin Ren	c8d834a504	fix(loop): document subagent-followup persistence and guard empty content - Add inline rationale for persisting before ContextBuilder and for passing current_message="" on subagent follow-ups (avoids double-projection after merge). - Skip persistence for empty subagent content (no-op messages should not pollute history). - Add regression test covering the empty-content guard. Made-with: Cursor	2026-04-18 13:30:22 +08:00
xzq.xu	1c939e8a5f	fix(loop): persist subagent follow-up events in history	2026-04-18 13:30:22 +08:00
04cb	c27b4d07c4	fix(utils): recurse into PPTX groups and tables when extracting text (#3250 )	2026-04-18 12:30:42 +08:00
JunghwanNA	34fccb2ee9	Prevent self-inspection from leaking configured secrets MyTool blocks direct access to sensitive nested paths, but its formatter still printed scalar fields for small config objects. That let `my(action="check", key="web_config.search")` expose `api_key` in plain text even though the docs promise sensitive sub-fields are protected. This keeps the change narrow: sensitive nested config fields are omitted from MyTool's formatted output, and regression coverage locks the behavior in. Constraint: Must preserve existing read-only inspection behavior for non-sensitive fields Constraint: Keep scope limited to MyTool rather than introducing broader redaction plumbing Rejected: Rework global context/tool redaction around MyTool \| broader than needed for the leak path Confidence: high Scope-risk: narrow Reversibility: clean Directive: If more nested config rendering is added later, filter sensitive field names at the formatter boundary as well as the path resolver Tested: PYTHONPATH=$PWD pytest -q tests/agent/tools/test_self_tool.py /Users/jh0927/Workspace/nanobot-validation-artifacts-2026-04-18/test_my_tool_secret_leak_regression.py Not-tested: Full repository test suite Related: #3259	2026-04-18 00:59:08 +08:00
JunghwanNA	c196b5b0c2	Prevent failed SSE requests from masquerading as successful completions The streaming API currently logs backend exceptions but still emits the same `finish_reason: "stop"` + `[DONE]` terminator used for successful responses. That makes a failed streamed request look successful to OpenAI-compatible clients. This keeps the fix narrow: track whether the stream backend failed and suppress the success terminator in that case. A regression test locks in the expected behavior. Constraint: Keep the non-streaming response path untouched Constraint: Follow up on the known limitation called out during PR #3222 review without redesigning the SSE protocol Rejected: Introduce a custom SSE error event shape in the same patch \| expands API surface and review scope Confidence: high Scope-risk: narrow Reversibility: clean Directive: If explicit streamed error events are added later, keep them distinct from the success stop+[DONE] terminator to preserve client retry semantics Tested: PYTHONPATH=$PWD pytest -q tests/test_api_stream.py /Users/jh0927/Workspace/nanobot-validation-artifacts-2026-04-18/test_api_stream_error_regression.py Not-tested: Full repository test suite Related: #3260 Related: #3222	2026-04-18 00:44:44 +08:00
Steve	39dd59f2ba	fix(cron): state per-action requirements in descriptions, keep list/remove callable The previous patch promoted `message` into top-level `required`, which solved the `add` loop but broke `list` and `remove`: `ToolRegistry.prepare_call` enforces `required` via `validate_params`, so `cron(action="list")` and `cron(action="remove", job_id=...)` — both documented in `SKILL.md` — started failing schema validation with the same "missing required message" shape that #3113 describes for `add`. Instead: - Keep `required=["action"]` so `list`/`remove` stay callable. - Prefix `message`'s description with `REQUIRED when action='add'.` and `job_id`'s with `REQUIRED when action='remove'.` so LLMs see the real per-action contract up front. - Keep the improved runtime error message from the previous commit for the case an LLM still omits `message` on `add`. Also add `tests/cron/test_cron_tool_schema_contract.py` to lock in: - `list` and `remove` pass schema validation with no `message` - `add` with `message` passes - `add` without `message` surfaces the actionable runtime error - field descriptions carry the REQUIRED hints - top-level `required` stays `["action"]` Existing `tests/cron/test_cron_tool_list.py` cases bypass schema validation by calling `_list_jobs()` / `_remove_job()` directly, which is why CI didn't catch the regression; the new test goes through `ToolRegistry.prepare_call`.	2026-04-17 22:52:48 +08:00
Xubin Ren	b8d327dc41	test + docs: lock should_execute_tools guard semantics (#3220 ) Two small follow-ups to the guard: 1. Fix the should_execute_tools docstring so it matches the actual code. The previous version said "Only execute when finish_reason explicitly signals tool intent" but the code also accepts finish_reason == "stop". Explain why (some compliant providers emit "stop" with legitimate tool calls — openai_compat_provider.py already mirrors this at lines ~633 / ~678 where ("tool_calls", "stop") are both treated as the terminal tool-call state). Without this, a strict "tool_calls"-only guard would regress 15 existing runner tests that construct LLMResponse with tool_calls but no explicit finish_reason (default = "stop"). 2. Add tests/providers/test_llm_response.py. This locks the three cases: - no tool calls -> never executes - tool calls + "tool_calls"/stop -> executes - tool calls + refusal / content_filter / error / length / ... -> blocked These are exactly the boundary cases the #3220 fix is about; without a test here a future refactor could silently revert the guard. Body + tests only, no behavior change beyond the existing PR's intent. Made-with: Cursor	2026-04-17 20:39:46 +08:00
Cheng Yongru	aabc3d5017	fix(memory): fall back to raw_archive on LLM error response When chat_with_retry returns an error response (finish_reason='error') instead of raising an exception, archive() previously treated the error message as a valid summary and wrote it to history.jsonl, while the original session data was already cleared by /new — causing irreversible data loss. Fix: check finish_reason after the LLM call and raise RuntimeError on error responses, which naturally falls through to the existing raw_archive fallback. This preserves the original messages in history.jsonl instead of losing them. Fixes #3244	2026-04-17 20:15:07 +08:00
Mariano Campo	d0e65ebf70	fix(exec): pass allowed_env_keys to exec tool calls in subagents	2026-04-17 16:32:25 +08:00
Xubin Ren	3ae4333cef	test(email): cover smtp_username / imap_username / case-insensitive self-address match The original regression only exercised a from_address match with all three identity fields set to the same value, so it couldn't distinguish whether _self_addresses actually picks up smtp_username and imap_username or just collapses on from_address. Add a parametrized test covering: - smtp_username-only match (from_address empty, imap_username different) — simulates SMTP relays that rewrite outbound From to the login identity. - imap_username-only match — simulates mailbox-identity setups. - Case-insensitive match — inbound From arriving upper-cased must still hit. No production code changes. Made-with: Cursor	2026-04-17 16:25:16 +08:00
yorkhellen	1011ea5ac8	fix(email): ignore self-sent mailbox messages Skip inbound emails that come from the bot's own configured addresses so a mailbox wired to the same SMTP/IMAP account does not trigger infinite reply loops.	2026-04-17 16:25:16 +08:00
chengyongru	8c0c4e5b31	refactor(agent): tighten comments, extract constant, strengthen edge case test - Extract synthetic user message string to module-level constant - Tighten comments in _snip_history recovery branch - Strengthen no-user edge case test to verify safety net interaction	2026-04-17 16:20:53 +08:00
chengyongru	44b526c4ee	fix(agent): preserve user message in _snip_history to prevent GLM error 1214 When _snip_history truncates the message history and the only user message ends up outside the kept window, providers like GLM reject the resulting system→assistant sequence with error 1214 ("messages 参数非法"). Two-layer fix: 1. _snip_history now walks backwards through non_system messages to recover the nearest user message when none exists in the kept window. 2. _enforce_role_alternation inserts a synthetic user message "(conversation continued)" when the first non-system message is a bare assistant (no tool_calls), serving as a safety net for any edge cases that slip through. Co-authored-by: darlingbud <darlingbud@users.noreply.github.com>	2026-04-17 16:20:53 +08:00
Xubin Ren	5badb75f6c	review: tighten scope and add regression tests Follow-ups from review of #3194: - ci.yml: drop unconditional --ignore=tests/channels/test_matrix_channel.py. That test file already calls pytest.importorskip("nio") at module top, so it self-skips on Windows (where nio isn't installed) without also hiding 62 tests from Linux CI. - filesystem.py: hoist `import os` to the module top and drop the duplicate inline import in ReadFileTool.execute. Document the CRLF->LF normalization as intentional (primarily a Windows UX fix so downstream StrReplace/Grep match consistently regardless of where the file was written). - test_read_enhancements.py: lock down two new behaviors * TestFileStateHashFallback: check_read warns when content changes but mtime is unchanged (coarse-mtime filesystems on Windows). * TestReadFileLineEndingNormalization: ReadFileTool strips CRLF and preserves LF-only files untouched. - test_tool_validation.py: restore list2cmdline/shlex.quote in test_exec_head_tail_truncation. The temp_path-based form was correct, but dropping the quoting broke on any Windows path containing spaces (e.g. C:\Users\John Doe\...). CI runners happen not to have spaces so this slipped through. Tests: 1993 passed locally. Made-with: Cursor	2026-04-17 16:11:37 +08:00
Jiajun Xie	3db2eb66e4	ci: add Windows and Python 3.14 support	2026-04-17 16:11:37 +08:00
Mohamed Elkholy	ce5272c153	fix(transcription): honor api_base for OpenAI transcription provider Complete the symmetry left by #3214: ChannelManager._resolve_transcription_base already resolves providers.openai.api_base, but BaseChannel.transcribe_audio instantiated OpenAITranscriptionProvider without forwarding it, and the provider __init__ did not accept the parameter. Self-hosted OpenAI-compatible Whisper endpoints (LiteLLM, vLLM, etc.) configured via config.json were therefore ignored for the OpenAI backend. - OpenAITranscriptionProvider.__init__ now accepts api_base with env fallback (OPENAI_TRANSCRIPTION_BASE_URL) matching the Groq pattern. - BaseChannel.transcribe_audio forwards self.transcription_api_base to OpenAI. - Tests mirror the existing Groq coverage: manager propagation for provider "openai", BaseChannel-to-provider argument passing, and provider default vs override for api_url. Fully backward-compatible: when api_base is None and the env var is unset, the default https://api.openai.com/v1/audio/transcriptions is used. Refs #3213, follow-up to #3214.	2026-04-17 13:46:51 +08:00
Xubin Ren	d57af5c1d1	test(channels): cover groq transcription api base propagation	2026-04-17 13:46:51 +08:00
Xubin Ren	cc5a666d5d	review(dream): harden line-age annotation per review feedback Follow-up to #3212, fully backward compatible: - Extract the 14-day staleness threshold as `_STALE_THRESHOLD_DAYS` module constant and pass it into the Phase 1 prompt template as `{{ stale_threshold_days }}`. The number lived in three places before (code threshold, prompt instruction, docstring); now there is one. - Add `DreamConfig.annotate_line_ages` (default True = current behavior) and propagate it through `Dream.__init__` and the gateway wiring in cli/commands.py. Gives users a knob to disable the feature without a code patch if an LLM reacts poorly to the `← Nd` suffix. - Harden `_annotate_with_ages` against dirty working trees: when HEAD blob line count disagrees with the working-tree content length, skip annotation entirely instead of assigning ages to the wrong lines. The previous `i >= len(ages)` guard only handled one direction of the mismatch. - Inline-comment the `max_iterations` 10→15 bump with a pointer to exp002 so future blame has context. - Add 4 regression tests: end-to-end `← 30d` reaches prompt, 14/15 threshold boundary, `annotate_line_ages=False` bypasses git entirely (verified via `assert_not_called`), length-mismatch defense, and template-var rendering. Made-with: Cursor	2026-04-17 13:45:38 +08:00
chengyongru	35f3084c03	feat(dream): per-line age annotations + dedup-aware prompt + max_iter=15 Three improvements to Dream's memory consolidation: 1. Per-line git-blame age annotations: MEMORY.md lines get `← Nd` suffixes (N>14) from dulwich annotate. SOUL.md/USER.md excluded as permanent. LLM uses content judgment, not just age, to decide what to prune. 2. Dedup-aware Phase 1 prompt: reframed as dual-task (extract facts + deduplicate existing files) with explicit redundancy patterns to scan for. Validated through 20 experiments (exp-002 prompt + max_iter=15 was best, averaging -1643 chars/5.4% compression per run). 3. Phase 1 analysis as commit body: dream git commits now include the full Phase 1 analysis for transparency via /dream-log. 4. max_iterations raised from 10 to 15: 30% improvement over 10 with no risk; 20 showed diminishing returns (exp-020: -701 vs exp-017: -1643).	2026-04-17 13:45:38 +08:00
Xubin Ren	459a4d7311	test(discord): cover allow_channels filtering in _should_accept_inbound Locks in the two key boundaries of the new channel-based filter: 1. When an incoming channel id is in allow_channels, messages are forwarded. 2. When an incoming channel id is not in allow_channels, messages are silently dropped. The empty-list backward-compatible path is already covered by every existing test that omits allow_channels (default_factory=list). Made-with: Cursor	2026-04-17 02:14:33 +08:00
whs	4fce8d8b8d	feat(api): add SSE streaming for /v1/chat/completions Wire up the existing on_stream/on_stream_end callbacks from process_direct() to emit OpenAI-compatible SSE chunks when stream=true. Non-streaming path is untouched.	2026-04-17 01:54:49 +08:00
Xubin Ren	90b7d940e8	refactor(config): nest MyTool settings under tools.my (with legacy-key migration)	2026-04-16 15:58:20 +00:00
chengyongru	b51da93cbb	feat(agent): add SelfTool for runtime self-inspection and configuration Add a built-in tool that lets the agent inspect and modify its own runtime state (model, iterations, context window, etc.). Key features: - inspect: view current config, usage stats, and subagent status - modify: adjust parameters at runtime (protected by type/range validation) - Subagent observability: inspect running subagent tasks (phase, iteration, tool events, errors) — subagents are no longer a black box - Watchdog corrects out-of-bounds values on each iteration - Enabled by default in read-only mode (self_modify: false) - All changes are in-memory only; restart restores defaults - Comprehensive test suite (90 tests) Includes a self-awareness skill (always-on) with progressive disclosure: SKILL.md for core rules, references/examples.md for detailed scenarios.	2026-04-16 23:44:26 +08:00
Mohamed Elkholy	1304ff78cc	perf(tools): cache ToolRegistry.get_definitions() between mutations get_definitions() sorts tools on every LLM iteration for prompt cache stability. Cache the sorted result and invalidate on register/unregister so the sort only runs when the tool set actually changes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 21:52:36 +08:00
Xubin Ren	7ce8f247a0	test(api): cover remote image URL rejection	2026-04-16 21:02:33 +08:00
chengyongru	e1fdca7d40	fix(status): correct context percentage calculation and sync consolidator - Pass resolved self.context_window_tokens to Consolidator instead of raw parameter that could be None, preventing consolidation failures - Calculate percentage against input budget (ctx - max_completion - 1024) instead of raw context window, consistent with Consolidator/snip formulas - Pass actual max_completion_tokens from provider to build_status_content - Cap percentage display at 999 to prevent runaway values - Add tests for budget-based percentage and cap behavior	2026-04-16 20:30:39 +08:00
Xubin Ren	92a5125108	Merge PR #3141 : fix(skills): use yaml.safe_load for frontmatter parsing to handle multiline descriptions fix(skills): use yaml.safe_load for frontmatter parsing to handle multiline descriptions	2026-04-16 20:07:15 +08:00
Xubin Ren	a2f4090e41	fix(msteams): secure inbound defaults and ref persistence Default Microsoft Teams inbound auth validation to enabled, update the README to match, and prevent denied senders from persisting conversation refs before allowlist checks pass. Made-with: Cursor	2026-04-16 13:22:07 +08:00
chengyongru	abe0145f99	fix(msteams): harden availability check and migrate docs to README - Check both jwt and cryptography in MSTEAMS_AVAILABLE guard so partial installs fail early with a clear message instead of at runtime - Add aclose() to test FakeHttpClient so stop() won't crash - Move MSTEAMS.md into README.md following the same details/summary pattern used by every other channel - Note in README that validateInboundAuth defaults to false	2026-04-16 13:22:07 +08:00
T3chC0wb0y	ee99200341	refactor(msteams): remove business references	2026-04-16 13:22:07 +08:00
T3chC0wb0y	9b4264fce2	refactor(msteams): remove FWDIOC references	2026-04-16 13:22:07 +08:00
T3chC0wb0y	fecef07c60	refactor(msteams): remove obsolete restart notify config	2026-04-16 13:22:07 +08:00
Bob Johnson	9f8774fbdd	fix(msteams): remove hardcoded quote test fallback	2026-04-16 13:22:07 +08:00
chengyongru	63753dbfea	fix(msteams): remove optional deps from dev extras and gate tests PyJWT and cryptography are optional msteams deps; they should not be bundled into the generic dev install. Tests now skip the entire file when the deps are missing, following the dingtalk pattern.	2026-04-16 13:22:07 +08:00
Bob Johnson	4d795f74d5	Fix MSTeams PR review follow-ups	2026-04-16 13:22:07 +08:00
T3chC0wb0y	824dcca5e2	Add Microsoft Teams channel on current nightly base	2026-04-16 13:22:07 +08:00
chengyongru	d64e963258	test(memory): add regression tests for missing cursor key Cover read_unprocessed_history skipping cursorless entries and _next_cursor safe fallback when last entry has no cursor.	2026-04-16 12:32:38 +08:00
Xubin Ren	2b8e90d8fd	test(config): cover LM Studio nullable api key	2026-04-16 02:49:54 +08:00
Xubin Ren	a6ea06e6bf	docs(providers): explain MiniMax thinking endpoint Document why MiniMax thinking mode uses a separate Anthropic-compatible provider and list the matching base URLs. Add a small registry test so the new provider stays wired to the expected backend and API key. Made-with: Cursor	2026-04-16 01:00:45 +08:00
04cb	eacc9fbb5f	refactor(providers): drop unreachable GenerationSettings fallback	2026-04-15 23:52:38 +08:00
04cb	54f7ad3752	fix(providers): guard chat_with_retry against explicit None max_tokens (#3102 )	2026-04-15 23:52:38 +08:00
chengyongru	015833e34b	Merge branch 'main' into fix/skills-yaml-frontmatter	2026-04-15 16:56:23 +08:00
dongzeyu001	6829b8b475	unit test fix	2026-04-15 16:51:02 +08:00

1 2 3 4 5 ...

593 Commits