nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-04-30 06:45:55 +00:00

Author	SHA1	Message	Date
Xubin Ren	cc5a666d5d	review(dream): harden line-age annotation per review feedback Follow-up to #3212, fully backward compatible: - Extract the 14-day staleness threshold as `_STALE_THRESHOLD_DAYS` module constant and pass it into the Phase 1 prompt template as `{{ stale_threshold_days }}`. The number lived in three places before (code threshold, prompt instruction, docstring); now there is one. - Add `DreamConfig.annotate_line_ages` (default True = current behavior) and propagate it through `Dream.__init__` and the gateway wiring in cli/commands.py. Gives users a knob to disable the feature without a code patch if an LLM reacts poorly to the `← Nd` suffix. - Harden `_annotate_with_ages` against dirty working trees: when HEAD blob line count disagrees with the working-tree content length, skip annotation entirely instead of assigning ages to the wrong lines. The previous `i >= len(ages)` guard only handled one direction of the mismatch. - Inline-comment the `max_iterations` 10→15 bump with a pointer to exp002 so future blame has context. - Add 4 regression tests: end-to-end `← 30d` reaches prompt, 14/15 threshold boundary, `annotate_line_ages=False` bypasses git entirely (verified via `assert_not_called`), length-mismatch defense, and template-var rendering. Made-with: Cursor	2026-04-17 13:45:38 +08:00
chengyongru	35f3084c03	feat(dream): per-line age annotations + dedup-aware prompt + max_iter=15 Three improvements to Dream's memory consolidation: 1. Per-line git-blame age annotations: MEMORY.md lines get `← Nd` suffixes (N>14) from dulwich annotate. SOUL.md/USER.md excluded as permanent. LLM uses content judgment, not just age, to decide what to prune. 2. Dedup-aware Phase 1 prompt: reframed as dual-task (extract facts + deduplicate existing files) with explicit redundancy patterns to scan for. Validated through 20 experiments (exp-002 prompt + max_iter=15 was best, averaging -1643 chars/5.4% compression per run). 3. Phase 1 analysis as commit body: dream git commits now include the full Phase 1 analysis for transparency via /dream-log. 4. max_iterations raised from 10 to 15: 30% improvement over 10 with no risk; 20 showed diminishing returns (exp-020: -701 vs exp-017: -1643).	2026-04-17 13:45:38 +08:00
Xubin Ren	90b7d940e8	refactor(config): nest MyTool settings under tools.my (with legacy-key migration)	2026-04-16 15:58:20 +00:00
chengyongru	b51da93cbb	feat(agent): add SelfTool for runtime self-inspection and configuration Add a built-in tool that lets the agent inspect and modify its own runtime state (model, iterations, context window, etc.). Key features: - inspect: view current config, usage stats, and subagent status - modify: adjust parameters at runtime (protected by type/range validation) - Subagent observability: inspect running subagent tasks (phase, iteration, tool events, errors) — subagents are no longer a black box - Watchdog corrects out-of-bounds values on each iteration - Enabled by default in read-only mode (self_modify: false) - All changes are in-memory only; restart restores defaults - Comprehensive test suite (90 tests) Includes a self-awareness skill (always-on) with progressive disclosure: SKILL.md for core rules, references/examples.md for detailed scenarios.	2026-04-16 23:44:26 +08:00
Xubin Ren	92a5125108	Merge PR #3141 : fix(skills): use yaml.safe_load for frontmatter parsing to handle multiline descriptions fix(skills): use yaml.safe_load for frontmatter parsing to handle multiline descriptions	2026-04-16 20:07:15 +08:00
chengyongru	d64e963258	test(memory): add regression tests for missing cursor key Cover read_unprocessed_history skipping cursorless entries and _next_cursor safe fallback when last entry has no cursor.	2026-04-16 12:32:38 +08:00
chengyongru	015833e34b	Merge branch 'main' into fix/skills-yaml-frontmatter	2026-04-15 16:56:23 +08:00
chengyongru	6fbada5363	refactor(context): deduplicate system prompt — markdown skills index, skip template MEMORY.md - Convert skills summary from verbose XML (4-5 lines/skill) to compact markdown list (1 line/skill) with inline path for read_file lookup - Exclude always-loaded skills (e.g. memory) from the skills index to avoid duplicating content already in the Active Skills section - Skip injecting the Memory section when MEMORY.md still matches the bundled template (i.e. Dream hasn't populated it yet)	2026-04-15 15:49:30 +08:00
yanghan-cyber	a1b544fd23	fix(skills): use yaml.safe_load for frontmatter parsing to handle multiline descriptions The hand-rolled line-by-line YAML parser treated each line independently, so YAML multiline scalars (folded `>` and literal `\|`) were captured as the literal characters ">" or "\|" instead of the actual text content.	2026-04-14 15:29:59 +08:00
yeyitech	65a15f39ee	test(loop): cover /stop checkpoint recovery	2026-04-14 14:15:22 +08:00
yeyitech	ee061f0595	fix(web): serialize duckduckgo search calls	2026-04-14 14:10:06 +08:00
Xubin Ren	a38bc637bd	fix(runner): preserve injection flag after max-iteration drain Keep late follow-up injections observable when they are drained during max-iteration shutdown so loop-level response suppression still makes the right decision. Made-with: Cursor	2026-04-14 00:30:30 +08:00
chengyongru	a1e1eed2f1	refactor(runner): consolidate all injection drain paths and deduplicate tests - Migrate "after tools" inline drain to use _try_drain_injections, completing the refactoring (all 6 drain sites now use the helper). - Move checkpoint emission into _try_drain_injections via optional iteration parameter, eliminating the leaky split between helper and caller for the final-response path. - Extract _make_injection_callback() test helper to replace 7 identical inject_cb function bodies. - Add test_injection_cycle_cap_on_error_path to verify the cycle cap is enforced on error exit paths.	2026-04-14 00:30:30 +08:00
chengyongru	d849a3fa06	fix(agent): drain injection queue on error/edge-case exit paths When the agent runner exits due to LLM error, tool error, empty response, or max_iterations, it breaks out of the iteration loop without draining the pending injection queue. This causes leftover messages to be re-published as independent inbound messages, resulting in duplicate or confusing replies to the user. Extract the injection drain logic into a `_try_drain_injections` helper and call it before each break in the error/edge-case paths. If injections are found, continue the loop instead of breaking. For max_iterations (where the loop is exhausted), drain injections to prevent re-publish without continuing.	2026-04-14 00:30:30 +08:00
chengyongru	becaff3e9d	fix(agent): skip auto-compact for sessions with active agent tasks Prevent proactive compaction from archiving sessions that have an in-flight agent task, avoiding mid-turn context truncation when a task runs longer than the idle TTL.	2026-04-13 12:51:37 +08:00
Xubin Ren	6484c7c47a	fix(agent): close interrupted early-persisted user turns Track text-only user messages that were flushed before the turn loop completes, then materialize an interrupted assistant placeholder on the next request so session history stays legal and later turns do not skip their own assistant reply. Made-with: Cursor	2026-04-13 10:26:09 +08:00
Xubin Ren	b964a894d2	test(agent): cover early user-message persistence Use session.add_message for the pre-turn user-message flush and add focused regression tests for crash-time persistence and duplicate-free successful saves. Made-with: Cursor	2026-04-13 10:26:09 +08:00
Xubin Ren	7a7f5c9689	fix(dream): use valid builtin skill template paths Point Dream skill creation at a readable builtin skill-creator template, keep skill writes rooted at the workspace, and document the new skill discovery behavior in README. Made-with: Cursor	2026-04-12 16:49:55 +08:00
Xubin Ren	09c238ca0f	Merge origin/main into pr-2959 Resolve the config plumbing conflicts and keep disabled skill filtering consistent for subagent prompts after syncing with main. Made-with: Cursor	2026-04-12 02:02:39 +00:00
layla	f25cdb7138	Merge branch 'main' into fix/tool-call-result-order-2943	2026-04-11 22:00:07 +08:00
04cb	4cd4ed8ada	fix(agent): preserve tool results on fatal error to prevent orphan tool_calls (#2943 )	2026-04-11 21:50:44 +08:00
Xubin Ren	cf8381f517	feat(agent): enhance message injection handling and content merging	2026-04-11 21:43:23 +08:00
Xubin Ren	f6c39ec946	feat(agent): enhance session key handling for follow-up messages	2026-04-11 21:43:23 +08:00
chengyongru	36d2a11e73	feat(agent): mid-turn message injection for responsive follow-ups (#2985 ) * feat(agent): add mid-turn message injection for responsive follow-ups Allow user messages sent during an active agent turn to be injected into the running LLM context instead of being queued behind a per-session lock. Inspired by Claude Code's mid-turn queue drain mechanism (query.ts:1547-1643). Key design decisions: - Messages are injected as natural user messages between iterations, no tool cancellation or special system prompt needed - Two drain checkpoints: after tool execution and after final LLM response ("last-mile" to prevent dropping late arrivals) - Bounded by MAX_INJECTION_CYCLES (5) to prevent consuming the iteration budget on rapid follow-ups - had_injections flag bypasses _sent_in_turn suppression so follow-up responses are always delivered Closes #1609 * fix(agent): harden mid-turn injection with streaming fix, bounded queue, and message safety - Fix streaming protocol violation: Checkpoint 2 now checks for injections BEFORE calling on_stream_end, passing resuming=True when injections found so streaming channels (Feishu) don't prematurely finalize the card - Bound pending queue to maxsize=20 with QueueFull handling - Add warning log when injection batch exceeds _MAX_INJECTIONS_PER_TURN - Re-publish leftover queue messages to bus in _dispatch finally block to prevent silent message loss on early exit (max_iterations, tool_error, cancel) - Fix PEP 8 blank line before dataclass and logger.info indentation - Add 12 new tests covering drain, checkpoints, cycle cap, queue routing, cleanup, and leftover re-publish	2026-04-11 21:43:23 +08:00
Xubin Ren	322142f7ad	Merge origin/main into main	2026-04-11 09:32:05 +00:00
Xubin Ren	84e840659a	refactor(config): rename auto compact config key Prefer the more user-friendly idleCompactAfterMinutes name for auto compact while keeping sessionTtlMinutes as a backward-compatible alias. Update tests and README to document the retained recent-context behavior and the new preferred key.	2026-04-11 15:56:41 +08:00
Xubin Ren	1cb28b39a3	feat(agent): retain recent context during auto compact Keep a legal recent suffix in idle auto-compacted sessions so resumed chats preserve their freshest live context while older messages are summarized. Recover persisted summaries even when retained messages remain, and document the new behavior.	2026-04-11 15:56:41 +08:00
chengyongru	d03458f034	fix(agent): eliminate race condition in auto compact summary retrieval Make Consolidator.archive() return the summary string directly instead of writing to history.jsonl then reading back via get_last_history_entry(). This eliminates a race condition where concurrent _archive calls for different sessions could read each other's summaries from the shared history file (cross-user context leak in multi-user deployments). Also removes Consolidator.get_last_history_entry() — no longer needed.	2026-04-11 15:56:41 +08:00
chengyongru	fb6dd111e1	feat(agent): auto compact — proactive session compression to reduce token cost and latency (#2982 ) When a user is idle for longer than a configured TTL, nanobot proactively compresses the session context into a summary. This reduces token cost and first-token latency when the user returns — instead of re-processing a long stale context with an expired KV cache, the model receives a compact summary and fresh input.	2026-04-11 15:56:41 +08:00
Xubin Ren	2bef9cb650	fix(agent): preserve interrupted tool-call turns Keep tool-call assistant messages valid across provider sanitization and avoid trailing user-only history after model errors. This prevents follow-up requests from sending broken tool chains back to the gateway.	2026-04-10 05:37:25 +00:00
Xubin Ren	c579d67887	fix(memory): preserve consolidation turn boundaries under chunk cap Made-with: Cursor	2026-04-10 12:58:58 +08:00
Xubin Ren	363a0704db	refactor(runner): update message processing to preserve historical context - Adjusted message handling in AgentRunner to ensure that historical messages remain unchanged during context governance. - Introduced tests to verify that backfill operations do not alter the saved message boundary, maintaining the integrity of the conversation history.	2026-04-10 04:46:48 +00:00
Xubin Ren	c625c0c2a7	Merge origin/main and add regression tests for streaming error delivery - Merged latest main (no conflicts) - Added test_llm_error_not_appended_to_session_messages: verifies error content stays out of session messages - Added test_streamed_flag_not_set_on_llm_error: verifies _streamed is not set when LLM returns an error, so ChannelManager delivers it Made-with: Cursor	2026-04-09 23:10:46 +08:00
yanghan-cyber	10f6c875a5	fix(agent): deliver LLM errors to streaming channels and avoid polluting session context When the LLM returns an error (e.g. 429 quota exceeded, stream timeout), streaming channels silently drop the error message because `_streamed=True` is set in metadata even though no content was actually streamed. This change: - Skips setting `_streamed` when stop_reason is "error", so error messages go through the normal channel.send() path and reach the user - Stops appending error content to session history, preventing error messages from polluting subsequent conversation context - Exposes stop_reason from _run_agent_loop to enable the above check	2026-04-09 23:10:46 +08:00
chenyahui	e9c4fe6824	feat(skills): add disabled_skills config to exclude skills from loading Introduce a disabled_skills option in the config schema that allows users to specify a list of skill names to be excluded. The setting is threaded from config through Nanobot -> AgentLoop -> ContextBuilder -> SkillsLoader. Disabled skills are filtered out from list_skills, get_always_skills, and build_skills_summary. Four new test cases cover the filtering behavior.	2026-04-09 14:11:47 +08:00
Xubin Ren	dadf453097	Merge origin/main into fix/sanitize-messages-non-claude Resolved conflict in azure_openai_provider.py by keeping main's Responses API implementation (role alternation not needed for the Responses API input format). Made-with: Cursor	2026-04-09 04:45:45 +00:00
whs	b4c7cd654e	fix: use effective session key for _active_tasks in unified mode	2026-04-09 11:09:25 +08:00
whs	985f9c443b	tests: add unified_session coverage for /new and consolidation	2026-04-09 11:09:25 +08:00
whs	743e73da3f	feat(session): add unified_session config to share one session across all channels	2026-04-09 11:09:25 +08:00
Xubin Ren	6bf101c79b	fix(hook): keep composite hooks backward compatible Avoid AttributeError regressions when hooks define their own __init__ or when a CompositeHook wraps another composite. Made-with: Cursor	2026-04-08 23:41:31 +08:00
Xubin Ren	c092896922	fix(tool-hint): handle quoted paths in exec hints Preserve path folding for quoted exec command paths with spaces so hint previews do not fall back to mid-path truncation. Add regression coverage for quoted Unix and Windows path cases. Made-with: Cursor	2026-04-08 23:05:52 +08:00
chengyongru	b16865722b	fix(tool-hint): fold paths in exec commands and deduplicate by formatted string 1. exec tool hints previously used val[:40] blind character truncation, cutting paths mid-segment. Now detects file paths via regex and abbreviates them with abbreviate_path. Supports Windows, Unix absolute, and ~/ home paths. 2. Deduplication now compares fully formatted hint strings instead of tool names alone. Fixes ls /Desktop and ls /Downloads being incorrectly merged as "ls /Desktop × 2". Co-authored-by: xzq.xu <zhiqiang.xu@nodeskai.com>	2026-04-08 23:05:52 +08:00
Xubin Ren	edb821e10d	feat(agent): prompt behavior directives, tool descriptions, and loop robustness	2026-04-08 02:22:25 +08:00
Xubin Ren	ce7986e492	fix(memory): add timestamp and cap to recent history injection	2026-04-08 00:03:11 +08:00
Xubin Ren	05d8062c70	test: add regression tests for unprocessed history injection in system prompt Made-with: Cursor	2026-04-07 23:41:05 +08:00
Xubin Ren	82dec12f66	refactor: extract tool hint formatting to utils/tool_hints.py - Move _tool_hint implementation from loop.py to nanobot/utils/tool_hints.py - Keep thin delegation in AgentLoop._tool_hint for backward compat - Update test imports to test format_tool_hints directly Made-with: Cursor	2026-04-07 15:15:07 +08:00
chengyongru	3e3a7654f8	fix(agent): address code review findings for tool hint enhancement - C1: Fix IndexError on empty list arguments via _get_args() helper - I1: Remove redundant branch in _fmt_known - I2: Export abbreviate_path from nanobot.utils.__init__ - I3: Fix _abbreviate_url negative-budget format consistency - S1: Move FORMATS to class-level _TOOL_HINT_FORMATS constant - S2: Add list_dir to FORMATS registry (ls path) - G1-G5: Add tests for empty list args, None args, URL edge cases, mixed folding groups, and list_dir format	2026-04-07 15:15:07 +08:00
chengyongru	8ca9960077	feat(agent): rewrite _tool_hint with registry, path abbreviation, and call folding	2026-04-07 15:15:07 +08:00
Xubin Ren	02597c3ec9	fix(runner): silent retry on empty response before finalization	2026-04-07 15:03:41 +08:00
Jack Lu	bcb8352235	refactor(agent): streamline hook method calls and enhance error logging - Introduced a helper method `_for_each_hook_safe` to reduce code duplication in hook method implementations. - Updated error logging to include the method name for better traceability. - Improved the `SkillsLoader` class by adding a new method `_skill_entries_from_dir` to simplify skill listing logic. - Enhanced skill loading and filtering logic, ensuring workspace skills take precedence over built-in ones. - Added comprehensive tests for `SkillsLoader` to validate functionality and edge cases.	2026-04-06 02:51:10 +08:00

1 2

75 Commits