nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-04-14 23:19:55 +00:00

Author	SHA1	Message	Date
Xubin Ren	cf8381f517	feat(agent): enhance message injection handling and content merging	2026-04-11 21:43:23 +08:00
Xubin Ren	f6c39ec946	feat(agent): enhance session key handling for follow-up messages	2026-04-11 21:43:23 +08:00
chengyongru	36d2a11e73	feat(agent): mid-turn message injection for responsive follow-ups (#2985 ) * feat(agent): add mid-turn message injection for responsive follow-ups Allow user messages sent during an active agent turn to be injected into the running LLM context instead of being queued behind a per-session lock. Inspired by Claude Code's mid-turn queue drain mechanism (query.ts:1547-1643). Key design decisions: - Messages are injected as natural user messages between iterations, no tool cancellation or special system prompt needed - Two drain checkpoints: after tool execution and after final LLM response ("last-mile" to prevent dropping late arrivals) - Bounded by MAX_INJECTION_CYCLES (5) to prevent consuming the iteration budget on rapid follow-ups - had_injections flag bypasses _sent_in_turn suppression so follow-up responses are always delivered Closes #1609 * fix(agent): harden mid-turn injection with streaming fix, bounded queue, and message safety - Fix streaming protocol violation: Checkpoint 2 now checks for injections BEFORE calling on_stream_end, passing resuming=True when injections found so streaming channels (Feishu) don't prematurely finalize the card - Bound pending queue to maxsize=20 with QueueFull handling - Add warning log when injection batch exceeds _MAX_INJECTIONS_PER_TURN - Re-publish leftover queue messages to bus in _dispatch finally block to prevent silent message loss on early exit (max_iterations, tool_error, cancel) - Fix PEP 8 blank line before dataclass and logger.info indentation - Add 12 new tests covering drain, checkpoints, cycle cap, queue routing, cleanup, and leftover re-publish	2026-04-11 21:43:23 +08:00
Xubin Ren	2bef9cb650	fix(agent): preserve interrupted tool-call turns Keep tool-call assistant messages valid across provider sanitization and avoid trailing user-only history after model errors. This prevents follow-up requests from sending broken tool chains back to the gateway.	2026-04-10 05:37:25 +00:00
Xubin Ren	363a0704db	refactor(runner): update message processing to preserve historical context - Adjusted message handling in AgentRunner to ensure that historical messages remain unchanged during context governance. - Introduced tests to verify that backfill operations do not alter the saved message boundary, maintaining the integrity of the conversation history.	2026-04-10 04:46:48 +00:00
Xubin Ren	c625c0c2a7	Merge origin/main and add regression tests for streaming error delivery - Merged latest main (no conflicts) - Added test_llm_error_not_appended_to_session_messages: verifies error content stays out of session messages - Added test_streamed_flag_not_set_on_llm_error: verifies _streamed is not set when LLM returns an error, so ChannelManager delivers it Made-with: Cursor	2026-04-09 23:10:46 +08:00
yanghan-cyber	10f6c875a5	fix(agent): deliver LLM errors to streaming channels and avoid polluting session context When the LLM returns an error (e.g. 429 quota exceeded, stream timeout), streaming channels silently drop the error message because `_streamed=True` is set in metadata even though no content was actually streamed. This change: - Skips setting `_streamed` when stop_reason is "error", so error messages go through the normal channel.send() path and reach the user - Stops appending error content to session history, preventing error messages from polluting subsequent conversation context - Exposes stop_reason from _run_agent_loop to enable the above check	2026-04-09 23:10:46 +08:00
Xubin Ren	edb821e10d	feat(agent): prompt behavior directives, tool descriptions, and loop robustness	2026-04-08 02:22:25 +08:00
Xubin Ren	02597c3ec9	fix(runner): silent retry on empty response before finalization	2026-04-07 15:03:41 +08:00
Xubin Ren	e4b335ce81	refactor: extract runtime response guards into utils runtime module	2026-04-02 13:54:40 +00:00
Xubin Ren	714a4c7bb6	fix(runtime): address review feedback on retry and cleanup	2026-04-02 10:57:12 +00:00
Xubin Ren	eefd7e60f2	Merge remote-tracking branch 'origin/main' into feat/runtime-hardening	2026-04-02 10:40:49 +00:00
chengyongru	da08dee144	feat(provider): show cache hit rate in /status (#2645 )	2026-04-02 12:51:45 +08:00
Xubin Ren	fbedf7ad77	feat: harden agent runtime for long-running tasks	2026-04-01 19:12:49 +00:00
Xubin Ren	5bf0f6fe7d	refactor: unify agent runner lifecycle hooks	2026-03-27 12:41:17 +08:00
Xubin Ren	e7d371ec1e	refactor: extract shared agent runner and preserve subagent progress on failure	2026-03-27 02:49:43 +08:00

16 Commits