nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-19 08:02:30 +00:00

History

chengyongru 5ff9146a24 fix(channel): coalesce queued stream deltas to reduce API calls

When LLM generates faster than channel can process, asyncio.Queue
accumulates multiple _stream_delta messages. Each delta triggers a
separate API call (~700ms each), causing visible delay after LLM
finishes.

Solution: In _dispatch_outbound, drain all queued deltas for the same
(channel, chat_id) before sending, combining them into a single API
call. Non-matching messages are preserved in a pending buffer for
subsequent processing.

This reduces N API calls to 1 when queue has N accumulated deltas.

2026-03-27 21:43:57 +08:00

agent

refactor: unify agent runner lifecycle hooks

2026-03-27 12:41:17 +08:00

channels

fix(channel): coalesce queued stream deltas to reduce API calls

2026-03-27 21:43:57 +08:00

cli

refactor: replace litellm with native openai + anthropic SDKs

2026-03-25 01:58:48 +08:00

config

refactor(tests): optimize unit test structure