nanobot

mirror/nanobot

Fork 0

mirror of https://github.com/HKUDS/nanobot.git synced 2026-04-11 21:53:37 +00:00

Commit Graph

Author	SHA1	Message	Date
chengyongru	6af81bc4a3	feat(agent): auto compact — proactive session compression to reduce token cost and latency (#2982 ) When a user is idle for longer than a configured TTL, nanobot proactively compresses the session context into a summary. This reduces token cost and first-token latency when the user returns — instead of re-processing a long stale context with an expired KV cache, the model receives a compact summary and fresh input.	2026-04-10 17:43:42 +08:00

Author

SHA1

Message

Date

chengyongru

6af81bc4a3

feat(agent): auto compact — proactive session compression to reduce token cost and latency (#2982 )

When a user is idle for longer than a configured TTL, nanobot **proactively** compresses the session context into a summary. This reduces token cost and first-token latency when the user returns — instead of re-processing a long stale context with an expired KV cache, the model receives a compact summary and fresh input.

2026-04-10 17:43:42 +08:00

1 Commits