nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-04-27 13:25:52 +00:00

Author	SHA1	Message	Date
hlg	899a9073ce	fix(memory): do not fall back to raw entry when strip_think empties it `append_history` previously used `strip_think(entry) or entry.rstrip()` as a safety net, so if the entire entry was a template-token leak (e.g. `<think>reasoning</think>` or `<channel\|>` alone), the raw leaked text was still persisted to history — later re-introducing the very content `strip_think` was meant to scrub, via consolidation / replay. Persist the cleaned content directly. When cleanup empties a non-empty entry, log at debug and store an empty-content record (cursor continuity preserved). Adds 3 regression tests in test_memory_store.py covering: - Well-formed thinking blocks are stripped before persistence. - Pure-leak entries persist as empty, not as raw text. - Malformed prefix leaks (`<channel\|>`) also persist as empty.	2026-04-20 17:04:48 +08:00
hlg	8e7d8bef6a	fix(utils): handle malformed think tags and channel markers in strip_think Some models / Ollama renderers occasionally emit tokenizer-level template leaks that the existing regexes miss: 1. Malformed opening tags with no closing `>`, running straight into user-facing content — e.g. `<think广场照明灯目前…` (observed with Gemma 4 via Ollama). The earlier `<think>[\s\S]?</think>` and `^\s<think>[\s\S]$` patterns both require `>`, so these leak into rendered messages. 2. Harmony-style channel markers like `<channel\|>` / `<\|channel\|>` at the start of a response. 3. Orphan `</think>` / `</thought>` closing tags left behind when only the opener was consumed upstream. Handles each case conservatively: - Malformed `<think` / `<thought` only match when the next char is NOT a tag-name continuation (`[A-Za-z0-9_\-:>/]`). Explicit ASCII class instead of `\w` because Python's Unicode `\w` matches CJK and would defeat the primary fix. - Orphan closing tags and channel markers are stripped only at the start or end of the text*. `strip_think` is also applied before persisting history (memory.py), so mid-text stripping would silently rewrite transcripts where the tokens themselves are discussed. Preserves: `<thinker>`, `<think-foo>`, `<think_foo>`, `<think1>`, `<think:foo>`, `<thought/>`, literal `` `</think>` `` / `` `<channel\|>` `` inside prose or code blocks. Adds 16 new regression tests covering both the leak cases and the preserved-prose cases.	2026-04-20 17:04:48 +08:00
chengyongru	f900c5bb8e	fix(telegram): address code review issues from cherry-pick merge - Fix critical plain-text fallback that was sending raw HTML tags to users: keep raw markdown available for the fallback path - Extract TELEGRAM_HTML_MAX_LEN (4096) constant to replace hardcoded magic number and document the difference from TELEGRAM_MAX_MESSAGE_LEN - Add fallback to _send_text for extra HTML chunks when HTML parse fails - Add missing @pytest.mark.asyncio decorator on test_send_delta_stream_end_html_expansion_does_not_overflow	2026-04-20 16:58:46 +08:00
stutiredboy	2eea82f5ee	fix(telegram): split oversized stream buffer mid-flight Cherry-picked from #3311 (stutiredboy). Streaming edits called edit_message_text(text=buf.text) without chunking, so once accumulated deltas crossed Telegram's 4096-char limit an ongoing stream would fail with BadRequest. Extracts _flush_stream_overflow helper that edits the first chunk in place, sends any middle chunks, and re-anchors the buffer to a new message for the tail so subsequent deltas keep streaming. Co-Authored-By: stutiredboy <stutiredboy@users.noreply.github.com>	2026-04-20 16:58:46 +08:00
himax12	fd8f08cc83	fix(telegram): convert markdown to HTML before splitting to avoid message length overflow Cherry-picked from #3316 (himax12). When streaming completes in send_delta(), the code was splitting raw markdown text by 4000, then converting to HTML. The markdown-to-HTML conversion adds 10-33% characters, which could push the result over Telegram's 4096 character limit. The fix converts markdown to HTML first, then splits by 4096 (actual Telegram limit), ensuring the edited message always fits. Fixes #3315	2026-04-20 16:58:46 +08:00
jhkim43	297b852f6e	feat(telegram): change to mid-stream split per review feedback(#2967 PR)	2026-04-20 16:58:46 +08:00
chengyongru	ecfbb0ed4f	refactor(email): use _remember_processed_uid in SPF/DKIM reject paths Replaces inline dedup logic with the existing helper to match the style of _is_self_address and other reject branches, and to keep the _processed_uids eviction logic in one place.	2026-04-20 16:46:49 +08:00
flobo3	ffac8d3b0a	fix: deduplicate SPF/DKIM-rejected emails to stop log spam	2026-04-20 16:46:49 +08:00
Xubin Ren	26fd2c099a	build: ship THIRD_PARTY_NOTICES and fix webui packaging in wheel	2026-04-20 08:22:10 +00:00
chengyongru	68466b1c2a	fix(agent): propagate effective session key through subagent pipeline The previous fix hardcoded session_key_override as channel:chat_id which broke unified session mode where pending queues use "unified:default". Propagate the effective key from _set_tool_context through SpawnTool into the origin dict so _announce_result routes to the correct pending queue in both normal and unified session modes.	2026-04-20 14:47:14 +08:00
chengyongru	2193a64c80	fix(agent): align subagent result session key with main agent for mid-turn injection When mid-turn message injection (PR #2985) was introduced, the pending queue routing uses the effective session key to match incoming messages against active sessions. Subagent results, however, use channel="system" which produces a session key of "system:feishu:ou_..." instead of the main agent's "feishu:ou_...", causing the result to bypass the pending queue and be dispatched as a competing independent task. Fix: set session_key_override to the original channel:chat_id so _effective_session_key returns the correct key and the subagent result gets routed into the main agent's pending queue.	2026-04-20 14:47:14 +08:00
chengyongru	79821a571f	fix: suppress intermediate progress output in cron jobs Cron jobs now pass on_progress=_silent to process_direct, matching the heartbeat pattern. Previously, tool hints and streaming deltas were published to the user channel via bus during execution, but the final response could be rejected by evaluate_response — leaving users with confusing partial output and no conclusion. Closes #3319	2026-04-20 11:43:54 +08:00
chengyongru	8eddacf2f8	fix(webui): sync code block theme with dark mode toggle instantly - Replace one-time DOM read with MutationObserver on <html> class - Remove hardcoded #0a0a0a background, let oneDark/oneLight own it - Add light-mode header/copy-button colors (bg-zinc-100 for light) - Bump font size from 13px to 14px, line-height from 1.55 to 1.6 - Add subtle border to distinguish code block edges	2026-04-20 00:21:07 +08:00
chengyongru	a3adec08a9	style(webui): improve typography with Apple-inspired font stack and CJK support - Add explicit CJK fonts (PingFang SC, Noto Sans SC, Microsoft YaHei) and programmer fonts (JetBrains Mono, Fira Code, Cascadia Code) to Tailwind config - Bump prose base size from prose-sm (14px) to prose-lg (18px) for sharper CJK rendering - Unify user/assistant message font size at 18px with CJK-aware line-height (1.8) - Replace pure black/white foreground with Apple-style warm grays (#1d1d1f / #f5f5f7) - Override Tailwind Typography colors to use design tokens for consistency - Add negative letter-spacing on headings for tighter, more polished look	2026-04-20 00:21:07 +08:00
Xubin Ren	56a779c128	fix(session): repair read-only corrupt session paths	2026-04-20 00:17:50 +08:00
aiguozhi123456	efb04a1712	fix(session): use atomic writes and add corrupt-file repair SessionManager.save() previously used bare open("w") which could truncate the JSONL file if the process crashed mid-write. Now writes to a .tmp file and atomically replaces via os.replace(), matching the pattern already used in qq.py. _load() now attempts _repair() before returning None, recovering valid lines from partially-written files. 12 new tests cover atomic save correctness, temp-file cleanup on failure, and repair of truncated/corrupt JSONL. cowork-with:opencode(glm-5.1)	2026-04-20 00:17:50 +08:00
Alfredo Arenas	5d976d79ff	test(discord): update tests for bot-to-bot fix (#3217 ) The old test `test_on_message_ignores_bot_messages` asserted the previous (incorrect) contract that ALL bot-authored messages are dropped. With #3217 only self-loops are dropped, so this test was replaced with three more precise tests: - test_on_message_ignores_self_messages: verifies self-loop guard (author_id == _bot_user_id is dropped) - test_on_message_accepts_messages_from_other_bots: new test for the fix itself — other bots' messages flow through - test_on_message_stops_typing_on_handle_exception: preserves the typing cleanup assertion from the original test Net result: +1 behavior tested, same behaviors retained. Co-authored with Claude Opus 4.7	2026-04-19 23:32:40 +08:00
Alfredo Arenas	3fd24c72fd	fix(discord): allow bot-to-bot messaging, only drop self-loops (#3217 ) Previously the Discord channel dropped every message from any bot account via `if message.author.bot`, which prevented legitimate multi-agent setups (one bot asking another for help, bot-to-bot @mentions, etc.) from working. Narrow the guard to only drop messages from this bot's own account by comparing against self._bot_user_id (already populated in on_ready). Self-loop protection is preserved — each bot instance still ignores its own outbound messages. Co-authored with Claude Opus 4.7	2026-04-19 23:32:40 +08:00
coldxiangyu	7527961b19	fix(cron): drop top-level oneOf so OpenAI Codex/Responses accept tool schema PR #3125 added a top-level `oneOf` branch to `_CRON_PARAMETERS` to advertise per-action required fields. OpenAI Codex/Responses rejects `oneOf`/`anyOf`/`allOf`/`enum`/`not` at the root of function parameters, so any agent that registers the cron tool now fails to start with: HTTP 400: Invalid schema for function 'cron': schema must have type 'object' and not have 'oneOf'/'anyOf'/'allOf'/'enum'/'not' at the top level. Remove the top-level `oneOf`. The original intent of #3125 (stop LLMs from looping on the #3113 contract mismatch) is preserved by: - `validate_params` — runtime-enforces `message` for `action='add'` and `job_id` for `action='remove'` - field descriptions — each schema field already flags "REQUIRED when action='...'" so the LLM sees the contract The regression test is updated to lock the invariant in the other direction: the top-level schema must not contain `oneOf`/`anyOf`/`allOf`/`not`, and the REQUIRED hints must stay on `message` and `job_id`. Verified: - tests/cron/ 70 passed - tests/agent/test_loop_cron_timezone.py + tests/providers/ 232 passed Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2026-04-19 21:54:38 +08:00
Xubin Ren	97ae9cb318	docs: refine README for WebUI development workflow clarity	2026-04-19 13:42:02 +00:00
Xubin Ren	d920f07715	Merge PR #3310 : feat(webui): add initial browser UI with websocket chat and i18n feat(webui): add initial browser UI with websocket chat and i18n	2026-04-19 21:41:07 +08:00
Xubin Ren	b3049f7323	fix(webui): stabilize empty session history state	2026-04-19 13:38:47 +00:00
Xubin Ren	f9e1d92abd	docs: update README and webui documentation for WebUI development workflow	2026-04-19 13:10:36 +00:00
Xubin Ren	c4b3837c5f	Merge remote-tracking branch 'origin/main' into nanobot-webui	2026-04-19 12:36:52 +00:00
Xubin Ren	46e11a68a7	test: speed up cron and restart timing tests Replace fixed sleep-based waits with condition polling in cron tests and mock the restart delay in CLI restart tests to reduce suite runtime without changing behavior.	2026-04-19 12:35:57 +00:00
Xubin Ren	b6d63fb1ec	fix: normalize responses circuit breaker keys Made-with: Cursor	2026-04-19 20:16:25 +08:00
Mohamed Elkholy	3036b16140	style: fix import sorting (ruff I001)	2026-04-19 20:16:25 +08:00
Mohamed Elkholy	4aad6b737d	style: move loguru import to module top level Addresses reviewer suggestion to keep imports conventional.	2026-04-19 20:16:25 +08:00
Mohamed Elkholy	baba3b2160	fix(providers): add circuit breaker for Responses API fallback When the Responses API fails repeatedly (3 consecutive compatibility errors), skip it and fall back directly to Chat Completions. Unlike a permanent disable, the circuit re-probes after 5 minutes so recovery is automatic when the API comes back. Success resets the counter. Keyed per (model, reasoning_effort) so a failure with one model does not affect others.	2026-04-19 20:16:25 +08:00
Xubin Ren	ccd6c05f71	fix: include pending summaries in consolidation estimates Made-with: Cursor	2026-04-19 20:06:11 +08:00
Xubin Ren	54b659929e	test: cover summary persistence after token consolidation Made-with: Cursor	2026-04-19 20:06:11 +08:00
Jiajun Xie	d95bc9c9c4	fix: unify summary injection strategy between consolidation paths - Track last_summary in maybe_consolidate_by_tokens() to persist the summary - Change return to break in the consolidation loop to allow summary persistence - Save summary to session.metadata['_last_summary'] for consistency with AutoCompact._archive() - Ensures compressed content remains visible to the model via prepare_session() injection Fixes #3274	2026-04-19 20:06:11 +08:00
Xubin Ren	107eae14d7	docs: add badges for commit activity and closed issues in README	2026-04-19 19:25:05 +08:00
Xubin Ren	508e247c82	docs: remove feature showcase and update memory and Python SDK documentation for clarity and completeness	2026-04-19 19:25:05 +08:00
Xubin Ren	ed150a4228	docs: enhance README installation instructions for better readability	2026-04-19 19:25:05 +08:00
Xubin Ren	622c467839	docs: refine README description for clarity	2026-04-19 19:25:05 +08:00
Xubin Ren	53fb3c199a	docs: update README and docs for clarity and consistency	2026-04-19 19:25:05 +08:00
Xubin Ren	8ff7b56cb2	docs: refactor README into a docs-first landing page	2026-04-19 19:25:05 +08:00
Xubin Ren	4650b23d75	feat(webui): add i18n support and locale switcher	2026-04-19 06:39:06 +00:00
Xubin Ren	be10ba1f0d	Merge remote-tracking branch 'origin/main' into nanobot-webui	2026-04-19 05:15:27 +00:00
Alfredo Arenas	2d0442976e	test(cli): update _make_console tests for isatty-based fix (#3265 ) The old test `test_make_console_uses_force_terminal` hardcoded `force_terminal is True`, which contradicts the fix: we now defer to sys.stdout.isatty() so piped / non-TTY output gets plain text instead of ANSI escape codes. Split into two tests covering both branches: - test_make_console_force_terminal_when_stdout_is_tty: TTY path (force_terminal=True, rich output) - test_make_console_force_terminal_false_when_stdout_is_not_tty: non-TTY path (force_terminal=False, plain text) — regression guard for the bug reported in #3265 Co-authored with Claude Opus 4.7	2026-04-19 04:19:59 +08:00
Alfredo Arenas	261b843839	fix(cli): respect sys.stdout.isatty() in stream renderer (#3265 )	2026-04-19 04:19:59 +08:00
Xubin Ren	9773d4b8ab	Merge PR #3112 : fix(config): return provider default api base in config resolution fix(config): return provider default api base in config resolution	2026-04-19 04:14:46 +08:00
Xubin Ren	384bad17b4	Merge origin/main into fix/config-default-api-base Made-with: Cursor	2026-04-18 20:08:21 +00:00
Xubin Ren	3218307f80	Merge PR #3125 : fix: harden cron tool contract fix: harden cron tool contract	2026-04-19 04:01:27 +08:00
Xubin Ren	9c0dc8b276	fix: drop generic repeated tool-call guard The global guard changed baseline agent and subagent behavior without proving a real no-progress loop. Keep this PR focused on the cron contract hardening and validation fixes. Made-with: Cursor	2026-04-18 19:59:58 +00:00
Xubin Ren	adc1e843b4	Merge origin/main into fix/cron-contract-repeat-guard Made-with: Cursor	2026-04-18 19:42:48 +00:00
Xubin Ren	e08507f3ce	fix: handle git worktrees in GitStore nested repo protection Treat `.git` files the same as `.git` directories so GitStore refuses to initialize inside git worktrees, and add a focused regression test for that checkout shape. Made-with: Cursor	2026-04-19 03:38:22 +08:00
Lê Bảo Long	ff5b97dc34	Remove .oss from .gitignore	2026-04-19 03:38:22 +08:00
longle325	fb28678b64	fix: prevent GitStore from creating nested repos and overwriting .gitignore (#2980 ) GitStore.init() now checks if the workspace is already inside a git repository before calling porcelain.init(). If so, it refuses to create a nested repo. Additionally, existing .gitignore files are preserved by appending only missing Dream-specific entries rather than overwriting. Closes #2980	2026-04-19 03:38:22 +08:00

1 2 3 4 5 ...

2159 Commits