nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-20 16:42:25 +00:00

Author	SHA1	Message	Date
chengyongru	2e56fb95b6	fix(weixin): treat ret=-2 as rate limit with 60s backoff Reference wxclawbot-cli docs: ret=-2 is a rate limit (~7 msgs / 5 min per bot), NOT a stale session signal. Empty/missing errmsg is the normal rate-limit response; only 'unknown error' correlates with stale session per hermes-agent. Changes: - _is_stale_session_ret: only match 'unknown error', not empty errmsg - _send_text/_send_media_file: on ret=-2 wait 60s then retry once instead of retrying without context_token - Remove stale-session retry for empty errmsg (was burning quota) - Update tests to cover rate-limit backoff path	2026-05-08 10:09:12 +08:00
chengyongru	9665a0bb1a	fix(weixin): distinguish stale session from rate limit on ret=-2 Reference hermes-agent#17228 / #18100 / PR#18105. iLink returns ret=-2 / errcode=-2 for two different reasons: - stale context_token: errmsg is empty/None or "unknown error" - genuine rate limit: errmsg is populated (e.g. "frequency limit") Previously we swallowed all ret=-2 responses, which caused silent message drops when the context_token was stale. Changes: - Add _is_stale_session_ret() to detect empty/"unknown error" errmsg - _send_text/_send_media_file retry once without context_token on stale session signal, then raise on persistent failure so ChannelManager can retry with backoff - Remove error-swallowing behavior - Update tests to expect raises and add TestIsStaleSessionRet coverage	2026-05-08 09:40:53 +08:00
chengyongru	639b4bae32	fix(weixin): treat ret=-2 as non-fatal on sendmessage and align client_id format The iLink sendmessage API frequently returns ret=-2 (parameter error / rate limit / expired token) even when HTTP status is 200. The openclaw reference plugin ignores the JSON body for sendmessage entirely and only checks HTTP status. Our previous strict ret checking turned ret=-2 into RuntimeError, causing ChannelManager retries which only made things worse. Changes: - _send_text: swallow ret=-2 after one retry without context_token. Log request body + response at warning level for diagnostics. - _send_media_file: same ret=-2 swallowing. - _generate_client_id: change format to ``nanobot:{timestamp}-{hex}`` to match openclaw-weixin ``{prefix}:{Date.now()}-{hex}``. - Update tests to expect swallowing instead of raising for ret=-2.	2026-05-07 18:07:32 +08:00
chengyongru	9b4df02651	fix(weixin): retry send without expired context_token on ret=-2 When the iLink API returns ret=-2 (parameter error), it is often caused by an expired context_token rather than a malformed payload. After a gateway restart, the cached token can become stale within ~90 seconds if no new inbound message refreshes it, causing all outbound replies to fail silently. Changes: - _send_text: retry once without context_token when ret=-2 and a token was present; if the retry succeeds, clear the expired token from cache. - Remove leftover @staticmethod on _check_response_error so self.logger and the body parameter work correctly. - Bump WEIXIN_CHANNEL_VERSION from 2.1.1 -> 2.1.7 to match the reference openclaw-weixin plugin. - Add tests covering the ret=-2 retry path, failure path, and no-token path. References: - openclaw/openclaw#61174 (context_token expiry after long agent turns) - hermes-agent#21011 (ret=-2 rate limiting / parameter error)	2026-05-07 17:38:11 +08:00
chengyongru	2b57766743	fix(weixin): check both ret and errcode on send to avoid silent drops The iLink API signals failures through either `ret` or `errcode`. `_poll_once` already checked both, but `_send_text` and `_send_media_file` only checked `errcode`. When the API returned `ret != 0` with `errcode == 0`, the send appeared successful but the message was never delivered, causing the "still losing messages" issue. - Add `_check_response_error` helper that validates both fields - Use it in `_send_text` and `_send_media_file` - Add debug log after successful text send for observability - Add test for nonzero ret with zero errcode Refs: previous inbound fix (suppress -> explicit try/except)	2026-05-07 16:20:08 +08:00
chengyongru	563fcaf002	fix(weixin): log exceptions instead of silently dropping messages in poll loop Replace `with suppress(Exception)` in `_poll_once` message processing and the `start()` poll loop with explicit `try/except` blocks that log errors via `logger.exception`. Previously, any exception during message processing (e.g. in `_handle_message`) was swallowed silently, causing inbound messages to disappear without a trace. Also add tests verifying that: - `_poll_once` logs and continues when `_process_message` fails - the poll loop logs and continues when `_poll_once` fails	2026-05-07 15:15:53 +08:00
Xubin Ren	ac18a8baad	feat(webui): add localized slash commands Add a session-scoped slash command palette sourced from backend command metadata, and keep welcome-page quick actions localized across all WebUI languages. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-07 00:20:28 +08:00
chengyongru	49c07aa45a	style: address code review feedback - Consistent "WeChat" prefix in context_token error message - Use object() instead of httpx.AsyncClient() in new tests to avoid resource leak warnings	2026-05-06 23:52:50 +08:00
chengyongru	98c2f7cc27	fix(weixin): raise exceptions instead of silently dropping messages _send_text() swallowed API errors (non-zero errcode) with just a warning log, and send() had three silent return paths (no client, session paused, no context_token). Neither triggered ChannelManager's retry logic, causing persistent message loss until a new inbound message refreshed the context_token. Now all failure paths raise RuntimeError, matching BaseChannel's contract and enabling proper retry behavior.	2026-05-06 23:52:50 +08:00
chengyongru	4efd904ccc	fix(webui): require token_issue_secret for LAN access with frontend auth When host is set to 0.0.0.0, the gateway now enforces that either token or token_issue_secret must be configured — it refuses to start otherwise. Bootstrap endpoint behavior: - token_issue_secret configured: always validate regardless of source IP (handles reverse-proxy scenarios where all connections appear as localhost) - No secret: only localhost can bootstrap (local dev mode) The frontend shows an authentication form when bootstrap returns 401/403, persists the secret in localStorage, and retries automatically on reload.	2026-05-06 23:51:51 +08:00
chengyongru	034bea1a44	fix(webui): require token_issue_secret for non-localhost bootstrap The previous LAN-access fix (PR #3656) relaxed the bootstrap localhost check when host was 0.0.0.0, but did not require any authentication — any device on the network could obtain a token without credentials. New behavior: - token_issue_secret configured: always validate, regardless of source IP (handles reverse-proxy scenarios where all connections appear as localhost). - No secret configured: only localhost can bootstrap (local dev mode). This supersedes the host-based check from PR #3656.	2026-05-06 23:51:51 +08:00
chengyongru	bad584cb0e	fix(webui): allow LAN access when host is 0.0.0.0 The webui bootstrap endpoint (/webui/bootstrap) rejected all non-localhost connections with HTTP 403, preventing the embedded webui from working when accessed from another device on the LAN — even when host was set to 0.0.0.0. Skip the localhost check when the server is explicitly bound to 0.0.0.0 or ::, since that signals intent to accept external connections.	2026-05-06 23:00:23 +08:00
Xubin Ren	790a03ec28	feat(webui): polish chat layout and titles Align the WebUI sidebar and chat chrome with the updated design, and generate WebUI session titles asynchronously without blocking turns. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-06 22:20:35 +08:00
chengyongru	40b4e01b13	merge: resolve conflict with main in transcription.py Keep _post_transcription_with_retry from PR branch, drop inline httpx calls that were replaced by the shared retry helper.	2026-05-06 21:26:28 +08:00
chengyongru	05e0106592	refactor(logging): preserve tracebacks and add channel context - Preserve tracebacks: logger.error in except blocks → logger.exception - Channel context: BaseChannel injects self.logger = logger.bind(channel=name) - Third-party bridge: redirect_lib_logging() replaces ad-hoc stdlib-to-loguru bridges - Log levels: network timeouts downgraded from ERROR → WARNING - Fix --verbose flag to actually work with loguru (set handler to DEBUG)	2026-05-06 21:17:45 +08:00
mohamed-elkholy95	7ebf611be8	fix(transcription): retry Whisper calls and guard malformed responses A single transient failure between the agent and an OpenAI/Groq Whisper endpoint currently vanishes as `return ""` in transcribe(). The voice message arrives as the empty string and there is no way to tell real silence apart from a failed upload. A malformed but successful response body is even worse: the JSON-decode error escapes the helper unhandled. Add a shared `_post_transcription_with_retry` used by both providers. Retry behaviour: - exponential backoff 1s -> 2s -> 4s, up to 3 retries (4 attempts) - retryable HTTP statuses: 408, 429, 500, 502, 503, 504 - retryable exceptions: TimeoutException, ConnectError, ReadError, WriteError, RemoteProtocolError Non-transient failures short-circuit to "" on the first attempt -- retrying a misconfigured key or a broken upload only burns rate-limit quota. Branches that short-circuit: - missing API key, missing audio file - file-read errors (PermissionError, OSError) on the audio path, preserving the nightly contract for direct provider callers - HTTP auth/4xx body issues via raise_for_status() - response.json() parse failures - non-dict JSON payloads Sharing one helper means OpenAI and Groq cannot drift apart silently. Thread `language` through the helper. The multipart files dict is rebuilt inside the per-attempt loop, so when a caller sets self.language the `language` field is sent on every attempt -- not just the first. Tests cover: - every advertised retryable status and exception, parameterized - language present on attempts 1 and 2 of a 503->200 sequence - language absent when unset; present when set (both providers) - malformed JSON body and non-dict JSON body short-circuit to "" - PermissionError on file read short-circuits with no HTTP attempt - max-attempts give-up, exponential-backoff schedule, auth no-retry, missing-key / missing-file short-circuit Test stub fix: the _StubResponse in tests/channels/test_channel_plugins.py declared no status_code, which the new helper reads for retry classification. Set status_code = 200 so the stub advertises the successful response that those tests already simulate. Also moved the two transcription-provider imports to the top of that file (previously placed mid-file) so the file is ruff-clean (E402).	2026-05-06 15:52:25 +08:00
Xubin Ren	4db50f2e32	fix(channels): reject unauthorized inbound before side effects Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-05 23:16:36 +08:00
Xubin Ren	1813fc5021	test(telegram): cover silent allowlist rejection Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-05 23:16:36 +08:00
futurist	358997554c	fix-feishu-media-path	2026-05-05 22:28:44 +08:00
ramonpaolo	be83525f99	test(webui): cover turn-end streaming regressions	2026-05-03 22:28:40 +08:00
Xubin Ren	2fa15ccf1b	fix: improve media failure diagnostics and token fallback coverage	2026-05-02 11:37:07 +00:00
coldxiangyu	4860a9a6c9	fix(matrix): stop sync loop on irrecoverable auth errors When the Matrix homeserver returns M_UNKNOWN_TOKEN / M_FORBIDDEN / M_UNAUTHORIZED (or soft_logout), the previous _sync_loop kept retrying sync_forever every 2 seconds forever, spamming the homeserver and filling logs (#1851). The auth state cannot recover by retrying, so this is pure noise and a soft DoS on the homeserver. - Extract `_is_fatal_auth_response()` helper - In `_on_sync_error`, on fatal auth: set `_running=False` and call `stop_sync_forever()` so the loop exits cleanly - Add exponential backoff (2s → 60s cap) to the generic exception path in `_sync_loop` so transient network blips also stop hammering Closes #1851 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 23:59:09 +08:00
Xubin Ren	e157392250	fix(agent): scope subagent reply dedupe to origin message Made-with: Cursor	2026-05-01 11:47:24 +00:00
hinotoi-agent	ad952e0da2	fix(dingtalk): block SSRF in outbound media fetches	2026-05-01 19:31:45 +08:00
copilot-swe-agent[bot]	0284174df9	fix: prevent empty Matrix messages when progress callback sends empty content Agent-Logs-Url: https://github.com/halldorjanetzko/nanobot/sessions/df528c59-8214-41a0-9b79-9d1d41857107 Co-authored-by: halldorjanetzko <158819146+halldorjanetzko@users.noreply.github.com>	2026-05-01 19:31:04 +08:00
coldxiangyu	15007afd4a	fix(matrix): skip events received before bot startup Matrix sync replays the room timeline on each startup or `/restart`, causing already-handled messages to be reprocessed (#3553). Even with `store_sync_tokens=True`, the sync token isn't reliably re-injected when restoring a session via access_token + load_store(), so the client re-reads recent timeline entries. Filter `event.server_timestamp` against the process start time so old events are dropped at the `_on_message` / `_on_media_message` entry points. Trade-off: messages received during downtime won't be processed, which matches the issue reporter's expectation. Closes #3553 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 19:30:33 +08:00
Xubin Ren	f8fd9f0011	fix(feishu): keep streaming replies in existing topics Made-with: Cursor	2026-04-30 13:42:37 +08:00
hanyuanling	d82f25e4d4	fix(feishu): respect reply_to_message for group threads	2026-04-30 13:42:37 +08:00
Xubin Ren	26e953f0b9	Revert "fix(feishu): streaming card and tool hint respect reply_to_message in…" This reverts commit 651b6b933f2db26713b5668d0c103d1b022e858c.	2026-04-30 13:27:37 +08:00
04cb	651b6b933f	fix(feishu): streaming card and tool hint respect reply_to_message in groups	2026-04-30 12:51:08 +08:00
chengyongru	74270bb8a8	refactor(channels): resolve progress overrides at init-time like transcription	2026-04-29 16:43:09 +08:00
hanyuanling	a0443e8f9e	fix(channels): address progress override review	2026-04-29 16:43:09 +08:00
hanyuanling	0b111a0e0c	fix(channels): support per-channel progress controls	2026-04-29 16:43:09 +08:00
甘全	0053e68423	fix(feishu): skip reaction transition on resuming stream end Stream-end events are emitted at the end of every assistant turn. When the agent has more tool-call rounds queued, the runner sets `_resuming=True` on the metadata. Without a guard, every intermediate stream end removed the OnIt reaction (the first one wins, since `_reaction_ids.pop` empties the slot) and re-added `done_emoji`, producing a DONE reaction after every tool call instead of only at final completion. Wrap the OnIt removal and `done_emoji` add in a `not _resuming` guard so the OnIt indicator persists across tool-call rounds and DONE fires exactly once when the agent's final response lands. `_resuming` already flows through outbound metadata (`nanobot/agent/loop.py:747`) and survives `_coalesce_stream_deltas` because pure `_stream_end` messages without `_stream_delta` skip the merge branch. Tests: - test_no_removal_when_resuming - test_done_emoji_only_on_final_stream_end	2026-04-28 17:29:12 +08:00
Xubin Ren	50698c3d1c	test(telegram): cover local attachment filenames Add a regression test for preserving the original basename when Telegram sends local media bytes. Made-with: Cursor	2026-04-28 15:13:49 +08:00
Xubin Ren	2b886ffd1f	fix(command): expose history in chat command menus Made-with: Cursor	2026-04-27 18:23:35 +08:00
Xubin Ren	e31273ebaa	Merge origin/main into fix/discord-allow-channel-threads Made-with: Cursor	2026-04-27 09:26:24 +00:00
Xubin Ren	82c5083b15	fix(slack): preserve DM thread routing and strip trailing newlines	2026-04-27 09:01:04 +00:00
Xubin Ren	620d9e4f31	fix(slack): accept inbound file_share messages without dropping them Slack inbound events with subtype=file_share were silently dropped, so nanobot never saw messages that included attachments. Allow file_share through, download Slack-private files using the bot token into the local media dir, and pass them to the agent as media paths plus a "[file: name]" / "[image: name]" placeholder in the content. Reject responses that look like Slack's login HTML so an auth page is never saved as if it were the user's file. Document the required files:read scope alongside files:write so installs that read attachments are not quietly missing the permission.	2026-04-27 07:11:11 +00:00
Xubin Ren	8a0917db7a	fix(slack): polish thread UX and media support	2026-04-27 12:45:00 +08:00
Xubin Ren	038a140ad3	fix(slack): preserve thread context for proactive replies Capture Slack thread metadata for cron and message-tool deliveries so replies stay in the originating thread, and hydrate first thread mentions with recent Slack context. Made-with: Cursor	2026-04-27 02:10:38 +08:00
Xubin Ren	65b0ae81af	Merge origin/main into webui-settings Made-with: Cursor	2026-04-26 13:05:32 +00:00
Xubin Ren	d0e1b1393a	fix(feishu): scope streaming buffers by message Keep concurrent Feishu group replies from sharing one streaming card buffer when sessions are split by topic or top-level message. Made-with: Cursor	2026-04-26 16:09:31 +08:00
chengyongru	39eea1b762	feat(feishu): per-message session for group top-level messages Align with deer-flow: group top-level messages (no root_id) now get their own session keyed by message_id instead of sharing a single group-wide session. Topic replies continue to share session via root_id.	2026-04-26 16:09:31 +08:00
chengyongru	0e92936cf3	chore(test): remove stale reaction_id from test metadata The production code no longer reads reaction_id from metadata, so remove the leftover key from the test_no_removal_when_message_id_missing test case.	2026-04-26 16:09:31 +08:00
chengyongru	3eb8838dd9	fix(test): update reaction cleanup test for _reaction_ids dict The stream-end reaction cleanup now reads from _reaction_ids instead of metadata, so pre-populate the dict in the test instead of passing reaction_id via metadata.	2026-04-26 16:09:31 +08:00
chengyongru	2a9fc9392b	fix(feishu): use message_id as reply target and fix keyword-only arg Align reply targeting with deer-flow: always reply to the inbound message_id (not root_id). The Feishu Reply API keeps responses in the same topic automatically when the target message is inside a topic. Also fix run_in_executor calls that passed reply_in_thread as a positional arg to a keyword-only parameter, and route standalone tool hints through the reply API for group chats.	2026-04-26 16:09:31 +08:00
chengyongru	d36fba8bf5	feat(feishu): add reply_in_thread for visual topic grouping When reply_to_message config is enabled, the bot's first reply now uses reply_in_thread=True to create a visual topic/thread in the Feishu client. Subsequent chunks fall back to regular create. The reply_to_message default remains False for backward compatibility. Failed replies still fall back to regular send — messages are never silently dropped.	2026-04-26 16:09:31 +08:00
chengyongru	13bb31c789	feat(feishu): add thread-scoped session isolation for group chats Thread replies (messages with root_id != message_id) in group chats now get their own session key: feishu:{chat_id}:{root_id}. This means each Feishu thread has an independent conversation context. Top-level group messages and all private chat messages keep the default session key (no override), consistent with Telegram and Slack channel behavior. Co-authored-by: shenchengtsi <228445050+shenchengtsi@users.noreply.github.com>	2026-04-26 16:09:31 +08:00
Xubin Ren	b440e76d2f	feat(webui): add model settings runtime refresh	2026-04-25 18:05:06 +00:00

1 2 3 4

155 Commits