nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-05-23 10:02:34 +00:00

Author	SHA1	Message	Date
Xubin Ren	e52fe2a8e2	feat(webui): render video media attachments Add signed media URLs to live WebSocket replies and teach the WebUI to classify and render video attachments, so bot-sent videos can play inline in both live chats and session history. Made-with: Cursor	2026-04-25 03:20:40 +08:00
Xubin Ren	be05189f39	feat(channels): add video support for Telegram and WebSocket Telegram previously sent all video files as documents via send_document, so users saw a file icon instead of an inline player. WebSocket only accepted image MIME types, rejecting video uploads entirely. Telegram: - Recognize video extensions (mp4/mov/avi/mkv/webm/3gp) in _get_media_type - Route videos through send_video with supports_streaming=True - Add VIDEO/VIDEO_NOTE/ANIMATION to inbound message filters - Add video MIME mappings to _get_extension - Fix: local file sends now use _call_with_retry (previously no retry) WebSocket: - Expand upload MIME whitelist with video/mp4, video/webm, video/quicktime - Add per-type size limits (_MAX_VIDEO_BYTES=20MB, _MAX_VIDEOS_PER_MESSAGE=1) - Expand media serving endpoint to serve video with correct Content-Type Agent: - Add "video" to message tool media parameter description - Add .mp4 example to identity.md system prompt Made-with: Cursor	2026-04-25 02:20:13 +08:00
Xubin Ren	06503cd0fc	fix(telegram): keep callback_data under Telegram's 64-byte cap ``InlineKeyboardButton(label, callback_data=label)`` fails Telegram's API when the label exceeds 64 bytes UTF-8. An LLM-generated long option (realistic in multilingual flows) used to 400 the ``send_message`` call silently — user got nothing, agent heard a successful retry-then-drop. Decouple display from wire: button text keeps the full label, callback_data gets truncated at a UTF-8 char boundary. Tap echoes the prefix back as the user message; the LLM understands a prefix of its own option just fine, and the display the user saw was always the full string. Locks: helper boundary behavior (ASCII, CJK, short labels pass through) and end-to-end ``_build_keyboard`` integration with an over-cap label. Made-with: Cursor	2026-04-23 13:26:06 +08:00
Xubin Ren	6bc2983ab1	fix(telegram): fall back buttons to inline text when keyboard disabled Buttons are semantic options, not a separate channel protocol: a user who taps "Yes" and a user who types "yes" arrive at the agent as the same string. Dropping ``msg.buttons`` when ``inline_keyboards=False`` was the worst of both worlds — the agent got told "Message sent with N button(s)" while the user saw a question with no options. Splice the labels into the message text instead. The LLM produces the same ``message(buttons=...)`` call regardless of channel; the channel layer picks the richest rendering it can afford — native keyboard when enabled, bracketed inline text otherwise. Layout is preserved (one row per line). Other channels can adopt the same helper incrementally. Locks: canonical ``_buttons_as_text`` format, flag-off send-path splices labels, flag-on send-path keeps content clean and rides ``reply_markup``. Made-with: Cursor	2026-04-23 13:26:06 +08:00
Gunnar Thielebein	8d33c1cb37	feat(telegram): add inline keyboard buttons	2026-04-23 13:26:06 +08:00
Xubin Ren	707c0d7f3a	fix(websocket): scrub partial media batches, nosniff /api/media	2026-04-23 00:07:27 +08:00
Xubin Ren	61a28c2c0a	feat(webui): support image uploads in composer and message bubbles	2026-04-23 00:07:27 +08:00
k	123d69bfb7	fix: allow specifying transcription language	2026-04-22 12:41:32 +08:00
flobo3	1826ab44fa	feat(transcription): add language parameter for Groq Whisper STT	2026-04-22 12:41:32 +08:00
chengyongru	d4e34f8c67	fix(commands): intercept non-priority commands during active turn Non-priority slash commands (e.g. /new, /help, /dream-log) arriving while a session has an active LLM turn were silently queued into the pending injection buffer and later injected as raw user messages into the LLM conversation. This caused the model to respond to "/new" as plain text instead of executing the command. Root cause: the run() loop only checked priority commands (/stop, /restart, /status) before routing messages to the pending queue. All other command tiers (exact, prefix) bypassed command dispatch entirely. Changes: - Add CommandRouter.is_dispatchable_command() to match exact/prefix tiers, mirroring the existing is_priority() pattern. - In run(), intercept dispatchable commands before pending queue insertion and dispatch them directly via _dispatch_command_inline(). - Extract _cancel_active_tasks() from cmd_stop for reuse; cmd_new now cancels active tasks before clearing the session to prevent shared mutable state corruption from concurrent asyncio coroutines. - Update /new semantics: stops active task first, then clears session. - Update documentation in help text, docs, and Discord command list.	2026-04-21 21:50:37 +08:00
hussein1362	f8a023218d	fix(telegram): improve markdown rendering for modern LLM output Problem: Modern LLMs (GPT-5.4, Claude, Gemini) produce markdown-heavy responses with numbered lists, headers, and nested formatting. The Telegram channel's _markdown_to_telegram_html() converter has gaps that leave these poorly formatted: 1. Numbered lists (1. 2. 3.) have zero handling — sent as raw text 2. Headers (# Title) are stripped to plain text, losing visual hierarchy 3. Mid-stream edits send raw markdown (users see bold and ### headers while the response generates, before the final HTML conversion) Root Cause: _markdown_to_telegram_html() handles bullets (- *) but skips numbered lists entirely. Headers are stripped of # but not given any emphasis. The streaming path in send_delta() sends buf.text as-is during mid-stream edits (plain text, no parse_mode) — only the final _stream_end edit converts to HTML. Fix: 1. Headers now render as <b>bold</b> in the final HTML (using placeholder markers that survive HTML escaping, restored after all other processing) 2. Numbered lists are normalized (extra whitespace after the dot is cleaned) 3. New _strip_md_block() function strips markdown syntax for readable plain-text preview during streaming mid-edits The final _stream_end HTML conversion is unchanged — it still produces full HTML with parse_mode=HTML. Only the intermediate edits are improved. Tests: Added 10 new tests covering: - Headers converting to bold HTML - Numbered list preservation and whitespace normalization - Headers with HTML special characters - Mixed formatting (headers + bullets + numbers + bold) - _strip_md_block for inline formatting, headers, bullets, numbers, links - Streaming mid-edit markdown stripping (initial send + edit)	2026-04-21 21:35:34 +08:00
chengyongru	f900c5bb8e	fix(telegram): address code review issues from cherry-pick merge - Fix critical plain-text fallback that was sending raw HTML tags to users: keep raw markdown available for the fallback path - Extract TELEGRAM_HTML_MAX_LEN (4096) constant to replace hardcoded magic number and document the difference from TELEGRAM_MAX_MESSAGE_LEN - Add fallback to _send_text for extra HTML chunks when HTML parse fails - Add missing @pytest.mark.asyncio decorator on test_send_delta_stream_end_html_expansion_does_not_overflow	2026-04-20 16:58:46 +08:00
stutiredboy	2eea82f5ee	fix(telegram): split oversized stream buffer mid-flight Cherry-picked from #3311 (stutiredboy). Streaming edits called edit_message_text(text=buf.text) without chunking, so once accumulated deltas crossed Telegram's 4096-char limit an ongoing stream would fail with BadRequest. Extracts _flush_stream_overflow helper that edits the first chunk in place, sends any middle chunks, and re-anchors the buffer to a new message for the tail so subsequent deltas keep streaming. Co-Authored-By: stutiredboy <stutiredboy@users.noreply.github.com>	2026-04-20 16:58:46 +08:00
himax12	fd8f08cc83	fix(telegram): convert markdown to HTML before splitting to avoid message length overflow Cherry-picked from #3316 (himax12). When streaming completes in send_delta(), the code was splitting raw markdown text by 4000, then converting to HTML. The markdown-to-HTML conversion adds 10-33% characters, which could push the result over Telegram's 4096 character limit. The fix converts markdown to HTML first, then splits by 4096 (actual Telegram limit), ensuring the edited message always fits. Fixes #3315	2026-04-20 16:58:46 +08:00
jhkim43	297b852f6e	feat(telegram): change to mid-stream split per review feedback(#2967 PR)	2026-04-20 16:58:46 +08:00
chengyongru	ecfbb0ed4f	refactor(email): use _remember_processed_uid in SPF/DKIM reject paths Replaces inline dedup logic with the existing helper to match the style of _is_self_address and other reject branches, and to keep the _processed_uids eviction logic in one place.	2026-04-20 16:46:49 +08:00
flobo3	ffac8d3b0a	fix: deduplicate SPF/DKIM-rejected emails to stop log spam	2026-04-20 16:46:49 +08:00
Alfredo Arenas	3fd24c72fd	fix(discord): allow bot-to-bot messaging, only drop self-loops (#3217 ) Previously the Discord channel dropped every message from any bot account via `if message.author.bot`, which prevented legitimate multi-agent setups (one bot asking another for help, bot-to-bot @mentions, etc.) from working. Narrow the guard to only drop messages from this bot's own account by comparing against self._bot_user_id (already populated in on_ready). Self-loop protection is preserved — each bot instance still ignores its own outbound messages. Co-authored with Claude Opus 4.7	2026-04-19 23:32:40 +08:00
Xubin Ren	9ed3031a42	feat(webui): add initial webui with websocket chat flow	2026-04-18 18:51:53 +00:00
Xubin Ren	6bfb75ed03	feat(websocket): multiplex multiple chat_ids over a single connection	2026-04-18 16:49:12 +08:00
chengjun.zhu	9c19de67bf	fix: 错误消息流转路径：1. 当 LLM 服务出现临时性错误（如网络波动、超时、429限流等）时， base.py 中的 _run_with_retry 方法会启动重试机制。2. 在重试等待期间， _sleep_with_heartbeat 方法会周期性调用 on_retry_wait 回调函数，发送类似 'Model request failed, retry in 1s (attempt 1)' 的心跳消息。3. 之前 on_retry_wait 参数被错误地绑定到 _bus_progress ，导致这些内部诊断消息被当作普通进度消息发送到飞书客户端。4. manager.py 的消息分发器没有过滤这类重试心跳消息。修复方案：1. loop.py - 新增重试等待回调- 新增独立的 _on_retry_wait 回调函数，为重试消息添加 _retry_wait: True 元数据标识- 在 AgentRunSpec 中传入 retry_wait_callback 参数。2. runner.py - 支持重试回调参数- 在 AgentRunSpec 数据类中新增 retry_wait_callback 字段- 在 _build_request_kwargs 中将 on_retry_wait 参数从 progress_callback 改为 retry_wait_callback。3. manager.py - 过滤重试心跳消息- 在 _dispatch_outbound 方法中新增过滤逻辑，丢弃所有带 _retry_wait 标识的消息，确保重试心跳不会发送到任何客户端。	2026-04-18 13:50:05 +08:00
yorkhellen	1011ea5ac8	fix(email): ignore self-sent mailbox messages Skip inbound emails that come from the bot's own configured addresses so a mailbox wired to the same SMTP/IMAP account does not trigger infinite reply loops.	2026-04-17 16:25:16 +08:00
Mohamed Elkholy	ce5272c153	fix(transcription): honor api_base for OpenAI transcription provider Complete the symmetry left by #3214: ChannelManager._resolve_transcription_base already resolves providers.openai.api_base, but BaseChannel.transcribe_audio instantiated OpenAITranscriptionProvider without forwarding it, and the provider __init__ did not accept the parameter. Self-hosted OpenAI-compatible Whisper endpoints (LiteLLM, vLLM, etc.) configured via config.json were therefore ignored for the OpenAI backend. - OpenAITranscriptionProvider.__init__ now accepts api_base with env fallback (OPENAI_TRANSCRIPTION_BASE_URL) matching the Groq pattern. - BaseChannel.transcribe_audio forwards self.transcription_api_base to OpenAI. - Tests mirror the existing Groq coverage: manager propagation for provider "openai", BaseChannel-to-provider argument passing, and provider default vs override for api_url. Fully backward-compatible: when api_base is None and the env var is unset, the default https://api.openai.com/v1/audio/transcriptions is used. Refs #3213, follow-up to #3214.	2026-04-17 13:46:51 +08:00
flobo3	0401ca9dbc	fix: pass apiBase from config to GroqTranscriptionProvider	2026-04-17 13:46:51 +08:00
Bongjin Lee	48d430bf5e	feat: add channel-based filtering for Discord Add `allow_channels` config option to DiscordConfig that restricts bot responses to specific Discord channels. When the list is empty (default), the bot responds in all channels (backward compatible). - Add `allow_channels: list[str]` field to DiscordConfig schema - Add channel ID check in _handle_message_create after user filtering Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-17 02:14:33 +08:00
Xubin Ren	a2f4090e41	fix(msteams): secure inbound defaults and ref persistence Default Microsoft Teams inbound auth validation to enabled, update the README to match, and prevent denied senders from persisting conversation refs before allowlist checks pass. Made-with: Cursor	2026-04-16 13:22:07 +08:00
chengyongru	abe0145f99	fix(msteams): harden availability check and migrate docs to README - Check both jwt and cryptography in MSTEAMS_AVAILABLE guard so partial installs fail early with a clear message instead of at runtime - Add aclose() to test FakeHttpClient so stop() won't crash - Move MSTEAMS.md into README.md following the same details/summary pattern used by every other channel - Note in README that validateInboundAuth defaults to false	2026-04-16 13:22:07 +08:00
chengyongru	49223e639e	fix(msteams): add auth warning and restore unrelated pyproject change Warn when validate_inbound_auth is disabled (default) so operators are aware the webhook accepts unverified requests. Restore pymupdf to the dev optional-dependencies group — its removal in the original PR was unrelated to the Teams channel feature.	2026-04-16 13:22:07 +08:00
T3chC0wb0y	818a095a90	style(msteams): hoist time import	2026-04-16 13:22:07 +08:00
T3chC0wb0y	ee99200341	refactor(msteams): remove business references	2026-04-16 13:22:07 +08:00
T3chC0wb0y	9b4264fce2	refactor(msteams): remove FWDIOC references	2026-04-16 13:22:07 +08:00
T3chC0wb0y	fecef07c60	refactor(msteams): remove obsolete restart notify config	2026-04-16 13:22:07 +08:00
Bob Johnson	9f8774fbdd	fix(msteams): remove hardcoded quote test fallback	2026-04-16 13:22:07 +08:00
Bob Johnson	4d795f74d5	Fix MSTeams PR review follow-ups	2026-04-16 13:22:07 +08:00
T3chC0wb0y	824dcca5e2	Add Microsoft Teams channel on current nightly base	2026-04-16 13:22:07 +08:00
Leo fu	2c0cd085a4	fix(discord): remove duplicate channel_id assignment in message handler channel_id is already assigned from self._channel_key(message.channel) earlier in the same function. The second identical assignment on line 453 is dead code left over from a copy-paste.	2026-04-16 02:11:13 +08:00
dongzeyu001	8572b7478f	Fix wecom mix msg parse	2026-04-15 16:51:02 +08:00
Xubin Ren	1f33df1ea6	fix: preserve empty dict allow_from handling Keep dict-backed channel configs compatible with both allow_from and allowFrom without losing empty-list semantics, and add focused regression coverage for the allow-list boundary. Made-with: Cursor	2026-04-15 01:26:51 +08:00
samy	73cf9a220b	fix: handle dict config in is_allowed() and _validate_allow_from() getattr() on a dict never finds custom keys — it only searches object attributes, not dict keys. When channel config is loaded as a Pydantic extra field (which is a plain dict), getattr(config, 'allow_from', []) always returns the default [], causing all access to be denied regardless of the allowFrom configuration. Fix both is_allowed() and _validate_allow_from() to use isinstance checks, falling back to dict.get() for dict configs while preserving getattr() for object-style configs.	2026-04-15 01:26:51 +08:00
Xubin Ren	0a51344483	fix(slack): keep cross-target sends out of origin threads When Slack resolves a named target to another conversation, do not reuse the origin thread timestamp on the destination send, and keep reaction cleanup anchored to the source conversation. Made-with: Cursor	2026-04-14 20:19:48 +08:00
yeyitech	873be5180b	feat(slack): resolve named message targets	2026-04-14 20:19:48 +08:00
chengyongru	0adce5405b	fix(feishu): remove resuming to avoid 10-min streaming card timeout Feishu streaming cards auto-close after 10 minutes from creation, regardless of update activity. With resuming enabled, a single card lives across multiple tool-call rounds and can exceed this limit, causing the final response to be silently lost. Remove the _resuming logic from send_delta so each tool-call round gets its own short-lived streaming card (well under 10 min). Add a fallback that sends a regular interactive card when the final streaming update fails.	2026-04-14 16:53:42 +08:00
bahtya	f879d81b28	fix(channels/qq): propagate network errors in send() instead of swallowing The catch-all except Exception in QQ send() was swallowing aiohttp.ClientError and OSError that _send_media correctly re-raises. Add explicit catch for network errors before the generic handler.	2026-04-13 00:30:45 +08:00
bahtya	fa98524944	fix(channels): prevent retry amplification and silent message loss across channels Audited all channel implementations for overly broad exception handling that causes retry amplification or silent message loss during network errors. This is the same class of bug as #3050 (Telegram _send_text). Fixes by channel: Telegram (send_delta): - _stream_end path used except Exception for HTML edit fallback - Network errors (TimedOut, NetworkError) triggered redundant plain text edit, doubling connection demand during pool exhaustion - Changed to except BadRequest, matching the _send_text fix Discord: - send() caught all exceptions without re-raising - ChannelManager._send_with_retry() saw successful return, never retried - Messages silently dropped on any send failure - Added raise after error logging DingTalk: - _send_batch_message() returned False on all exceptions including network errors — no retry, fallback text sent unnecessarily - _read_media_bytes() and _upload_media() swallowed transport errors, causing _send_media_ref() to cascade through doomed fallback attempts - Added except httpx.TransportError handlers that re-raise immediately WeChat: - Media send failure triggered text fallback even for network errors - During network issues: 3×(media + text) = 6 API calls per message - Added specific catches: TimeoutException/TransportError re-raise, 5xx HTTPStatusError re-raises, 4xx falls back to text QQ: - _send_media() returned False on all exceptions - Network errors triggered fallback text instead of retry - Added except (aiohttp.ClientError, OSError) that re-raises Tests: 331 passed (283 existing + 48 new across 5 channel test files) Fixes: #3054 Related: #3050, #3053	2026-04-13 00:30:45 +08:00
bahtya	7e91aecd7d	fix(telegram): narrow exception catch in _send_text to prevent retry amplification Previously _send_text() caught all exceptions (except Exception) when sending HTML-formatted messages, falling back to plain text even for network errors like TimedOut and NetworkError. This caused connection demand to double during pool exhaustion scenarios (3 retries × 2 fallback attempts = 6 calls per message instead of 3). Now only catches BadRequest (HTML parse errors), letting network errors propagate immediately to the retry layer where they belong. Fixes: HKUDS/nanobot#3050	2026-04-13 00:30:45 +08:00
Dianqi Ji	ee946d96ca	feat(channels/feishu): add domain config for Lark global support Add 'domain' field to FeishuConfig (Literal['feishu', 'lark'], default 'feishu'). Pass domain to lark.Client.builder() and lark.ws.Client to support Lark global (open.larksuite.com) in addition to Feishu China (open.feishu.cn). Existing configs default to 'feishu' for backward compatibility. Also add documentation for domain field in README.md and add tests for domain config.	2026-04-12 09:56:17 +08:00
chengyongru	9f433cab01	fix(wecom): use reply_stream for progress messages to avoid errcode=40008 The plain reply() uses cmd="reply" which does not support "text" msgtype and causes WeCom API to return errcode=40008 (invalid message type). Unify both progress and final text messages to use reply_stream() (cmd="aibot_respond_msg"), differentiating via finish flag. Fixes #2999	2026-04-11 21:47:19 +08:00
chengyongru	f6f712a2ae	fix(wecom): harden upload/download, extract media type helper - Use asyncio.to_thread for file I/O to avoid blocking event loop - Add 200MB upload size limit with early rejection - Fix file handle leak by using context manager - Use memoryview for upload chunking to reduce peak memory - Add inbound download size check to prevent OOM - Use asyncio.to_thread for write_bytes in download path - Extract inline media_type detection to _guess_wecom_media_type()	2026-04-11 21:47:19 +08:00
chengyongru	f900e4f259	fix(wecom): harden upload and inbound media handling - Use asyncio.to_thread for file I/O to avoid blocking event loop - Add 200MB upload size limit with early rejection - Fix file handle leak by using context manager - Free raw bytes early after chunking to reduce memory pressure - Add file attachments to media_paths (was text-only, inconsistent with image) - Use robust _sanitize_filename() instead of os.path.basename() for path safety - Remove re-raise in send() for consistency with QQ channel - Fix truncated media_id logging for short IDs	2026-04-11 21:47:19 +08:00
gem12	48f6bbd256	feat(channels): Add full media support for QQ and WeCom channels QQ channel improvements (on top of nightly): - Add top-level try/except in _on_message and send() for resilience - Use defensive getattr() for attachment attributes (botpy version compat) - Skip file_name for image uploads to avoid QQ rendering as file attachment - Extract only file_info from upload response to avoid extra fields - Handle protocol-relative URLs (//...) in attachment downloads WeCom channel improvements: - Add _upload_media_ws() for WebSocket 3-step media upload protocol - Send media files (image/video/voice/file) via WeCom rich media API - Support progress messages (plain reply) vs final response (streaming) - Support proactive send when no frame available (cron push) - Pass media_paths to message bus for downstream processing	2026-04-11 21:47:19 +08:00

1 2 3 4 5 ...

439 Commits