nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-06-19 09:14:03 +00:00

Author	SHA1	Message	Date
chengyongru	6eb178113e	fix(mcp): sanitize MCP capability names for model API compatibility MCP resource/prompt/tool names containing spaces or special characters (e.g. "PostgreSQL System Information") were forwarded verbatim to model provider APIs, causing validation errors from both Anthropic and OpenAI which require names matching ^[a-zA-Z0-9_-]{1,128}$. Add _sanitize_name() that replaces invalid characters with underscores and collapses consecutive underscores. Applied in MCPToolWrapper, MCPResourceWrapper, MCPPromptWrapper constructors and the enabled_tools filtering logic. Closes #3468	2026-04-27 11:49:50 +08:00
Xubin Ren	ca66dd8cd1	Merge PR #3463 : fix(agent): expose session timestamps in model context fix(agent): expose session timestamps in model context	2026-04-27 02:22:37 +08:00
Xubin Ren	4a4ba1efc1	Merge branch 'main' into fix/session-history-timestamps Made-with: Cursor	2026-04-26 18:13:11 +00:00
Xubin Ren	038a140ad3	fix(slack): preserve thread context for proactive replies Capture Slack thread metadata for cron and message-tool deliveries so replies stay in the originating thread, and hydrate first thread mentions with recent Slack context. Made-with: Cursor	2026-04-27 02:10:38 +08:00
Xubin Ren	7037764186	docs: clarify maintainer and contribution licensing	2026-04-26 18:01:55 +00:00
Xubin Ren	df37a36174	fix(agent): expose session timestamps in model context Include persisted turn timestamps when assembling LLM prompts so relative-date references like yesterday and today have concrete anchors. Made-with: Cursor	2026-04-26 17:42:58 +00:00
Xubin Ren	c64ec3e73c	Merge PR #3454 : feat(webui): add ask-user choices and model settings feat(webui): add ask-user choices and model settings	2026-04-26 22:19:39 +08:00
Xubin Ren	b2aec5528a	refactor(agent): move provider refresh into subsystem owners	2026-04-26 14:18:37 +00:00
Xubin Ren	f670da6c70	refactor(providers): move provider snapshot creation into factory	2026-04-26 14:05:13 +00:00
Xubin Ren	65b0ae81af	Merge origin/main into webui-settings Made-with: Cursor	2026-04-26 13:05:32 +00:00
Xubin Ren	82b8a3af7e	fix(provider): handle incomplete DeepSeek reasoning history	2026-04-26 20:47:55 +08:00
Xubin Ren	3b82e14f85	fix(shell): preserve login PATH for path append Made-with: Cursor	2026-04-26 20:32:38 +08:00
yorkhellen	814345dd78	fix: update tests for path_append env dict change	2026-04-26 20:32:38 +08:00
yorkhellen	2f2ac96ac7	fix: update tests for path_append env dict change	2026-04-26 20:32:38 +08:00
yorkhellen	23dde7b84c	fix: prevent shell injection via path_append in ExecTool	2026-04-26 20:32:38 +08:00
Xubin Ren	727086ddac	test: tighten consolidation ratio coverage Made-with: Cursor	2026-04-26 20:24:42 +08:00
chengyongru	fca56d324a	test: add unit tests for configurable consolidation_ratio Cover ratio propagation, schema validation, and consolidation behavior with different ratio values (0.1, 0.5, 0.9).	2026-04-26 20:24:42 +08:00
Subal	80ee4483f8	feat: make consolidation ratio configurable	2026-04-26 20:24:42 +08:00
chengyongru	3de843a229	fix(provider): gate reasoning-to-content fallback behind spec flag The non-streaming parse path unconditionally promoted the `reasoning` response field to `content` when content was empty. This was intended for StepFun (whose API returns the actual answer in `reasoning`), but it applied to every OpenAI-compatible provider — causing internal thinking chains from models like Xiaomi MIMO to be leaked as formal replies. Add `reasoning_as_content: bool` to ProviderSpec (default False) and set it only for StepFun. The fallback now requires this flag rather than running globally. Fixes #3443	2026-04-26 20:11:08 +08:00
Xubin Ren	6036355ac5	fix(message): limit session recording to proactive sends Only mark message-tool deliveries for channel-session recording while cron jobs are running, avoiding duplicate session writes during normal user turns. Made-with: Cursor	2026-04-26 20:08:21 +08:00
Xubin Ren	799db33517	fix(heartbeat): record proactive deliveries in channel sessions Route heartbeat, cron, and message-tool deliveries through one gateway helper so user-visible proactive messages are available when the channel replies. Made-with: Cursor	2026-04-26 20:08:21 +08:00
hussein1362	1572626100	fix(heartbeat): inject delivered messages into channel session for reply continuity When heartbeat delivers output to a channel (e.g. Telegram), the message is a raw OutboundMessage that bypasses the channel's session. If the user replies, their reply enters a different session with no context about the heartbeat message, so the agent cannot follow through. This change injects the delivered heartbeat message as an assistant turn into the target channel's session before publishing the outbound. When the user replies, the channel session has conversational context. Handles unified_session mode by resolving to UNIFIED_SESSION_KEY when enabled, matching the agent loop's own session routing. No changes to agent/loop.py, session/manager.py, channels, providers, or config schema — uses existing add_message() and save() APIs.	2026-04-26 20:08:21 +08:00
Xubin Ren	1e11b35b45	fix(providers): tighten local endpoint detection Parse the endpoint host before disabling keepalive so public hostnames that merely contain private-network substrings keep the default connection pool behavior. Made-with: Cursor	2026-04-26 16:14:24 +08:00
hussein1362	5943ab386d	fix(providers): disable HTTP keepalive for local/LAN endpoints Local model servers (Ollama, llama.cpp, vLLM) often close idle HTTP connections before the client-side keepalive timer expires. When two LLM calls happen seconds apart — for example the heartbeat _decide() phase followed immediately by process_direct() — the second call grabs a now-dead pooled connection, causing a transient APIConnectionError on every first attempt. The fix detects local endpoints via: - ProviderSpec.is_local (Ollama, LM Studio, vLLM, OVMS) - Private-network URL patterns (localhost, 127.x, 192.168.x, 10.x, 172.16-31.x, host.docker.internal, [::1]) For these endpoints, the AsyncOpenAI client is created with a custom httpx.AsyncClient that sets keepalive_expiry=0, forcing a fresh TCP connection for each request. This is cheap on LAN (sub-5ms connect) and eliminates the stale-connection retry tax entirely. Cloud providers (OpenAI, Anthropic, OpenRouter, etc.) keep the default 5-second keepalive, which is fine for high-frequency API usage. The private-network heuristic also covers the common case where users configure provider='openai' but point apiBase at a LAN IP running llama.cpp — the spec says is_local=False, but the URL clearly is.	2026-04-26 16:14:24 +08:00
Xubin Ren	d0e1b1393a	fix(feishu): scope streaming buffers by message Keep concurrent Feishu group replies from sharing one streaming card buffer when sessions are split by topic or top-level message. Made-with: Cursor	2026-04-26 16:09:31 +08:00
chengyongru	39eea1b762	feat(feishu): per-message session for group top-level messages Align with deer-flow: group top-level messages (no root_id) now get their own session keyed by message_id instead of sharing a single group-wide session. Topic replies continue to share session via root_id.	2026-04-26 16:09:31 +08:00
chengyongru	0e92936cf3	chore(test): remove stale reaction_id from test metadata The production code no longer reads reaction_id from metadata, so remove the leftover key from the test_no_removal_when_message_id_missing test case.	2026-04-26 16:09:31 +08:00
chengyongru	3eb8838dd9	fix(test): update reaction cleanup test for _reaction_ids dict The stream-end reaction cleanup now reads from _reaction_ids instead of metadata, so pre-populate the dict in the test instead of passing reaction_id via metadata.	2026-04-26 16:09:31 +08:00
chengyongru	2a9fc9392b	fix(feishu): use message_id as reply target and fix keyword-only arg Align reply targeting with deer-flow: always reply to the inbound message_id (not root_id). The Feishu Reply API keeps responses in the same topic automatically when the target message is inside a topic. Also fix run_in_executor calls that passed reply_in_thread as a positional arg to a keyword-only parameter, and route standalone tool hints through the reply API for group chats.	2026-04-26 16:09:31 +08:00
chengyongru	8717832771	perf(feishu): make reaction non-blocking to speed up inbound dispatch Reaction emoji is now added as a fire-and-forget background task instead of blocking the inbound message pipeline. This removes one API round-trip from the critical path before the agent starts processing.	2026-04-26 16:09:31 +08:00
chengyongru	d36fba8bf5	feat(feishu): add reply_in_thread for visual topic grouping When reply_to_message config is enabled, the bot's first reply now uses reply_in_thread=True to create a visual topic/thread in the Feishu client. Subsequent chunks fall back to regular create. The reply_to_message default remains False for backward compatibility. Failed replies still fall back to regular send — messages are never silently dropped.	2026-04-26 16:09:31 +08:00
chengyongru	13bb31c789	feat(feishu): add thread-scoped session isolation for group chats Thread replies (messages with root_id != message_id) in group chats now get their own session key: feishu:{chat_id}:{root_id}. This means each Feishu thread has an independent conversation context. Top-level group messages and all private chat messages keep the default session key (no override), consistent with Telegram and Slack channel behavior. Co-authored-by: shenchengtsi <228445050+shenchengtsi@users.noreply.github.com>	2026-04-26 16:09:31 +08:00
Xubin Ren	b440e76d2f	feat(webui): add model settings runtime refresh	2026-04-25 18:05:06 +00:00
T3chC0wb0y	fd3d7ea752	fix(msteams): normalize nbsp in inbound text	2026-04-26 00:56:06 +08:00
T3chC0wb0y	722d935d37	fix(msteams): prune bad notify refs	2026-04-26 00:56:06 +08:00
T3chC0wb0y	7e65884acb	fix(msteams): send threaded replies via replyToId	2026-04-26 00:56:06 +08:00
Xubin Ren	a58d9fd357	feat(webui): render ask_user choices Made-with: Cursor	2026-04-25 15:46:47 +00:00
Xubin Ren	403ce23d22	fix(agent): tighten ask_user CLI handling Made-with: Cursor	2026-04-25 22:10:19 +08:00
Xubin Ren	3b1ea99ee1	fix(agent): render ask_user options without buttons Made-with: Cursor	2026-04-25 22:10:19 +08:00
Xubin Ren	cfc76ffbbf	feat(agent): add ask_user tool Made-with: Cursor	2026-04-25 22:10:19 +08:00
Xubin Ren	830211b5d4	docs: simplify macOS launchd setup Made-with: Cursor	2026-04-25 19:36:20 +08:00
Xubin Ren	8a4c338a01	docs: tighten macOS launchd setup Made-with: Cursor	2026-04-25 19:36:20 +08:00
choiking	41f7eae7b4	docs: add macOS launchd gateway setup	2026-04-25 19:36:20 +08:00
Xubin Ren	39a5a77874	fix(feishu): send videos with media message type	2026-04-24 20:00:56 +00:00
yorkhellen	076e4166d7	fix(agent): add LLM request timeout to prevent session lock starvation	2026-04-25 03:40:34 +08:00
Xubin Ren	e52fe2a8e2	feat(webui): render video media attachments Add signed media URLs to live WebSocket replies and teach the WebUI to classify and render video attachments, so bot-sent videos can play inline in both live chats and session history. Made-with: Cursor	2026-04-25 03:20:40 +08:00
Xubin Ren	be05189f39	feat(channels): add video support for Telegram and WebSocket Telegram previously sent all video files as documents via send_document, so users saw a file icon instead of an inline player. WebSocket only accepted image MIME types, rejecting video uploads entirely. Telegram: - Recognize video extensions (mp4/mov/avi/mkv/webm/3gp) in _get_media_type - Route videos through send_video with supports_streaming=True - Add VIDEO/VIDEO_NOTE/ANIMATION to inbound message filters - Add video MIME mappings to _get_extension - Fix: local file sends now use _call_with_retry (previously no retry) WebSocket: - Expand upload MIME whitelist with video/mp4, video/webm, video/quicktime - Add per-type size limits (_MAX_VIDEO_BYTES=20MB, _MAX_VIDEOS_PER_MESSAGE=1) - Expand media serving endpoint to serve video with correct Content-Type Agent: - Add "video" to message tool media parameter description - Add .mp4 example to identity.md system prompt Made-with: Cursor	2026-04-25 02:20:13 +08:00
Matt Van Horn	ee14e2df56	perf(document): lazy-import heavy document parsers Move pypdf, python-docx, openpyxl, and python-pptx imports from module level into the _extract_pdf / _extract_docx / _extract_xlsx / _extract_pptx functions that actually use them. These four libraries became core dependencies in v0.1.5.post2 (~25 MB combined) and were paying the import cost on every nanobot startup even when no document parsing was needed for the session. The module-level SUPPORTED_EXTENSIONS set and the extract_text() dispatch stay as-is; the "[error: <lib> not installed]" branches move from the old module-level None sentinels into the corresponding extractor's try/except ImportError block. Behavior for the error message and for successful parses is identical. All 20 tests in tests/test_document_parsing.py pass unchanged. Fixes #3422	2026-04-25 02:10:30 +08:00
Xubin Ren	3441d5f89c	test(anthropic): cover remaining opus-4-7 temperature branches The existing test only verified the adaptive path. Add two more cases: - enabled thinking (high): temperature must also be omitted - no thinking (None): temperature must still be omitted Made-with: Cursor	2026-04-24 15:33:59 +08:00
04cb	9239429a00	fix(anthropic): omit temperature for opus-4-7 (#3417 )	2026-04-24 15:33:59 +08:00

1 2 3 4 5 ...

2169 Commits