- Replace ':' with '_' in store_name to avoid WinError 123
- Pass sanitized store_name via AsyncClientConfig
- Fixes issue #3506 where the Matrix channel fails on Windows because the
colon in the user_id produces invalid file paths in matrix-nio's DefaultStore
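A minimal sketch of the fix, assuming the channel constructs the nio AsyncClient itself (function names are illustrative):

```python
from nio import AsyncClient, AsyncClientConfig

def _safe_store_name(user_id: str) -> str:
    # "@bot:example.org" -> "@bot_example.org"; Windows rejects ':' in filenames (WinError 123)
    return user_id.replace(":", "_")

def build_matrix_client(homeserver: str, user_id: str, store_path: str) -> AsyncClient:
    # DefaultStore derives its on-disk filename from store_name, so sanitize it up front
    config = AsyncClientConfig(store_name=_safe_store_name(user_id))
    return AsyncClient(homeserver, user_id, store_path=store_path, config=config)
```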
- Anthropic: "none" must not enable extended thinking
- Azure: "none" must not suppress temperature or inject reasoning body
- DeepSeek/DashScope/Kimi: "none" sends thinking disabled, skips reasoning_effort field
- Gemini: gemma keyword enables auto-routing for gemma models
- Do not send reasoning_effort="none" to APIs (prevents 400 on gemma/Gemini)
- Treat "none" as thinking disabled in thinking_style, Kimi, and reasoning_content backfill paths
- Fix Anthropic extended thinking not respecting "none"
- Fix Azure OpenAI temperature suppression and reasoning body for "none"
- Fix Codex reasoning body for "none"
- Add "gemma" keyword to Gemini ProviderSpec for correct auto routing
Adds Olostep (https://www.olostep.com) as an optional web_search backend
using the official olostep Python SDK (client.answers.create()).
Changes:
- pyproject.toml: adds olostep>=0.1.0 optional dependency
- schema.py: adds olostep to provider comment in WebSearchConfig
- web.py: adds _search_olostep() with lazy import and provider branching
- docs/configuration.md: documents Olostep setup under web search config
- tests: unit tests for the new provider
Backward compatible: existing users see no behavior change unless they
opt into provider: "olostep". No hard dependency is added to the runtime path.
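A rough sketch of the new provider branch; the Olostep client construction and the answers.create() arguments are assumptions, only the lazy import and method name come from the SDK description above:

```python
def _search_olostep(query: str, api_key: str) -> str:
    try:
        # Lazy import: only required when config selects provider: "olostep"
        from olostep import Olostep  # hypothetical client class name
    except ImportError as exc:
        raise RuntimeError("install the optional 'olostep' dependency to use this provider") from exc

    client = Olostep(api_key=api_key)
    result = client.answers.create(task=query)  # argument name is an assumption
    return str(result)
```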
Co-authored-by: umerkay <umerkk164@gmail.com>
Stream-end events are emitted at the end of every assistant turn. When
the agent has more tool-call rounds queued, the runner sets
`_resuming=True` on the metadata. Without a guard, every intermediate
stream end removed the OnIt reaction (the first one wins, since
`_reaction_ids.pop` empties the slot) and re-added `done_emoji`,
producing a DONE reaction after every tool call instead of only at
final completion.
Wrap the OnIt removal and `done_emoji` add in a `not _resuming` guard
so the OnIt indicator persists across tool-call rounds and DONE fires
exactly once when the agent's final response lands.
`_resuming` already flows through outbound metadata
(`nanobot/agent/loop.py:747`) and survives `_coalesce_stream_deltas`
because pure `_stream_end` messages without `_stream_delta` skip the
merge branch.
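A minimal sketch of the guard, with illustrative channel-side names:

```python
async def _on_stream_end(self, msg) -> None:
    if not msg.metadata.get("_resuming"):
        # Final turn: swap the OnIt indicator for DONE exactly once
        reaction_id = self._reaction_ids.pop(msg.chat_id, None)
        if reaction_id is not None:
            await self._remove_reaction(msg.chat_id, reaction_id)
        await self._add_reaction(msg.chat_id, self.done_emoji)
    # Intermediate tool-call rounds leave the OnIt reaction in place
```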
Tests:
- test_no_removal_when_resuming
- test_done_emoji_only_on_final_stream_end
Add an `extra_body` field to `ProviderConfig` that merges arbitrary
key-value pairs into every OpenAI-compatible request body. This is the
escape hatch for provider-specific features that nanobot does not have
first-class fields for.
Real-world use cases this unblocks via config alone (no code changes):
- vLLM/TGI `chat_template_kwargs` (e.g. `enable_thinking: false`)
- vLLM guided decoding (`guided_json`, `guided_regex`)
- Local model sampling params (`repetition_penalty`, `top_k`, `min_p`)
- Any future provider-specific param without a new PR each time
The config extra_body is applied last via recursive deep-merge, so it
can extend or override provider-specific defaults (e.g. thinking
params) without clobbering sibling keys set by internal logic.
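A sketch of the merge semantics described above (the helper name matches the changes list below; the body is illustrative):

```python
from typing import Any

def _deep_merge(base: dict[str, Any], override: dict[str, Any]) -> dict[str, Any]:
    # Recursively merge override into a copy of base; neither input is mutated
    merged = dict(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = _deep_merge(merged[key], value)
        else:
            merged[key] = value
    return merged

# Sibling keys survive: a user-supplied chat_template_kwargs entry extends,
# rather than replaces, one set by internal thinking logic.
assert _deep_merge(
    {"chat_template_kwargs": {"enable_thinking": True}},
    {"chat_template_kwargs": {"add_generation_prompt": True}},
) == {"chat_template_kwargs": {"enable_thinking": True, "add_generation_prompt": True}}
```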
Changes:
- Add `extra_body: dict[str, Any] | None` to `ProviderConfig`
- Pass it through `factory.py` to `OpenAICompatProvider.__init__`
- Deep-merge into `_build_kwargs` after all internal extra_body entries
- Add `_deep_merge` helper (recursive dict merge, does not mutate inputs)
- 21 tests: deep-merge semantics, provider init, _build_kwargs
integration, thinking coexistence, real-world patterns (guided_json,
repetition_penalty), and schema validation
Treat workspace and safety guard failures as fatal regardless of whether they arrive from tool preparation, returned tool output, or raised exceptions.
Made-with: Cursor
The max_messages config field in AgentDefaults was accepted by the
schema but never threaded through to the actual get_history() calls
in the agent loop. Both call sites in _process_message hardcoded the
default, so sessions with slow or local models accumulated unbounded
history that inflated prompt tokens and caused LLM timeouts.
Changes:
- Add max_messages field to AgentDefaults (default 0 = use built-in
constant, any positive value caps history replay)
- Store the value on AgentLoop and pass it to get_history() when
non-zero
- Wire the config through all three AgentLoop construction sites in
commands.py (gateway, API server, CLI chat)
- 14 focused tests covering schema validation, init storage, history
slicing, boundary alignment, integration wiring, and the
zero/default path
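A hedged sketch of the wiring; the get_history() keyword and class shape are assumptions matching the description above:

```python
class AgentLoop:
    def __init__(self, session, max_messages: int = 0):
        self._session = session
        self._max_messages = max_messages  # 0 = keep the built-in constant

    def _load_history(self, key: str):
        # Both former call sites in _process_message route through this cap
        if self._max_messages > 0:
            return self._session.get_history(key, max_messages=self._max_messages)
        return self._session.get_history(key)
```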
Adds /history [n] to display the last N user/assistant messages from
the current session (default 10, max 50).
- Tool and system messages are filtered out for readability
- Long messages are truncated to 200 characters with an ellipsis
- Multimodal content (image blocks) is collapsed to its text parts
- Invalid count argument returns a usage hint
- /history n uses prefix routing; /history uses exact routing
Also registers /history in build_help_text().
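A sketch of those rendering rules (message shape and helper name are assumptions, not the actual handler):

```python
def render_history(messages: list[dict], count: int = 10) -> str:
    count = max(1, min(count, 50))
    visible = [m for m in messages if m.get("role") in ("user", "assistant")]
    lines = []
    for msg in visible[-count:]:
        content = msg.get("content", "")
        if isinstance(content, list):
            # Multimodal content: keep only the text parts, drop image blocks
            content = " ".join(p.get("text", "") for p in content if p.get("type") == "text")
        if len(content) > 200:
            content = content[:200] + "…"
        lines.append(f"{msg['role']}: {content}")
    return "\n".join(lines)
```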
Three failure modes addressed:
1. Model reflects HEARTBEAT.md instructions back as output instead of
executing them ("HEARTBEAT.md has active tasks listed...")
2. Model narrates decision logic ("Best judgment call: stay quiet")
3. Model produces empty output for silence, the runner treats it as a failure,
and the finalization retry generates "couldn't produce a final answer",
which gets delivered to the user
Changes:
- Add _is_deliverable() pre-filter in HeartbeatService._tick() that catches
finalization fallback messages and leaked reasoning patterns before they
reach the evaluator
- Wrap Phase 2 task input with a delivery-awareness preamble telling the
model its output goes directly to the user's messaging app
- Add meta-reasoning suppression criterion to evaluator template
No changes to agent/loop.py, runner.py, providers, or config schema.
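An illustrative version of the pre-filter; the marker strings are examples of the failure modes above, not the exact patterns shipped in HeartbeatService:

```python
_NON_DELIVERABLE_MARKERS = (
    "couldn't produce a final answer",   # finalization fallback leak
    "heartbeat.md has active tasks",     # instructions reflected back as output
    "best judgment call",                # narrated decision logic
)

def _is_deliverable(text: str) -> bool:
    stripped = text.strip()
    if not stripped:
        return False  # empty output means "stay silent", never a user-visible failure
    lowered = stripped.lower()
    return not any(marker in lowered for marker in _NON_DELIVERABLE_MARKERS)
```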
Move sessionHistoryMaxMessages, sessionHistoryMaxTokens, and
sessionFileMaxMessages out of user-facing config into internal
constants (HISTORY_MAX_MESSAGES=120, FILE_MAX_MESSAGES=2000).
- Remove 3 fields from AgentDefaults and config pipeline
- Sink enforce_file_cap into Session (was AgentLoop)
- Auto-derive token budget from context window (was configurable)
- Net -113 lines across 7 files; 723 tests green
Made-with: Cursor
The PR stores ref freshness in the metadata sidecar, so the merged main test should assert updated_at there instead of in the refs payload.
Made-with: Cursor
Resolve the MSTeams stale-reference cleanup conflict by keeping the PR's locked, atomic sidecar-meta implementation and aligning the merged test expectation locally.
Made-with: Cursor
Past assistant turns in history were prefixed with "[Message Time: ...]"
just like user turns. The model treated these as in-context demos and
started prefixing its own replies with the same marker, leaking
metadata to the user. Prompt-level warnings could not beat dozens of
prior assistant samples.
Annotate only user turns and proactive deliveries
(_channel_delivery=True, i.e. cron / heartbeat pushes whose timing is
the whole point and which are too infrequent to act as demos). Adjacent
user-side timestamps still pin every normal assistant reply for
relative-time reasoning. The now-redundant identity.md warning is
removed along with the demonstration source.
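The resulting rule, as a minimal sketch (field names follow the description above):

```python
def should_annotate_timestamp(role: str, metadata: dict) -> bool:
    # Annotate user turns and proactive deliveries (cron / heartbeat pushes);
    # never annotate past assistant turns, so there is no marker for the model to imitate
    return role == "user" or bool(metadata.get("_channel_delivery"))
```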
Slack inbound events with subtype=file_share were silently dropped, so
nanobot never saw messages that included attachments. Allow file_share
through, download Slack-private files using the bot token into the
local media dir, and pass them to the agent as media paths plus a
"[file: name]" / "[image: name]" placeholder in the content. Reject
responses that look like Slack's login HTML so an auth page is never
saved as if it were the user's file. Document the required files:read
scope alongside files:write so installs that read attachments are not
quietly missing the permission.
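A hedged sketch of the download path; the placeholder format and login-page rejection follow the description, while the helper name, the HTML heuristic, and the use of httpx are assumptions for illustration:

```python
import httpx

async def _download_slack_file(url_private: str, name: str, mimetype: str,
                               bot_token: str, media_dir: str) -> tuple[str, str]:
    async with httpx.AsyncClient() as client:
        resp = await client.get(url_private, headers={"Authorization": f"Bearer {bot_token}"})
    body = resp.content
    # Without files:read, Slack serves its login page instead of the file bytes
    if b"<html" in body[:512].lower():
        raise RuntimeError(f"got Slack HTML instead of file content for {name}")
    path = f"{media_dir}/{name}"
    with open(path, "wb") as fh:
        fh.write(body)
    label = "image" if mimetype.startswith("image/") else "file"
    return path, f"[{label}: {name}]"
```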
Builds on PR #3463 (commit 038a140), which introduced metadata and
session_key parameters through _LoopHook and _set_tool_context for the
cron and message tools. Three downstream gaps remained:
1. _set_tool_context's body still computes effective_key from
channel:chat_id and passes that to spawn, even when the caller
provides a thread-scoped session_key. The new parameter is wired in
for cron/message but spawn dispatch ignores it. Result: subagent
announces from threaded callers carry a channel-only
session_key_override, dropping thread_ts.
2. _process_message's system-channel branch loads the session via
key = f"{channel}:{chat_id}", ignoring msg.session_key_override.
So even when the announce InboundMessage carries the right override
(after fix 1), the consumer side discards it and routes to the
channel-level session.
3. The OutboundMessage returned from the system-channel branch has no
metadata, so slack's outbound dispatcher has no thread_ts to use and
posts the LLM's reply to the channel top-level rather than the
originating thread.
This change closes all three gaps with three small edits in loop.py.
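Condensed, the three edits reduce to the key-selection logic below (names are illustrative; the real loop.py code differs in shape):

```python
def pick_effective_key(channel: str, chat_id: str, session_key: str | None) -> str:
    # Edits 1 and 2: prefer the caller-provided thread-scoped key over channel:chat_id
    return session_key or f"{channel}:{chat_id}"

def thread_ts_from_key(session_key: str) -> str | None:
    # Edit 3: recover thread_ts from slack's "slack:<channel>:<thread_ts>" key shape
    # so the system-channel OutboundMessage can carry metadata={"thread_ts": ...}
    parts = session_key.split(":")
    return parts[2] if len(parts) >= 3 else None
```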
Behavior change:
- Slack channels with reply_in_thread: true: subagent announces and
follow-up replies now arrive in the originating thread session
instead of leaking into the channel-level session.
- Other channels constructing thread-scoped session keys (matrix
threads, telegram thread mode, etc.): the session-loading and
effective-key fixes apply identically since they're platform-agnostic.
The outbound thread_ts reconstruction is slack-specific by virtue of
the session-key format slack uses; other channels would benefit from
the same pattern but are out of scope for this PR.
- Unified session mode: no change. Falls back to UNIFIED_SESSION_KEY
when session_key is not provided.
- CLI / non-channel callers: no change. They don't pass session_key
and the fallback to f"{channel}:{chat_id}" matches prior behavior.
Reproducer (slack with reply_in_thread: true):
1. From a slack thread, send a message that triggers a subagent spawn.
2. Before fix: announce lands in slack:<channel>.jsonl session,
parent agent in the thread never sees the completion event,
eventual reply (if any) posts to the channel top-level, not the
thread.
3. After fix: announce lands in slack:<channel>:<thread_ts>.jsonl,
parent agent in the thread responds within seconds, reply posts in
the thread.