Two issues with DeepSeek V4 thinking mode support:
1. Missing thinking parameter injection.
DeepSeek V4 requires `extra_body: {"thinking": {"type": "enabled/disabled"}}`
— identical to VolcEngine/BytePlus. The code had this for volcengine,
byteplus, dashscope, minimax, and kimi, but not for DeepSeek, so
`reasoning_effort=minimal` (thinking off) silently had no effect.
Root cause: the thinking-style→wire-format mapping was an if/elif chain
on provider *names*. DeepSeek was forgotten.
Fix: make the mapping declarative via `ProviderSpec.thinking_style`:
- "thinking_type" → {"thinking": {"type": "..."}} (DeepSeek, Volc, BytePlus)
- "enable_thinking" → {"enable_thinking": bool} (DashScope)
- "reasoning_split" → {"reasoning_split": bool} (MiniMax)
`_build_kwargs` now does a single dict lookup. Adding a new provider
with an existing wire format requires zero changes to the function.
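A minimal sketch of the declarative mapping described above; the names
`_THINKING_WIRE_FORMATS` and `thinking_extra_body` are illustrative,
not the actual code:

```python
from dataclasses import dataclass
from typing import Callable, Optional

# One wire-format builder per thinking style. A new provider that reuses
# an existing wire format only needs a thinking_style value on its spec.
_THINKING_WIRE_FORMATS: dict[str, Callable[[bool], dict]] = {
    "thinking_type": lambda on: {"thinking": {"type": "enabled" if on else "disabled"}},
    "enable_thinking": lambda on: {"enable_thinking": on},
    "reasoning_split": lambda on: {"reasoning_split": on},
}

@dataclass
class ProviderSpec:
    name: str
    thinking_style: Optional[str] = None  # key into _THINKING_WIRE_FORMATS

def thinking_extra_body(spec: ProviderSpec, thinking_enabled: bool) -> dict:
    """Single dict lookup replacing the old if/elif chain on provider names."""
    if spec.thinking_style is None:
        return {}
    return _THINKING_WIRE_FORMATS[spec.thinking_style](thinking_enabled)
```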
2. Legacy session messages crash thinking-mode requests.
When a session was started without thinking mode (or with a different
model), assistant messages lack reasoning_content. DeepSeek V4 in
thinking mode rejects these with 400:
"The reasoning_content in the thinking mode must be passed back to the API."
This affects ALL assistant messages, not just those with tool_calls
(despite the docs only mentioning the tool_calls case).
Fix: `_build_kwargs` backfills `reasoning_content: ""` on every
assistant message missing it, but only when thinking mode is active.
This is semantically neutral — the model treats empty reasoning_content
as "no thinking happened on that turn". The backfill only touches the
in-memory request copy; session files on disk are untouched.
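A sketch of the backfill, assuming plain message dicts (the helper name
is hypothetical):

```python
def backfill_reasoning_content(messages: list[dict], thinking_active: bool) -> list[dict]:
    """Return a request-local copy; the input list's dicts are not mutated.

    Only active when thinking mode is on: every assistant message lacking
    reasoning_content gets an empty string, which the model treats as
    "no thinking happened on that turn".
    """
    if not thinking_active:
        return messages
    patched = []
    for msg in messages:
        if msg.get("role") == "assistant" and "reasoning_content" not in msg:
            msg = {**msg, "reasoning_content": ""}  # copy, don't mutate
        patched.append(msg)
    return patched
```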
Tests: +5 (3 thinking toggle, 2 backfill). Full suite: 2377 passed.
Made-with: Cursor
DashScope rejects the OpenAI-style value "minimal" with
`'reasoning_effort.effort' must be one of: 'none', 'minimum', 'low',
'medium', 'high', 'xhigh'`, but nanobot was passing the string through
verbatim. Users who tried the documented "minimal" to disable thinking
got a 400; users who tried the DashScope-native "minimum" to work
around it got `enable_thinking=True` because the internal comparison
was a hard string match on "minimal".
Introduce a semantic/wire split in `_build_kwargs`:
- `semantic_effort` is the internal canonical form (OpenAI vocabulary).
"minimum" on the way in is normalized to "minimal" here so both
spellings share one meaning.
- `wire_effort` is what we actually serialize. For DashScope with
semantic_effort == "minimal" we translate to "minimum" on the way
out; other providers are unchanged.
- `thinking_enabled` and the Kimi thinking branch now compare on
`semantic_effort`, so either user spelling correctly disables
provider-side thinking.
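The split can be sketched as a small pure function (name and signature
are illustrative, not the actual `_build_kwargs` internals):

```python
def split_effort(user_effort: str, provider: str) -> tuple[str, str]:
    """Return (semantic_effort, wire_effort).

    semantic_effort is the internal canonical form (OpenAI vocabulary);
    "minimum" on the way in is normalized to "minimal" so both spellings
    share one meaning. wire_effort is what gets serialized: DashScope
    needs "minimum" on the wire, other providers are unchanged.
    """
    semantic = "minimal" if user_effort == "minimum" else user_effort
    wire = semantic
    if provider == "dashscope" and semantic == "minimal":
        wire = "minimum"
    return semantic, wire
```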
Tests:
- Strengthen `test_dashscope_thinking_disabled_for_minimal` to assert
the wire value is "minimum" in addition to the extra_body signal;
the original version only checked extra_body and let the
invalid-value bug slip through.
- Add `test_dashscope_thinking_disabled_for_minimum_alias` so a user
who read the DashScope docs and configured "minimum" still gets
thinking off.
- Add `test_non_dashscope_minimal_not_retranslated` to pin down that
the DashScope-specific translation does not leak to OpenAI et al.
- Inject `thinking={"type": "enabled|disabled"}` via extra_body for
Kimi thinking-capable models (kimi-k2.5, k2.6-code-preview).
- Add _is_kimi_thinking_model helper to handle both bare slugs and
OpenRouter-style prefixed names (e.g. moonshotai/kimi-k2.5).
- reasoning_effort="minimal" maps to disabled; any other value enables it.
- Add tests for enabled/disabled states and OpenRouter prefix handling.
Lock the strict-provider sanitization path so assistant tool calls
without function.arguments are normalized to {} instead of being
forwarded as missing values.
Made-with: Cursor
Ensure assistant tool-call function.arguments is always emitted as
valid JSON text so strict OpenAI-compatible backends (including Alibaba
code models) do not reject requests. Add regressions for dict and
malformed-string argument payloads in message sanitization.
Made-with: Cursor
Keep tool-call assistant messages valid across provider sanitization
and avoid trailing user-only history after model errors. This prevents
follow-up requests from sending broken tool chains back to the gateway.
Resolved conflict in azure_openai_provider.py by keeping main's
Responses API implementation (role alternation not needed for the
Responses API input format).
Made-with: Cursor
- Add byteplus and byteplus_coding_plan to thinking param providers
- Only send extra_body when reasoning_effort is explicitly set
- Use setdefault().update() to avoid clobbering existing extra_body
- Add 7 regression tests for thinking params
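The second and third bullets can be sketched together (function name
and signature are illustrative):

```python
from typing import Optional

def merge_thinking_params(kwargs: dict, reasoning_effort: Optional[str],
                          thinking_body: dict) -> None:
    # Only send extra_body when reasoning_effort is explicitly set.
    if reasoning_effort is None:
        return
    # setdefault().update() merges into any existing extra_body
    # instead of clobbering it.
    kwargs.setdefault("extra_body", {}).update(thinking_body)
```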
Made-with: Cursor
The test test_openai_compat_strips_message_level_reasoning_fields was
added in fbedf7a and incorrectly asserted that reasoning_content and
extra_content should be stripped from messages. This contradicts the
intent of b5302b6 which explicitly added these fields to _ALLOWED_MSG_KEYS
to preserve them through sanitization.
Rename the test and fix assertions to match the original design intent:
reasoning_content and extra_content at message level should be preserved,
and extra_content inside tool_calls should also be preserved.
Signed-off-by: Lingao Meng <menglingao@xiaomi.com>
Replace the flatten/unflatten approach (merging extra_content.google.*
into provider_specific_fields then reconstructing) with direct pass-through:
parse extra_content as-is, store on ToolCallRequest.extra_content, serialize
back untouched. This is lossless, requires no hardcoded field names, and
covers all three parsing branches (str, dict, SDK object) plus streaming.
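An illustrative sketch of the pass-through, assuming a simplified
ToolCallRequest and showing only the str/dict branches (the real code
also handles SDK objects and streaming):

```python
import json
from dataclasses import dataclass
from typing import Optional

@dataclass
class ToolCallRequest:
    name: str
    arguments: str
    extra_content: Optional[dict] = None  # stored as-is, serialized back untouched

def parse_tool_call(raw: dict) -> ToolCallRequest:
    extra = raw.get("extra_content")
    if isinstance(extra, str):  # str branch: parse JSON, keep full structure
        extra = json.loads(extra)
    return ToolCallRequest(
        name=raw["function"]["name"],
        arguments=raw["function"].get("arguments", "{}"),
        extra_content=extra,  # no flatten/unflatten, no hardcoded field names
    )
```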