31 Commits

Peixian Gong
dd26b4407d fix(providers): make GitHub Copilot backend work with GPT-5/o-series models
Calling GitHub Copilot with `gpt-5.*` / `o*` models (e.g.
`github_copilot/gpt-5.4`, `github_copilot/gpt-5.4-mini`) failed with a
chain of misleading errors:

  1. `Unsupported parameter: 'max_tokens' is not supported with this
     model. Use 'max_completion_tokens' instead.`
  2. `model "gpt-5.4-mini" is not accessible via the /chat/completions
     endpoint` (`unsupported_api_for_model`).
  3. `The requested model is not supported.` (`model_not_supported`)
     even after routing to /responses.

Root causes (each one masked the next):

  * The `github_copilot` ProviderSpec did not opt into
    `supports_max_completion_tokens`, so `_build_kwargs` always sent the
    legacy `max_tokens` parameter that GPT-5/o-series reject.
  * `_should_use_responses_api` was hard-gated to
    `spec.name == "openai"` plus a direct-OpenAI base URL, so the
    GitHub Copilot backend always went through /chat/completions even
    for models the Copilot gateway exposes only via /responses
    (e.g. `gpt-5.4-mini`).
  * When /responses did fail on github_copilot, the existing
    "compatibility marker" heuristic silently fell back to
    /chat/completions — which can never succeed for these models — so
    the real upstream error was hidden.
  * `_build_responses_body` did not honour `spec.strip_model_prefix`,
    so the request body sent `model="github_copilot/gpt-5.4-mini"`
    (with the routing prefix), which the Copilot gateway rejects with
    `model_not_supported`. (`_build_kwargs` already stripped it; this
    branch was missed.)

Fix:

  * registry.py: set `supports_max_completion_tokens=True` on the
    `github_copilot` spec so requests use `max_completion_tokens`.
  * openai_compat_provider.py:
      - `_should_use_responses_api` now also allows the
        `github_copilot` spec, and skips the direct-OpenAI base check
        for it (the Copilot gateway is its own base URL).
      - `_build_responses_body` now strips the model routing prefix
        when `spec.strip_model_prefix` is set, matching `_build_kwargs`.
      - `chat` / `chat_stream` no longer fall back from /responses to
        /chat/completions on the `github_copilot` spec: the fallback
        cannot succeed for GPT-5/o-series and would mask the real
        gateway error.
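
The routing decision described above can be sketched as follows. This is a minimal illustration, not the project's actual `_should_use_responses_api`: the `ProviderSpec` fields, the prefix tuple, and the base-URL check are simplified stand-ins based only on this commit message.

```python
# Hedged sketch of the /responses routing decision; names mirror the commit
# message, the model-family check is a deliberately crude stand-in.
from dataclasses import dataclass

@dataclass
class ProviderSpec:
    name: str
    base_url: str
    strip_model_prefix: bool = False

# GPT-5/o-series families that the gateways expose only via /responses
RESPONSES_ONLY_PREFIXES = ("gpt-5", "o")

def should_use_responses_api(spec: ProviderSpec, model: str) -> bool:
    # Honour strip_model_prefix the same way for both request paths,
    # so the routing prefix never reaches the gateway.
    bare = model.split("/", 1)[-1] if spec.strip_model_prefix else model
    if not bare.startswith(RESPONSES_ONLY_PREFIXES):
        return False
    if spec.name == "github_copilot":
        return True  # Copilot gateway is its own base URL; skip the check
    return spec.name == "openai" and "api.openai.com" in spec.base_url

spec = ProviderSpec("github_copilot", "https://api.githubcopilot.com",
                    strip_model_prefix=True)
print(should_use_responses_api(spec, "github_copilot/gpt-5.4-mini"))  # True
```

With this shape, `gpt-4` on the same spec still routes to /chat/completions, which is why the fixture model in the test below had to change.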

Tests:

  * tests/cli/test_commands.py: switched the
    `test_github_copilot_provider_refreshes_client_api_key_before_chat`
    fixture model from `gpt-5.1` to `gpt-4` so it continues to exercise
    the /chat/completions code path it was designed for (gpt-5.1 now
    correctly routes to /responses on github_copilot).
  * `pytest tests/providers/ tests/cli/test_commands.py` — 314 passed.
  * Verified end-to-end against the live Copilot gateway with both
    `github_copilot/gpt-5.4` and `github_copilot/gpt-5.4-mini`.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-04-22 14:28:19 +08:00
hussein1362
512bf59b3c fix(session): fsync sessions on graceful shutdown to prevent data loss
On filesystems with write-back caching (rclone VFS, NFS, FUSE mounts)
the OS page cache may buffer recent session writes. If the process is
killed before the cache flushes, the most recent conversation turns are
silently lost — causing the agent to "forget" recent context and
respond to stale history on the next startup.

Changes:

- session/manager.py: add fsync=True option to save() that flushes the
  file and its parent directory to durable storage. Add flush_all() that
  re-saves every cached session with fsync. Default save() behavior is
  unchanged (no fsync) to avoid performance regression in normal
  operation.

- cli/commands.py: call agent.sessions.flush_all() in the gateway
  shutdown finally block, after stopping heartbeat/cron/channels.

- tests/session/test_session_fsync.py: 8 tests covering fsync flag
  behavior, flush_all with empty/multiple/errored sessions, and
  data survival across simulated process restart.

- tests/cli/test_commands.py: add sessions attribute to _FakeAgentLoop
  so the gateway health endpoint test passes with the new shutdown
  flush.
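
The fsync path described above can be sketched like this. It is an assumption-level illustration (POSIX-only directory fsync; the real `save()`/`flush_all()` signatures may differ), showing the two flushes needed for durability: the file's contents and its parent directory entry.

```python
# Hedged sketch of save(fsync=True): flush + fsync the file, then fsync the
# parent directory so the entry itself survives a crash. POSIX-only.
import json
import os
from pathlib import Path

def save(path: Path, data: dict, fsync: bool = False) -> None:
    with open(path, "w", encoding="utf-8") as f:
        json.dump(data, f)
        if fsync:
            f.flush()
            os.fsync(f.fileno())      # file contents reach durable storage
    if fsync:
        dir_fd = os.open(path.parent, os.O_RDONLY)
        try:
            os.fsync(dir_fd)          # directory entry reaches durable storage
        finally:
            os.close(dir_fd)
```

Keeping the default `fsync=False` matches the commit's intent: per-turn saves stay cheap, and only the shutdown-time `flush_all()` pays the fsync cost.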
2026-04-22 13:19:53 +08:00
Xubin Ren
ef8bbab7b3 test(cli): lock _render_interactive_ansi force_terminal to isatty
Made-with: Cursor
2026-04-22 13:12:29 +08:00
chengyongru
79821a571f fix: suppress intermediate progress output in cron jobs
Cron jobs now pass on_progress=_silent to process_direct, matching
the heartbeat pattern. Previously, tool hints and streaming deltas
were published to the user channel via bus during execution, but the
final response could be rejected by evaluate_response — leaving users
with confusing partial output and no conclusion.

Closes #3319
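
The pattern is simple enough to show in a few lines. This is an illustrative stand-in, not the real `process_direct` signature: a no-op callback swallows intermediate events so only the final, evaluated response can reach the user.

```python
# Sketch of the on_progress=_silent pattern; process_direct is a stub here.
def _silent(*_args, **_kwargs) -> None:
    pass  # drop tool hints and streaming deltas instead of publishing them

def process_direct(prompt: str, on_progress) -> str:
    on_progress("running tool...")  # previously went to the user channel
    return f"done: {prompt}"

result = process_direct("nightly report", on_progress=_silent)
```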
2026-04-20 11:43:54 +08:00
Xubin Ren
b3049f7323 fix(webui): stabilize empty session history state 2026-04-19 13:38:47 +00:00
Xubin Ren
46e11a68a7 test: speed up cron and restart timing tests
Replace fixed sleep-based waits with condition polling in cron tests and mock the restart delay in CLI restart tests to reduce suite runtime without changing behavior.
2026-04-19 12:35:57 +00:00
Alfredo Arenas
2d0442976e test(cli): update _make_console tests for isatty-based fix (#3265)
The old test `test_make_console_uses_force_terminal` hardcoded
`force_terminal is True`, which contradicts the fix: we now defer
to sys.stdout.isatty() so piped / non-TTY output gets plain text
instead of ANSI escape codes.

Split into two tests covering both branches:

- test_make_console_force_terminal_when_stdout_is_tty: TTY path
  (force_terminal=True, rich output)
- test_make_console_force_terminal_false_when_stdout_is_not_tty:
  non-TTY path (force_terminal=False, plain text) — regression
  guard for the bug reported in #3265

Co-authored with Claude Opus 4.7
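
The decision under test can be reduced to one line. The real helper is `_make_console()` building a Rich `Console(force_terminal=sys.stdout.isatty())`; this dependency-free sketch isolates just the TTY check the two tests cover.

```python
# Sketch of the isatty-based decision: real TTY -> ANSI escapes allowed,
# pipe/file -> plain text. Rich's Console would receive this as force_terminal.
def ansi_enabled(stream) -> bool:
    return bool(getattr(stream, "isatty", lambda: False)())
```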
2026-04-19 04:19:59 +08:00
Xubin Ren
384bad17b4 Merge origin/main into fix/config-default-api-base
Made-with: Cursor
2026-04-18 20:08:21 +00:00
chengyongru
e1fdca7d40 fix(status): correct context percentage calculation and sync consolidator
- Pass resolved self.context_window_tokens to Consolidator instead of
  raw parameter that could be None, preventing consolidation failures
- Calculate percentage against input budget (ctx - max_completion - 1024)
  instead of raw context window, consistent with Consolidator/snip formulas
- Pass actual max_completion_tokens from provider to build_status_content
- Cap percentage display at 999 to prevent runaway values
- Add tests for budget-based percentage and cap behavior
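
The budget formula above can be sketched as follows. The 1024-token reserve and the 999 cap come straight from the commit message; the function name and rounding are illustrative assumptions.

```python
# Hedged sketch of the budget-based percentage: measure usage against the
# input budget (ctx - max_completion - 1024), not the raw context window.
def context_pct(used_tokens: int, context_window: int,
                max_completion_tokens: int) -> int:
    input_budget = context_window - max_completion_tokens - 1024
    pct = round(100 * used_tokens / max(input_budget, 1))
    return min(pct, 999)  # cap runaway values for display

print(context_pct(30_000, 65_536, 8_192))  # 53
```

Against the raw 65 536-token window the same usage would read 46%, which is why the two formulas had to be made consistent.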
2026-04-16 20:30:39 +08:00
Xubin Ren
2b8e90d8fd test(config): cover LM Studio nullable api key 2026-04-16 02:49:54 +08:00
Xubin Ren
25ded8e747 test: cover active task count in status
Lock the /status task counter to the actual stop scope by asserting it sums unfinished dispatch tasks and running subagents for the current session.

Made-with: Cursor
2026-04-15 01:49:42 +08:00
aiguozhi123456
634f4b45c1 feat: show active task count in /status output 2026-04-15 01:49:42 +08:00
Xubin Ren
e4b3f9bd28 security(gateway): keep health endpoint local by default
Bind the gateway health listener to localhost by default and reduce the probe response to a minimal status payload so accidental public exposure leaks less information.

Made-with: Cursor
2026-04-14 07:19:38 +00:00
Xubin Ren
4999e2f734 Merge origin/main into feat/health-endpoint
Keep the gateway health endpoint patch current with the latest gateway runtime changes, and lock the new HTTP routes in with CLI regression coverage and README guidance.

Made-with: Cursor
2026-04-14 06:32:31 +00:00
moranfong
0750d1f182 fix(config): return provider default api base in config resolution 2026-04-13 23:42:58 +08:00
Leo fu
42624f5bf3 test: update expected token display to match consistent 1000 divisor
The test fixtures use 65536 as context_window_tokens. With the divisor
corrected from 1024 to 1000, the display changes from 64k to 65k.
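
The arithmetic is worth spelling out. With integer division (an assumption about the display code), the divisor change alone moves the label:

```python
# 65536 // 1024 == 64 (old divisor), 65536 // 1000 == 65 (corrected divisor).
def format_tokens(n: int) -> str:
    return f"{n // 1000}k"

print(format_tokens(65_536))  # 65k
```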
2026-04-09 10:40:20 +08:00
Xubin Ren
075bdd5c3c refactor: move SafeFileHistory to module level + add regression tests
- Promote _SafeFileHistory to module-level SafeFileHistory for testability
- Add 5 regression tests: surrogates, normal text, emoji, mixed CJK, multi-surrogates

Made-with: Cursor
2026-04-07 13:57:34 +08:00
Xubin Ren
c9d4b7b905 Merge remote-tracking branch 'origin/main' into pr-2449
Made-with: Cursor

# Conflicts:
#	nanobot/utils/evaluator.py
2026-04-06 06:30:11 +00:00
Ben Lenarts
202938ae73 feat: support ${VAR} env var interpolation in config secrets
Allow config.json to reference environment variables via ${VAR_NAME}
syntax. Variables are resolved at runtime by resolve_config_env_vars(),
keeping the raw templates in the Pydantic model so save_config()
preserves them. This lets secrets live in a separate env file
(e.g. loaded by systemd EnvironmentFile=) instead of plain text
in config.json.
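
A minimal sketch of the resolution step, assuming a regex-based substitution and that unset variables are left untouched (both are assumptions; only the `resolve_config_env_vars` name and the `${VAR_NAME}` syntax come from the commit message):

```python
# Hedged sketch: resolve ${VAR_NAME} from the environment at runtime,
# leaving the raw template in the model so save_config() round-trips it.
import os
import re

_VAR = re.compile(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}")

def resolve_config_env_vars(value: str) -> str:
    # Unset variables are left as-is (assumed behavior).
    return _VAR.sub(lambda m: os.environ.get(m.group(1), m.group(0)), value)

os.environ["DEMO_SECRET"] = "sk-test"            # hypothetical variable
print(resolve_config_env_vars("${DEMO_SECRET}"))  # sk-test
```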
2026-04-06 13:43:26 +08:00
Jiajun Xie
f86f226c17 fix(cli): prevent spinner ANSI escape codes from being printed verbatim
Fixes #2591

The "nanobot is thinking..." spinner was printing ANSI escape codes
literally in some terminals, causing garbled output like:
  ?[2K?[32m⠧?[0m ?[2mnanobot is thinking...?[0m

Root causes:
1. Console created without force_terminal=True, so Rich couldn't
   reliably detect terminal capabilities
2. Spinner continued running during user input prompt, conflicting
   with prompt_toolkit

Changes:
- Set force_terminal=True in _make_console() for proper ANSI handling
- Add stop_for_input() method to StreamRenderer
- Call stop_for_input() before reading user input in interactive mode
- Add tests for the new functionality
2026-04-05 16:50:49 +08:00
Xubin Ren
30ea048f19 Merge remote-tracking branch 'origin/main' into pr-2717-review 2026-04-04 04:42:52 +00:00
imfondof
896d578677 fix(restart): show restart completion with elapsed time across channels 2026-04-04 02:21:42 +08:00
imfondof
ba7c07ccf2 fix(restart): send completion notice after channel is ready and unify runtime keys 2026-04-04 02:21:42 +08:00
chengyongru
b9616674f0 feat(agent): two-stage memory system with Dream consolidation
Replace single-stage MemoryConsolidator with a two-stage architecture:

- Consolidator: lightweight token-budget triggered summarization,
  appends to HISTORY.md with cursor-based tracking
- Dream: cron-scheduled two-phase processor that analyzes HISTORY.md
  and updates SOUL.md, USER.md, MEMORY.md via AgentRunner with
  edit_file tools for surgical, fault-tolerant updates

New files: MemoryStore (pure file I/O), Dream class, DreamConfig,
/dream and /dream-log commands. 89 tests covering all components.
2026-04-02 22:42:25 +08:00
chengyongru
da08dee144 feat(provider): show cache hit rate in /status (#2645) 2026-04-02 12:51:45 +08:00
RongLei
c5f0997381 fix: refresh copilot token before requests
Address PR review feedback by no longer passing an async method reference as the OpenAI client api_key.

Initialize the client with a placeholder key, refresh the Copilot token before each chat/chat_stream call, and update the runtime client api_key before dispatch.

Add a regression test that verifies the client api_key is refreshed to a real string before chat requests.

Generated with GitHub Copilot, GPT-5.4.
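
The refresh-before-dispatch pattern can be sketched as below. The client and refresh function are illustrative stand-ins, not the real OpenAI SDK or GitHub token-exchange wiring.

```python
# Sketch: start with a placeholder key, then swap in a freshly exchanged
# Copilot token on the client before every chat dispatch.
class FakeClient:
    def __init__(self, api_key: str):
        self.api_key = api_key

def refresh_copilot_token() -> str:
    # Stand-in for the real GitHub -> Copilot token exchange.
    return "copilot-token-123"

client = FakeClient(api_key="placeholder")

def chat(prompt: str) -> str:
    client.api_key = refresh_copilot_token()  # refresh before dispatch
    return f"ok: {prompt}"

chat("hello")
```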
2026-04-02 03:46:40 +08:00
RongLei
a37bc26ed3 fix: restore GitHub Copilot auth flow
Implement the real GitHub device flow and Copilot token exchange for the GitHub Copilot provider.

Also route github-copilot models through a dedicated backend and strip the provider prefix before API requests.

Add focused regression coverage for provider wiring and model normalization.

Generated with GitHub Copilot, GPT-5.4.
2026-04-02 03:46:40 +08:00
Xubin Ren
5635907e33 feat(api): load serve settings from config
Read serve host, port, and timeout from config by default, keep CLI flags higher priority, and bind the API to localhost by default for safer local usage.
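
The precedence described above (CLI flag over config over default) can be sketched as a small resolver; the field names and the `127.0.0.1` default are taken from the commit message, the function shape is an assumption.

```python
# Hedged sketch of serve-host resolution: CLI flag wins, then config,
# then the safe localhost default.
from typing import Optional

def resolve_host(cli_host: Optional[str], config_host: Optional[str]) -> str:
    if cli_host is not None:
        return cli_host
    if config_host is not None:
        return config_host
    return "127.0.0.1"  # bind locally by default

print(resolve_host(None, None))  # 127.0.0.1
```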
2026-03-29 15:32:33 +00:00
MrBob
b26a93c14a fix: preserve cron reminder context for notifications 2026-03-24 15:56:23 -03:00
Xubin Ren
3dfdab704e refactor: replace litellm with native openai + anthropic SDKs
- Remove litellm dependency entirely (supply chain risk mitigation)
- Add AnthropicProvider (native SDK) and OpenAICompatProvider (unified)
- Merge CustomProvider into OpenAICompatProvider, delete custom_provider.py
- Add ProviderSpec.backend field for declarative provider routing
- Remove _resolve_model, find_gateway, find_by_model (dead heuristics)
- Pass resolved spec directly into provider — zero internal lookups
- Stub out litellm-dependent model database (cli/models.py)
- Add anthropic>=0.45.0 to dependencies, remove litellm
- 593 tests passed, net -1034 lines
2026-03-25 01:58:48 +08:00
chengyongru
72acba5d27 refactor(tests): optimize unit test structure 2026-03-24 15:12:22 +08:00