mirror of https://github.com/HKUDS/nanobot.git synced 2026-06-15 07:14:08 +00:00

docs: make onboarding friendlier for beginners (#4177 )

* docs: make onboarding friendlier for beginners

* docs: build clearer documentation paths

Maintainer edit: turn the onboarding follow-up into a layered docs structure for first-time setup, provider selection, troubleshooting, CLI reference, and source-level architecture. This keeps quick start focused while giving advanced users precise reference paths.

* docs: render architecture flow with mermaid

Maintainer edit: replace the ASCII architecture sketch with a GitHub-rendered Mermaid flowchart so the core runtime path is easier to scan in the PR and README docs.

* docs: recommend model presets for model config

Maintainer edit: make named modelPresets the primary model configuration path and expand fallback preset examples so string fallbacks are clearly preset names, not raw model IDs.

* docs: document api base urls and langfuse setup

Maintainer edit: explain when users need apiBase/base URL in quick start and provider docs, and add Langfuse tracing setup with troubleshooting links.

* docs: use python module pip consistently

Maintainer edit: keep install commands tied to the active Python interpreter by using python -m pip in the Azure optional dependency notes too.

* docs: add non-technical getting started path

Maintainer edit: add a wizard-first guide for users without terminal or JSON background, including a text TUI menu example and links from the main docs entrypoints.

* docs: avoid hard-wrapped prose in user docs

Maintainer edit: unwrap ordinary prose across user-facing documentation while preserving markdown structure, code blocks, tables, lists, and prompt/template files.

* docs: keep desktop list continuations nested

Maintainer edit: preserve list nesting after unwrapping prose in the desktop WebUI sync guide.

* docs: add one-command installer

Maintainer edit: add auditable macOS/Linux and Windows install scripts that install nanobot-ai and start the onboarding wizard, then document the commands in the main onboarding entrypoints.

* docs: add installer dry run mode

Maintainer edit: add --dry-run to the one-command installer scripts so users can preview Python detection, install source, pip command, and wizard behavior without changing their environment.

* docs: clean installer error output

Maintainer edit: make PowerShell installer failures print a concise Error: message instead of Write-Error call-site details.

* docs: add provider setup cookbook

Maintainer edit: add pasteable provider recipes for common hosted, local, fallback, runtime switching, and Langfuse setups, then link the cookbook from onboarding and troubleshooting entrypoints.

* docs: address review feedback

* docs: clarify reader paths

* docs: explain terminal basics for beginners

* docs: clarify wizard navigation

* docs: avoid duplicate onboarding steps

* docs: add setup status check

* docs: explain status output

* docs: remove provider recommendation wording

* docs: explain status diagnostics

* docs: reduce hard-wrapped guidance

* docs: migrate config examples to presets

* docs: clarify python command fallbacks

* docs: improve installer failure recovery

* docs: expand install troubleshooting

* docs: cover installer download failures

* docs: put stable install paths first

* docs: add bundled webui quick path

* docs: clarify provider-neutral setup

* docs: clarify gateway setup for chat surfaces

* docs: improve docs navigation paths

* docs: add configuration quick jump

* docs: clarify provider secret variables

* chore: request PR review acknowledgement

Empty commit: please read the PR review comments and reply on the PR to confirm that you have received them.

This commit intentionally changes no files; it exists only to notify the remote Codex run so it can end its active goal.

* docs: add README start here guide

* docs: avoid provider recommendation wording

* docs: guide next steps after first reply

* docs: explain merging JSON snippets

* docs: add CLI command chooser

* docs: add configuration task map

* docs: add deployment readiness guide

* docs: simplify WebUI entry paths

* docs: add provider recipe chooser

* docs: fix provider factual references

Update OpenRouter and LongCat model examples, align Bedrock guidance, and make fallback snippets schema-valid.

Also correct group policy wording and image-generation provider lists to match the current code.

* fix: keep PowerShell installer from closing caller shell

* docs: mention self-guided configuration

2026-06-10 00:36:22 +08:00

8.3 KiB

Raw Blame History

Architecture

This page maps nanobot's runtime behavior to source files. Use it when you are debugging internals, reviewing a PR, adding a provider/channel/tool, or trying to understand where a user-visible behavior comes from.

For the product-level mental model, read concepts.md first.

Core Flow

flowchart LR
    Channel["Channel<br/>CLI, WebUI, chat apps"] --> Bus["MessageBus<br/>InboundMessage"]
    Bus --> Loop["AgentLoop<br/>session, workspace, context"]
    Loop --> Runner["AgentRunner<br/>provider/tool loop"]
    Runner --> Provider["Provider<br/>LLM backend"]
    Provider --> Runner
    Runner --> Tools["Tools<br/>files, shell, web, MCP, cron"]
    Tools --> Runner
    Runner --> Loop
    Loop --> Outbound["MessageBus<br/>OutboundMessage"]
    Outbound --> Channel

    Loop -. reads/writes .-> State["Session, memory,<br/>hooks, skills, templates"]

Main files:

Area	Files
Message events and queue	`nanobot/bus/events.py`, `nanobot/bus/queue.py`
Turn orchestration	`nanobot/agent/loop.py`
Provider/tool conversation loop	`nanobot/agent/runner.py`
Context construction	`nanobot/agent/context.py`
Session storage and compaction	`nanobot/session/manager.py`
Long-term memory and Dream	`nanobot/agent/memory.py`

Agent Loop vs Agent Runner

AgentLoop owns the channel-facing turn:

receives inbound messages;
determines the effective session and workspace scope;
builds context;
wires hooks, progress, and channel metadata;
publishes outbound messages.

AgentRunner owns the model-facing loop:

sends messages to the selected provider;
handles streaming deltas and reasoning blocks;
executes tool calls;
feeds tool results back into the model;
stops when a final answer is produced or runtime limits are hit.

Keep this split in mind when debugging. If a problem is about channel routing, session keys, workspace selection, or outbound delivery, start in agent/loop.py. If it is about provider calls, tool calls, streaming, or iteration limits, start in agent/runner.py.

Providers

Provider metadata is centralized in nanobot/providers/registry.py. Configuration fields live in nanobot/config/schema.py.

Provider selection uses:

explicit agents.defaults.provider or preset provider;
provider registry keywords;
API key prefixes and API base URL hints;
local provider fallback when apiBase is configured;
gateway fallback for providers that can route many model families.

Provider implementations live in nanobot/providers/. Most hosted providers use the OpenAI-compatible implementation, while Anthropic, Azure OpenAI, AWS Bedrock, OpenAI Codex, and GitHub Copilot have specialized paths.

Useful docs:

providers.md for practical setup;
configuration.md#providers for exact provider reference.

Channels

Channels translate external platforms into InboundMessage events and send OutboundMessage events back to the platform.

Main files:

Area	Files
Base channel contract	`nanobot/channels/base.py`
Built-in channels	`nanobot/channels/*.py`
Discovery and lifecycle	`nanobot/channels/manager.py`
WebSocket/WebUI channel	`nanobot/channels/websocket.py`

Channels are discovered through built-in module scanning and plugin entry points. A custom channel should follow channel-plugin-guide.md.

WebUI and Gateway

nanobot gateway starts:

enabled chat channels;
the WebSocket channel when configured;
workspace-scoped cron service;
system jobs such as Dream and heartbeat;
the health endpoint on gateway.port.

The packaged WebUI is served by the WebSocket channel, not the health endpoint:

Surface	Default
Health endpoint	`http://127.0.0.1:18790/health`
WebUI/WebSocket	`http://127.0.0.1:8765`

WebUI source lives in webui/. The production build is written to nanobot/web/dist/ and bundled into the wheel.

Useful docs:

../webui/README.md for WebUI use and development;
websocket.md for protocol details.

Tools

Tools are discovered from nanobot/agent/tools/ and plugin entry points.

Important files:

Tool area	Files
Tool base and schema	`nanobot/agent/tools/base.py`, `nanobot/agent/tools/schema.py`
Discovery	`nanobot/agent/tools/registry.py`
Shell execution	`nanobot/agent/tools/shell.py`
Filesystem tools	`nanobot/agent/tools/filesystem.py`
Web search/fetch	`nanobot/agent/tools/web.py`
MCP tools	`nanobot/agent/tools/mcp.py`
Cron	`nanobot/agent/tools/cron.py`, `nanobot/cron/`
Image generation	`nanobot/agent/tools/image_generation.py`
Runtime self-inspection	`nanobot/agent/tools/self.py`

Tool behavior is part of the model contract. Keep user-visible tool names, schemas, and error messages stable unless a change is intentional.

Config and Paths

The config schema lives in nanobot/config/schema.py. Loading and saving live in nanobot/config/loader.py. Runtime path helpers live in nanobot/config/paths.py.

Defaults:

Path	Default
Config	`~/.nanobot/config.json`
Workspace	`~/.nanobot/workspace/`
Sessions	`<workspace>/sessions/*.jsonl`
Memory	`<workspace>/memory/`
Cron store	`<workspace>/cron/jobs.json`
WebUI/media/log runtime data	config directory subdirectories such as `webui/`, `media/`, and `logs/`

The schema accepts both camelCase and snake_case keys, but saves config with camelCase aliases.

Memory and Sessions

Session history is the near-term conversation replay. Memory is the longer-term workspace state.

Store	File area
Session JSONL files	`<workspace>/sessions/`
Long-term memory	`<workspace>/memory/MEMORY.md`
Consolidation source history	`<workspace>/memory/history.jsonl`
Bootstrap identity files	`<workspace>/SOUL.md`, `<workspace>/USER.md`, templates under `nanobot/templates/`

Dream is implemented in nanobot/agent/memory.py and scheduled by the runtime when enabled.

Security Boundaries

Security-sensitive code paths include:

Boundary	Files
Workspace scope	`nanobot/security/workspace_access.py`, `nanobot/security/workspace_policy.py`
Shell sandboxing	`nanobot/agent/tools/shell.py`
SSRF/network checks	`nanobot/security/network.py`, `nanobot/agent/tools/web.py`
PTH guard and CLI startup security	`nanobot/security/` and CLI entrypoints
Channel access control	channel config in `nanobot/channels/*.py`

When changing tools, channels, file access, WebUI workspace behavior, or network fetching, treat security as part of the functional behavior and update docs if the user-facing boundary changes.

Extension Points

Extension	How
Provider	Add `ProviderSpec` in `providers/registry.py`, add schema field in `config/schema.py`, implement provider only if the generic backend is not enough
Channel	Implement `BaseChannel`, expose an entry point, follow `channel-plugin-guide.md`
Tool	Implement a tool under `agent/tools/` or expose a plugin entry point
MCP	Add `tools.mcpServers` config
Skill	Add workspace skill files under `<workspace>/skills/` or built-in skills under `nanobot/skills/`

Prefer existing registry/discovery patterns over ad hoc wiring.

Testing and Verification

Common checks:

pytest tests/test_openai_api.py::test_function -v
ruff check nanobot/
cd webui && bun run test
cd webui && bun run build

Choose tests based on the changed surface:

Change	Minimum useful verification
Provider behavior	Provider unit tests or a mocked API path; `nanobot agent -m "Hello!"` with safe config when possible
Channel behavior	Channel tests plus `nanobot gateway` startup path
WebUI behavior	WebUI tests/build and, for routing/settings/chat changes, browser-level verification through the gateway
Tool behavior	Tool unit tests and an agent-run path when schema or model-facing behavior changes
Docs	Link checks, command accuracy against CLI/schema, and `git diff --check`

For user-facing flows, prefer at least one verification path through the public surface the user actually touches: CLI command, HTTP endpoint, WebSocket/WebUI, chat channel, or packaged import.

8.3 KiB Raw Blame History