nanobot

mirror of https://github.com/HKUDS/nanobot.git synced 2026-06-15 15:24:06 +00:00

History

Ilia Breitburg 0eb3010e40 feat(transcription): configurable STT model + OpenRouter provider

Add a `transcriptionModel` channel setting and an OpenRouter transcription
backend so voice messages can be transcribed through OpenRouter's
speech-to-text endpoint (e.g. nvidia/parakeet-tdt-0.6b-v3, openai/whisper-1),
alongside the existing Groq/OpenAI Whisper providers.

- schema: add channels.transcriptionModel (None = provider default)
- providers/transcription: extract a shared POST/retry skeleton; add a
  JSON+base64 OpenRouterTranscriptionProvider; make the STT model a
  constructor param on all providers instead of hardcoding it
- channels: route transcriptionProvider="openrouter" and thread the model
  through the manager to each channel
- docs + tests

Only dedicated STT models work on OpenRouter's transcription endpoint;
chat LLMs (e.g. google/gemini-3.5-flash) are rejected there.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

2026-06-09 04:01:37 +08:00

test_mcp_presets_api.py

feat(apps): unify CLI apps and MCP (#3991 )

2026-05-25 20:07:02 +08:00

test_mcp_presets_runtime.py

feat(mcp): add preset setup and capability mentions

2026-05-24 19:43:20 +08:00

test_settings_api.py

feat(transcription): configurable STT model + OpenRouter provider

2026-06-09 04:01:37 +08:00

test_token_usage.py

feat(desktop): polish desktop shell and shared WebUI surfaces (#4195 )

2026-06-06 19:49:33 +08:00

test_transcription_ws.py

feat(transcription): add shared voice input support (#4232 )

2026-06-09 01:08:49 +08:00