EPIC: M13-SpeechServices — TTS & STT Integration #388
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Overview
Integrate Text-to-Speech (TTS) and Speech-to-Text (STT) capabilities into Mosaic Stack using a tiered, OpenAI-compatible API architecture.
Architecture
All speech providers expose OpenAI-compatible endpoints ( for TTS, for STT), enabling a single NestJS integration pattern via the npm package with configurable base URLs.
TTS Tiers
STT
Key Insight
Speaches can serve both STT (faster-whisper) and TTS (Kokoro/Piper) in a single container for simplified deployment.
Research
Full research: jarvis-brain
Scope
Issues
Track sub-issues in this milestone.
M13-SpeechServices milestone complete. All 18 sub-issues (#389-#406) implemented and closed. 62 files changed, 13,613 lines added. 500+ tests across API and web packages. Branch: feature/m13-speech-services. PR to develop pending.