Create /api/speech/synthesize REST endpoint #396

New Issue

jason.woltje · 2026-02-15T07:34:35Z

jason.woltje commented

2026-02-15 07:34:35 +00:00

Description

Create REST endpoint for text-to-speech synthesis via the SpeechService.

Endpoint

===============================================================================
flac - Command-line FLAC encoder/decoder version 1.5.0
Copyright (C) 2000-2009 Josh Coalson
Copyright (C) 2011-2025 Xiph.Org Foundation

This program is free software; you can redistribute it and/or
modify it under the terms of the GNU General Public License
as published by the Free Software Foundation; either version 2
of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License along
with this program; if not, write to the Free Software Foundation, Inc.,
51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.

This is the short help; for all options use 'flac --help'; for more explanation
and examples please consult the manual. This manual is often distributed
alongside the program as a man page or an HTML file. It can also be found
online at https://xiph.org/flac/documentation_tools_flac.html

To encode:
flac [-#] [INPUTFILE [...]]

-# is -0 (fastest compression) to -8 (highest compression); -5 is the default

To decode:
flac -d [INPUTFILE [...]]

To test:
flac -t [INPUTFILE [...]]

Implementation

(add to existing controller)
Provider selection via request body or default config
Text length validation (configurable max, default 4096 chars)
Rate limiting per workspace
Requires authentication (workspace-scoped)
Streaming response option via Accept header

Acceptance Criteria

POST /api/speech/synthesize returns audio
Provider selection (default/premium/lightweight)
Voice selection
Speed control
Format selection
Text length validation
Workspace-scoped authentication
Unit + integration tests

## Description Create REST endpoint for text-to-speech synthesis via the SpeechService. ## Endpoint =============================================================================== flac - Command-line FLAC encoder/decoder version 1.5.0 Copyright (C) 2000-2009 Josh Coalson Copyright (C) 2011-2025 Xiph.Org Foundation This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA. =============================================================================== This is the short help; for all options use 'flac --help'; for more explanation and examples please consult the manual. This manual is often distributed alongside the program as a man page or an HTML file. It can also be found online at https://xiph.org/flac/documentation_tools_flac.html To encode: flac [-#] [INPUTFILE [...]] -# is -0 (fastest compression) to -8 (highest compression); -5 is the default To decode: flac -d [INPUTFILE [...]] To test: flac -t [INPUTFILE [...]] ## Implementation - (add to existing controller) - Provider selection via request body or default config - Text length validation (configurable max, default 4096 chars) - Rate limiting per workspace - Requires authentication (workspace-scoped) - Streaming response option via Accept header ## Acceptance Criteria - [ ] POST /api/speech/synthesize returns audio - [ ] Provider selection (default/premium/lightweight) - [ ] Voice selection - [ ] Speed control - [ ] Format selection - [ ] Text length validation - [ ] Workspace-scoped authentication - [ ] Unit + integration tests

jason.woltje added this to the M13-SpeechServices (0.0.13) milestone 2026-02-15 07:34:35 +00:00

jason.woltje closed this issue

2026-02-15 09:27:34 +00:00

jason.woltje commented

2026-02-15 09:30:13 +00:00

Completed as part of M13-SpeechServices milestone on branch feature/m13-speech-services. SP-EP-002: /api/speech/synthesize REST endpoint (commit 527262a, 17 tests). All quality gates passed (lint, typecheck, tests, security).

Completed as part of M13-SpeechServices milestone on branch feature/m13-speech-services. SP-EP-002: /api/speech/synthesize REST endpoint (commit 527262a, 17 tests). All quality gates passed (lint, typecheck, tests, security).

Sign in to join this conversation.

Branches Tags

main

fix/ci-glibc-image

fix/dockerfile-npmrc

fix/matrix-native-binary

fix/kaniko-cache

fix/base-image-kaniko-v2

fix/base-image-kaniko

feat/custom-base-image

ci/pnpm-cache

fix/interceptor-tests

fix/kanban-tests

feat/wire-chat

feat/usage-widget

fix/security-hardening

fix/project-domain-v2

feat/kanban-add-task

fix/project-domain-attach

fix/logs-page-clean

fix/workspace-members

fix/ci-lint-632

fix/file-manager-tags

fix/csrf-debug-log

fix/controller-type-imports

fix/system-admin-env

fix/gateway-cors-trusted-origins

feat/project-detail-page

fix/fleet-provider-form-dto-v2

fix/ms22-audit

fix/orchestrator-widgets

fix/fleet-provider-form-dto

fix/csrf-bearer-bypass

fix/ms22-missing-authmodule-imports

fix/container-lifecycle-config-module

fix/swarm-compose-ms22-vars

chore/ms22-p1-complete

feat/ms22-p1h-settings-ui

feat/ms22-p1f-onboarding-ui

feat/ms22-p1i-chat-proxy

feat/ms22-p1k-idle-reaper

feat/ms22-p1j-docker

feat/ms22-p1e-onboarding-api

feat/ms22-p1g-settings-api

feat/ms22-p1d-container-mgr

feat/ms22-p1c-config-api

chore/ms22-prd-tracking

feat/ms22-p1a-schema

feat/ms22-p1b-crypto

chore/ms22-p1-tasks

docs/ms22-architecture

feat/ms22-openclaw-docker

feat/ms22-openclaw-gateway-module

chore/ms21-complete

chore/ms21-final-tasks-done

fix/ms21-ui-001-qa

test/ms21-ui-tests

chore/ms21-tasks-sync

chore/ms22-phase0-complete

feat/ms22-ingest-clean

feat/ms21-ui-users-members

feat/ms22-task-agent

chore/tasks-final

chore/tasks-update

feat/ms21-session-invalidation

feat/ms21-rbac-settings

feat/ms21-teams-page

feat/ms21-users-page

feat/ms19-terminal-persistence

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: mosaic/stack#396