Streaming text to speech
Convert text to speech with real-time streaming audio delivery.
Key Features:
- Low-latency audio streaming - playback can start immediately as chunks arrive
- Automatic language detection
- Multiple format support: MP3 or PCM/WAV
- High-quality audio: 24kHz sample rate, 16-bit mono
Request Body:
- text: Input text to convert to speech - required
- language: Language code (default: “auto”) - “auto” for automatic detection, or ISO 639-1 codes: en, zh, hi, es, ar, bn, pt, ru, ja, pa, de, ko, fr, tr, it, th, pl, nl, id, vi, ur
- format: Audio format (default: “mp3”) - “mp3” for compressed MPEG audio (96 kbps) or “pcm” for uncompressed WAV
- model: TTS model identifier (default: “auto”) - “auto” selects the default provider; a registry-resolvable id dispatches to the matching provider (note: not all providers support format: “mp3”)
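As an illustration of the request body fields, the following sketch validates the documented values and serializes them. It assumes the body is sent as JSON, which this page does not state explicitly; the helper name build_tts_body is hypothetical.

```python
import json

# Language codes and formats as listed in the request body description.
SUPPORTED_LANGUAGES = {
    "auto", "en", "zh", "hi", "es", "ar", "bn", "pt", "ru", "ja", "pa",
    "de", "ko", "fr", "tr", "it", "th", "pl", "nl", "id", "vi", "ur",
}
SUPPORTED_FORMATS = {"mp3", "pcm"}


def build_tts_body(text, language="auto", audio_format="mp3", model="auto"):
    """Return the request body as UTF-8 JSON bytes (JSON is an assumption)."""
    if not text:
        raise ValueError("text is required")
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language: {language!r}")
    if audio_format not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported format: {audio_format!r}")
    payload = {
        "text": text,
        "language": language,
        "format": audio_format,
        "model": model,
    }
    return json.dumps(payload).encode("utf-8")
```

Validating on the client side is optional; it only surfaces obvious mistakes before a request is made.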
Response:
- Streams audio chunks in real-time
- Returns X-Language-Code header with detected/used language
- Content-Type: audio/mpeg (MP3) or audio/wav (PCM)
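The response behavior described above can be handled along these lines. This is a minimal sketch: it reads from any binary file-like object, such as the response returned by urllib.request.urlopen, and the helper names iter_audio_chunks and extension_for are hypothetical.

```python
from typing import BinaryIO, Iterator

# Maps the documented Content-Type values to a file extension.
EXTENSION_BY_CONTENT_TYPE = {"audio/mpeg": ".mp3", "audio/wav": ".wav"}


def iter_audio_chunks(stream: BinaryIO, chunk_size: int = 4096) -> Iterator[bytes]:
    """Yield audio chunks as they become available on the stream."""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        yield chunk


def extension_for(content_type: str) -> str:
    """Pick a file extension from the Content-Type header value."""
    media_type = content_type.split(";", 1)[0].strip().lower()
    return EXTENSION_BY_CONTENT_TYPE[media_type]


# Usage sketch (not executed here; `request` is a prepared urllib request):
#   with urllib.request.urlopen(request) as resp:
#       language = resp.headers.get("X-Language-Code")
#       suffix = extension_for(resp.headers.get("Content-Type", ""))
#       with open("speech" + suffix, "wb") as out:
#           for chunk in iter_audio_chunks(resp):
#               out.write(chunk)  # or hand the chunk to an audio player
```

Processing each chunk as it arrives, rather than reading the whole body first, is what allows playback to begin before synthesis has finished.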
Use Cases:
- Real-time applications requiring immediate audio playback
- Interactive voice responses
- Low-latency text-to-speech scenarios
Authorizations
Gateway auth: send Authorization: Bearer <mka1-api-key>. For multi-user server-side integrations, you can also send X-On-Behalf-Of: <external-user-id>.
Headers
X-On-Behalf-Of: Optional external end-user identifier forwarded by the API gateway.
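The authorization and header rules above might be applied as in the following sketch. The endpoint URL is a placeholder, since this page does not give the request path, and the JSON Content-Type is an assumption; build_headers and build_request are hypothetical helper names.

```python
import urllib.request
from typing import Dict, Optional

# Placeholder URL: the actual endpoint path is not stated on this page.
TTS_STREAM_URL = "https://api.example.com/tts/stream"


def build_headers(api_key: str, on_behalf_of: Optional[str] = None) -> Dict[str, str]:
    """Build gateway auth headers, optionally naming an external end user."""
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",  # assumption: JSON request body
    }
    if on_behalf_of is not None:
        # Only relevant for multi-user server-side integrations.
        headers["X-On-Behalf-Of"] = on_behalf_of
    return headers


def build_request(body: bytes, api_key: str,
                  on_behalf_of: Optional[str] = None) -> urllib.request.Request:
    """Prepare (but do not send) a POST request carrying the TTS body."""
    return urllib.request.Request(
        TTS_STREAM_URL,
        data=body,
        headers=build_headers(api_key, on_behalf_of),
        method="POST",
    )
```

Keeping the X-On-Behalf-Of header optional mirrors the documentation: single-user integrations send only the bearer token.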
Body
Request parameters for streaming text-to-speech conversion with language selection and format options.
- text: Input text to convert to speech
- language: Language code: 'auto' for automatic detection, or ISO 639-1 language codes. Available options: auto, en, zh, hi, es, ar, bn, pt, ru, ja, pa, de, ko, fr, tr, it, th, pl, nl, id, vi, ur
- model: TTS model identifier. Defaults to 'auto', which selects the default provider. Registry-resolvable ids dispatch to the matching provider.
- format: Audio output format: 'mp3' for MPEG audio (default), or 'pcm' for uncompressed PCM/WAV (24kHz, 16-bit, mono). Available options: mp3, pcm
Response
Streaming audio in MP3 or PCM/WAV format with language code header
MP3 audio stream (when format=mp3): 24kHz sample rate, 96 kbps bitrate, 16-bit, mono
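The stated audio parameters imply simple data rates: 24kHz, 16-bit mono PCM is 24000 × 2 × 1 = 48000 bytes per second, and 96 kbps MP3 is 96000 / 8 = 12000 bytes per second. The sketch below uses these to estimate how much audio has been received so far. It assumes the MP3 stream is constant bitrate and ignores any WAV header bytes, neither of which this page confirms; estimate_duration_seconds is a hypothetical helper.

```python
SAMPLE_RATE_HZ = 24_000   # documented sample rate
BYTES_PER_SAMPLE = 2      # 16-bit samples
CHANNELS = 1              # mono
MP3_BITRATE_BPS = 96_000  # documented 96 kbps

PCM_BYTES_PER_SECOND = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS  # 48000
MP3_BYTES_PER_SECOND = MP3_BITRATE_BPS // 8                          # 12000


def estimate_duration_seconds(num_bytes: int, audio_format: str) -> float:
    """Approximate playback time for a number of received audio bytes."""
    if audio_format == "pcm":
        return num_bytes / PCM_BYTES_PER_SECOND
    if audio_format == "mp3":
        # Assumes constant bitrate; a variable-bitrate stream would differ.
        return num_bytes / MP3_BYTES_PER_SECOND
    raise ValueError(f"unsupported format: {audio_format!r}")
```

At these rates PCM needs four times the bandwidth of MP3 for the same duration, which is the main trade-off when choosing a format for a constrained connection.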