Endpoint for requesting text2audio inference
mode parameter:
custom_voice (default): Use a preset voice from the model’s voice library. Requires the voice parameter.
voice_clone: Clone a voice from a short reference audio clip. Requires the ref_audio parameter (3–10 seconds, max 10 MB). Optionally provide ref_text with a transcript of the reference audio for improved accuracy.
voice_design: Create a new voice from a natural language description. Requires the instruct parameter (e.g. "A warm female voice with a British accent").
slug, check specific limits and features, and verify available languages and voices.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
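The per-mode requirements described above can be checked on the client before sending a request. The following is a minimal sketch, not part of the API: the helper name missing_required_param is hypothetical, and it only assumes the parameter names mode, voice, ref_audio, and instruct given in this section.

```python
from typing import Optional

# Required parameter for each mode, as listed in the mode descriptions.
REQUIRED_BY_MODE = {
    "custom_voice": "voice",
    "voice_clone": "ref_audio",
    "voice_design": "instruct",
}


def missing_required_param(params: dict) -> Optional[str]:
    """Return the name of the missing required parameter, or None if present.

    Hypothetical client-side helper; the server performs its own validation.
    """
    mode = params.get("mode", "custom_voice")  # custom_voice is the default mode
    if mode not in REQUIRED_BY_MODE:
        raise ValueError(f"unknown mode: {mode!r}")
    required = REQUIRED_BY_MODE[mode]
    # An absent or empty value counts as missing.
    return None if params.get(required) else required
```

For example, missing_required_param({"mode": "voice_clone"}) reports that ref_audio is missing, while a request that omits mode is checked against the custom_voice rules.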
application/json
Audio generation parameters. Supports three TTS modes: custom_voice (default, preset speakers), voice_clone (clone from reference audio), voice_design (create voice from description).
Text to be converted to speech
"A beautiful sunset over mountains"
The model to use for speech generation. Available models can be retrieved via the GET /api/v1/client/models endpoint.
"Kokoro"
Language to be used during audio generation
"en-us"
Speech speed of the generated audio
1
Audio output format
"flac"
Sample rate of generated audio
24000
TTS mode: custom_voice (default), voice_clone, or voice_design. Determines which fields are required.
custom_voice, voice_clone, voice_design
"custom_voice"
Name of the voice to be used. Required for custom_voice mode.
"af_sky"
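As an illustration, the example values above can be assembled into a JSON body for custom_voice mode. This is a sketch under stated assumptions: the field names text, model, lang, speed, format, and sample_rate are guesses, since this section names only mode, voice, ref_audio, ref_text, and instruct explicitly, and the Authorization header name is assumed from the standard Bearer scheme. Check the endpoint's schema for the actual names.

```python
import json


def build_custom_voice_payload(text: str, model: str, voice: str) -> dict:
    """Build a request body for custom_voice mode from the documented examples."""
    return {
        "text": text,            # assumed field name
        "model": model,          # assumed field name
        "lang": "en-us",         # assumed field name; example value from this page
        "speed": 1,              # assumed field name
        "format": "flac",        # assumed field name
        "sample_rate": 24000,    # assumed field name
        "mode": "custom_voice",  # named in this section
        "voice": voice,          # named in this section; required for custom_voice
    }


def auth_headers(token: str) -> dict:
    """Bearer authentication header of the form 'Bearer <token>'."""
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }


payload = build_custom_voice_payload("A beautiful sunset over mountains", "Kokoro", "af_sky")
body = json.dumps(payload)  # serialized body, ready to send with any HTTP client
```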
Reference audio file for voice cloning. Supported formats: mp3, wav, flac, ogg, m4a. Max 10 MB. Duration must be between 3 and 10 seconds (model-specific limits may apply). Required for voice_clone mode.
Optional transcript of the reference audio for improved voice cloning accuracy.
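The reference-audio limits above can be pre-checked locally before uploading. The sketch below is not part of the API; the function name ref_audio_problems is hypothetical, the caller is expected to supply the clip duration (for example from an audio library), and "10 MB" is read here as 10 × 1024 × 1024 bytes, which is an assumption. Model-specific limits may be stricter.

```python
import os

ALLOWED_EXTENSIONS = {".mp3", ".wav", ".flac", ".ogg", ".m4a"}
MAX_BYTES = 10 * 1024 * 1024  # assumption: "10 MB" interpreted as 10 MiB
MIN_SECONDS, MAX_SECONDS = 3.0, 10.0


def ref_audio_problems(filename: str, size_bytes: int, duration_seconds: float) -> list:
    """Return a list of human-readable problems; an empty list means the clip passes."""
    problems = []
    ext = os.path.splitext(filename)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or '(none)'}")
    if size_bytes > MAX_BYTES:
        problems.append("file larger than 10 MB")
    if not MIN_SECONDS <= duration_seconds <= MAX_SECONDS:
        problems.append("duration outside 3-10 seconds")
    return problems
```

Returning a list rather than raising on the first failure lets a client report every problem with a clip at once.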
Natural language voice description for voice_design mode (e.g. "A warm female voice with a British accent"), or style/emotion control in custom_voice mode.
Optional HTTPS URL to receive webhook notifications for job status changes (processing, completed, failed). Must be HTTPS. Max 2048 characters. See Webhook Documentation for payload structure and authentication details.
Maximum length: 2048
"https://your-server.com/webhooks/deapi"
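The two webhook URL constraints stated above (HTTPS only, at most 2048 characters) are simple to verify before submitting a job. A minimal sketch using Python's standard urllib.parse; the function name is_valid_webhook_url is hypothetical, and the server may apply further checks not described here.

```python
from urllib.parse import urlparse

MAX_WEBHOOK_URL_LENGTH = 2048


def is_valid_webhook_url(url: str) -> bool:
    """Check the documented constraints: HTTPS scheme and max 2048 characters."""
    if len(url) > MAX_WEBHOOK_URL_LENGTH:
        return False
    parsed = urlparse(url)
    # Require the https scheme and a non-empty host part.
    return parsed.scheme == "https" and bool(parsed.netloc)
```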
ID of the inference request.
Information from success endpoint