Endpoint for requesting text2audio inference
slug, check specific limits and features, and verify LoRA availability.Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
application/json Audio generation parameters
Text to be converted to speech
"A beautiful sunset over mountains"
The model to use for speech generation. Available models can be retrieved via the GET /api/v1/client/models endpoint.
"Kokoro"
Name of the voice to be used during audio generation
"af_sky"
Language to be used during audio generation
"en-us"
Generated audio speech speed
1
Audio output format
"flac"
Sample rate of generated audio
24000
ID of the inference request.
Information from success endpoint