Endpoint for calculating price for text2audio inference. Either text or count_text must be provided.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
application/json Audio generation parameters. Either text or count_text must be provided. Voice is optional (not needed for voice_clone or voice_design modes).
The model to use for speech generation. Available models can be retrieved via the GET /api/v1/client/models endpoint.
"Kokoro"
Language to be used during audio generation
"en-us"
Generated audio speech speed
1
Audio output format
"flac"
Sample rate of generated audio
24000
Text to be converted to speech. Either this or count_text must be provided.
"A beautiful sunset over mountains"
Number of characters for price calculation. Either this or text must be provided.
1000
TTS mode: custom_voice (default), voice_clone, or voice_design.
custom_voice, voice_clone, voice_design "custom_voice"
Name of the voice to be used. Only relevant for custom_voice mode.
"af_sky"
Natural language voice description for voice_design mode price calculation.
Calculated price for text2audio inference.