Endpoint for requesting audio2video inference
slug, check specific limits and features.Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
application/json Audio to video generation parameters
The main prompt for video generation
"A beautiful sunset over mountains with dramatic music"
Audio file to condition the video generation. Supported formats: MP3, WAV, OGG, FLAC. Maximum file size: 20 MB.
Width of the generated video in pixels
512
Height of the generated video in pixels
512
Number of video frames to generate
97
Random seed for generation
42
The model to use for video generation. Available models can be retrieved via the GET /api/v1/client/models endpoint.
"Ltx2_19B_Dist_FP8"
Elements to avoid in the generated video
"blur, darkness, noise"
Image for the first frame of video (optional). Supported formats: JPG, JPEG, PNG, GIF, BMP, WebP. Maximum file size: 10 MB.
Image for the last frame of video (optional). Supported formats: JPG, JPEG, PNG, GIF, BMP, WebP. Maximum file size: 10 MB.
Guidance scale for the generation
7.5
Number of inference steps
20
FPS of generated video
24
Optional HTTPS URL to receive webhook notifications for job status changes (processing, completed, failed). Must be HTTPS. Max 2048 characters. See Webhook Documentation for payload structure and authentication details.
2048"https://your-server.com/webhooks/deapi"
ID of the inference request.
Information from success endpoint