Generate a realistic talking video by combining a static portrait image with an audio file using Veed Fabric 1.0. The model produces a lip-synced video (MP4) where the person in the image speaks naturally in sync with the provided audio. Ideal for creating talking head videos from a single photo and voice recording.
Your Freepik API key. Required for authentication. Learn how to obtain an API key
Output video resolution. Controls the quality and dimensions of the generated talking video.
720p: HD resolution (1280x720). Best for high-quality previews and production use.480p: SD resolution (854x480). Suitable for drafts and faster processing.720p, 480p URL of the audio file for lip synchronization. The person in the image will speak in sync with this audio. Must be a publicly accessible URL. Supported formats include MP3, WAV, and M4A.
"https://example.com/speech-audio.mp3"
URL of the portrait image to animate into a talking video. The image should contain a clearly visible face for best lip-sync results. Must be a publicly accessible URL.
"https://example.com/portrait.jpg"
Optional callback URL that will receive asynchronous notifications whenever the task changes status. The payload sent to this URL is the same as the corresponding GET endpoint response, but without the data field.
"https://www.example.com/webhook"
OK - The task exists and the status is returned
{
"task_id": "046b6c7f-0b8a-43b9-b35d-6489e6daee91",
"status": "CREATED",
"generated": [
"https://openapi-generator.tech",
"https://openapi-generator.tech"
]
}