Veed Fabric 1.0 integration

Veed Fabric 1.0 is an AI-powered lip synchronization API that creates realistic talking-head videos from a static portrait image and an audio file. The model analyzes the speech patterns in the audio and animates the person in the image with matching lip movements, producing an MP4 video in which the subject appears to speak in sync with the provided audio. Output resolution is configurable: 720p (1280x720) or 480p (854x480).

Key capabilities

  • Image + audio input: Combine a portrait photo with an audio file (MP3, WAV, M4A) to generate a talking video
  • Resolution options: Output at 720p (1280x720) for production use or 480p (854x480) for drafts and faster processing
  • Natural lip sync: AI-driven mouth movement generation that matches speech patterns accurately
  • MP4 video output: Generates a downloadable MP4 video file with the animated portrait
  • Async processing: Webhook notifications or polling for task completion

Use cases

  • Marketing videos: Create personalized spokesperson videos from a single photo
  • E-learning: Animate instructors for training materials and educational content
  • Social media: Generate talking head content at scale for multiple platforms
  • Customer support: Build video responses with consistent brand representatives
  • Localization: Produce dubbed videos with lip-synced speech in different languages

Generate talking video with Veed Fabric 1.0

Submit a portrait image URL and audio file URL along with the desired resolution. The service returns a task ID for async polling or webhook notification.

  • POST /v1/ai/lip-sync/veed-fabric-1-0: Create a new lip-sync video generation task
  • GET /v1/ai/lip-sync/veed-fabric-1-0: List all Veed Fabric 1.0 tasks
  • GET /v1/ai/lip-sync/veed-fabric-1-0/{task-id}: Get task status and results by ID
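
The endpoints above can be exercised with a small helper. The following is a minimal sketch, not an official client: the base URL (`https://api.example.com`) and the `X-Api-Key` header name are placeholders assumed for illustration, and the helper names are hypothetical. Only the paths and parameter names come from this page.

```python
import json
import urllib.request

# Assumptions (not stated on this page): the base URL and the API-key header
# name are placeholders. Replace them with the values from your account docs.
BASE_URL = "https://api.example.com"
ENDPOINT = "/v1/ai/lip-sync/veed-fabric-1-0"


def build_create_request(image_url, audio_url, resolution, api_key, webhook_url=None):
    """Build (but do not send) the POST request that creates a lip-sync task."""
    payload = {
        "image_url": image_url,
        "audio_url": audio_url,
        "resolution": resolution,
    }
    if webhook_url is not None:
        payload["webhook_url"] = webhook_url  # optional parameter
    return urllib.request.Request(
        BASE_URL + ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "X-Api-Key": api_key},
        method="POST",
    )


def build_status_request(task_id, api_key):
    """Build the GET request that fetches one task's status and results."""
    return urllib.request.Request(
        f"{BASE_URL}{ENDPOINT}/{task_id}",
        headers={"X-Api-Key": api_key},
        method="GET",
    )
```

Passing either request to `urllib.request.urlopen` would send it. The response schema is not described on this page, so the sketch deliberately stops short of parsing the task ID or status fields.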

Parameters

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| image_url | string | Yes | - | URL of the portrait image to animate. Must be publicly accessible. Use a clear, front-facing photo for best results. |
| audio_url | string | Yes | - | URL of the audio file for lip synchronization. Supported formats: MP3, WAV, M4A. Must be publicly accessible. |
| resolution | string | Yes | - | Output video resolution: "720p" (1280x720) or "480p" (854x480). |
| webhook_url | string | No | - | HTTPS URL for task completion notification. |
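
Checking these constraints on the client before submitting can save a round trip. The sketch below is one possible pre-flight check under stated assumptions: `validate_params` is a hypothetical helper, and the file-extension test is only a heuristic, since the page lists supported audio formats but does not say how the service detects them.

```python
from urllib.parse import urlparse

ALLOWED_RESOLUTIONS = {"720p", "480p"}
# Heuristic only: assumes the audio format is visible in the URL path.
AUDIO_EXTENSIONS = (".mp3", ".wav", ".m4a")


def validate_params(image_url, audio_url, resolution, webhook_url=None):
    """Return a request payload dict, or raise ValueError on invalid input."""
    for name, value in (("image_url", image_url), ("audio_url", audio_url)):
        if not isinstance(value, str):
            raise ValueError(f"{name} is required and must be a string")
        parsed = urlparse(value)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"{name} must be an absolute http(s) URL")

    if not urlparse(audio_url).path.lower().endswith(AUDIO_EXTENSIONS):
        raise ValueError("audio_url should point to an MP3, WAV, or M4A file")

    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError('resolution must be "720p" or "480p"')

    payload = {
        "image_url": image_url,
        "audio_url": audio_url,
        "resolution": resolution,
    }
    if webhook_url is not None:
        # The parameter table requires HTTPS for webhook URLs.
        if urlparse(webhook_url).scheme != "https":
            raise ValueError("webhook_url must use HTTPS")
        payload["webhook_url"] = webhook_url
    return payload
```

A local check like this cannot confirm that a URL is publicly accessible; that can only be verified by the service fetching the file.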

Frequently Asked Questions

What is Veed Fabric 1.0 and how does it work?
Veed Fabric 1.0 is an AI lip-sync API that generates realistic talking videos from a static portrait image and an audio file. You submit an image URL and audio URL via the API, receive a task ID immediately, then poll for results or receive a webhook notification when processing completes. The output is an MP4 video where the person in the image speaks in sync with the provided audio.

What kind of image should I use?
Veed Fabric 1.0 accepts common image formats via a publicly accessible URL. For best lip-sync results, use a clear, front-facing portrait photo where the face is fully visible and well-lit.

What audio formats are supported?
Veed Fabric 1.0 supports MP3, WAV, and M4A audio formats. The audio file must be hosted at a publicly accessible URL. The model analyzes speech patterns in the audio to generate matching lip movements.

What output resolutions are available?
Two resolutions are available: 720p (1280x720) for production-quality output and 480p (854x480) for drafts or faster processing. The resolution is set via the required resolution parameter.

What is the difference between Veed Fabric 1.0 and Veed Fabric 1.0 Fast?
Veed Fabric 1.0 prioritizes generation quality, with higher-fidelity lip synchronization. Veed Fabric 1.0 Fast is optimized for shorter generation time while still producing quality lip-sync results. Choose Veed Fabric 1.0 when output quality is the priority, or Veed Fabric 1.0 Fast when speed matters more.

What are the rate limits?
Rate limits vary by subscription tier. See Rate Limits for current limits.

How much does it cost?
See the Pricing page for current rates and subscription options.

Best practices

  • Image quality: Use well-lit, front-facing portrait photos with a clearly visible face for the most accurate lip sync
  • Audio clarity: Clean audio with minimal background noise produces better lip-sync accuracy
  • Resolution choice: Use 720p for final production videos and 480p for quick previews or prototyping
  • Production integration: Use webhooks instead of polling for scalable applications
  • Error handling: Implement retry logic with exponential backoff for 503 errors
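
The retry advice for 503 errors can be sketched as follows. This is one common way to implement exponential backoff with jitter, not a prescribed client behavior: `call_with_backoff` is a hypothetical helper, and the attempt count and delay values are illustrative defaults rather than numbers taken from this page.

```python
import random
import time
import urllib.error


def call_with_backoff(send, max_attempts=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep):
    """Call send(); on HTTP 503, wait exponentially longer and try again.

    `send` is any zero-argument callable that performs the request and raises
    urllib.error.HTTPError on failure (as urllib.request.urlopen does).
    """
    for attempt in range(max_attempts):
        try:
            return send()
        except urllib.error.HTTPError as err:
            # Only 503 is retried; the final attempt re-raises the error.
            if err.code != 503 or attempt == max_attempts - 1:
                raise
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Jitter spreads out retries from many clients hitting the same outage.
            sleep(delay + random.uniform(0, delay / 2))
```

The `sleep` argument is injectable so the wait can be replaced in tests. In production the default `time.sleep` applies, and a call might look like `call_with_backoff(lambda: urllib.request.urlopen(request))`.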

Related APIs

  • Veed Fabric 1.0 Fast: Faster lip-sync generation with optimized processing time
  • Latent Sync: Alternative lip synchronization technology
  • Voiceover: Generate speech audio from text to use as lip-sync input
  • Sound Effects: Generate sound effects from text descriptions