Veed Fabric 1.0 integration
Powered by Veed Fabric 1.0 technology, this API generates realistic talking head videos by synchronizing lip movements to audio input.
Key capabilities
- Image + audio input: Combine a portrait photo with an audio file (MP3, WAV, M4A) to generate a talking video
- Resolution options: Output at
720p(1280x720) for production use or480p(854x480) for drafts and faster processing - Natural lip sync: AI-driven mouth movement generation that matches speech patterns accurately
- MP4 video output: Generates a downloadable MP4 video file with the animated portrait
- Async processing: Webhook notifications or polling for task completion
Use cases
- Marketing videos: Create personalized spokesperson videos from a single photo
- E-learning: Animate instructors for training materials and educational content
- Social media: Generate talking head content at scale for multiple platforms
- Customer support: Build video responses with consistent brand representatives
- Localization: Produce dubbed videos with lip-synced speech in different languages
Generate talking video with Veed Fabric 1.0
Submit a portrait image URL and audio file URL along with the desired resolution. The service returns a task ID for async polling or webhook notification.POST /v1/ai/lip-sync/veed-fabric-1-0
Create a new lip-sync video generation task
GET /v1/ai/lip-sync/veed-fabric-1-0
List all Veed Fabric 1.0 tasks
GET /v1/ai/lip-sync/veed-fabric-1-0/{task-id}
Get task status and results by ID
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
image_url | string | Yes | - | URL of the portrait image to animate. Must be publicly accessible. Use a clear, front-facing photo for best results |
audio_url | string | Yes | - | URL of the audio file for lip synchronization. Supported formats: MP3, WAV, M4A. Must be publicly accessible |
resolution | string | Yes | - | Output video resolution: "720p" (1280x720) or "480p" (854x480) |
webhook_url | string | No | - | HTTPS URL for task completion notification |
Frequently Asked Questions
What is Veed Fabric 1.0 and how does it work?
What is Veed Fabric 1.0 and how does it work?
Veed Fabric 1.0 is an AI lip-sync API that generates realistic talking videos from a static portrait image and an audio file. You submit an image URL and audio URL via the API, receive a task ID immediately, then poll for results or receive a webhook notification when processing completes. The output is an MP4 video where the person in the image speaks in sync with the provided audio.
What image formats does Veed Fabric 1.0 accept?
What image formats does Veed Fabric 1.0 accept?
Veed Fabric 1.0 accepts common image formats via a publicly accessible URL. For best lip-sync results, use a clear, front-facing portrait photo where the face is fully visible and well-lit.
What audio formats are supported?
What audio formats are supported?
Veed Fabric 1.0 supports MP3, WAV, and M4A audio formats. The audio file must be hosted at a publicly accessible URL. The model analyzes speech patterns in the audio to generate matching lip movements.
What output resolutions are available?
What output resolutions are available?
Two resolutions are available:
720p (1280x720) for production-quality output and 480p (854x480) for drafts or faster processing. The resolution is set via the required resolution parameter.What is the difference between Veed Fabric 1.0 and Veed Fabric 1.0 Fast?
What is the difference between Veed Fabric 1.0 and Veed Fabric 1.0 Fast?
Veed Fabric 1.0 prioritizes generation quality with higher fidelity lip synchronization. Veed Fabric 1.0 Fast is optimized for reduced generation time while maintaining quality lip-sync results. Choose Veed Fabric 1.0 when output quality is the priority, or Veed Fabric 1.0 Fast when speed is more important.
What are the rate limits for Veed Fabric 1.0?
What are the rate limits for Veed Fabric 1.0?
Rate limits vary by subscription tier. See Rate Limits for current limits.
How much does Veed Fabric 1.0 cost?
How much does Veed Fabric 1.0 cost?
See the Pricing page for current rates and subscription options.
Best practices
- Image quality: Use well-lit, front-facing portrait photos with a clearly visible face for the most accurate lip sync
- Audio clarity: Clean audio with minimal background noise produces better lip-sync accuracy
- Resolution choice: Use
720pfor final production videos and480pfor quick previews or prototyping - Production integration: Use webhooks instead of polling for scalable applications
- Error handling: Implement retry logic with exponential backoff for 503 errors
Related APIs
- Veed Fabric 1.0 Fast: Faster lip-sync generation with optimized processing time
- Latent Sync: Alternative lip synchronization technology
- Voiceover: Generate speech audio from text to use as lip-sync input
- Sound Effects: Generate sound effects from text descriptions