Video Demo
final_ovi_trailer.mp4
๐ Key Features
Ovi is a veo-3 like, video+audio generation model that simultaneously generates both video and audio content from text or text+image inputs.
๐ฌ Video+Audio Generation : Generate synchronized video and audio content simultaneously ๐ต High-Quality Audio Branch : We designed and pretrained our 5B audio branch from scratch using our high quality in-house audio datasets
: Generate synchronized video and audio content simultaneously ๐ Flexible Input : Supports text-only or text+image conditioning
: Supports text-only or text+image conditioning โฑ๏ธ 5-second Videos : Generates 5-second videos at 24 FPS, area of 720ร720, at various aspect ratios (9:16, 16:9, 1:1, etc) ๐ฏ High-Resolution Support : Feel free to try 960ร960 area (e.g., 720ร1280, 704ร1344, etc) - it could give outstanding results for both t2v and i2v! See examples below:
: Generates 5-second videos at 24 FPS, area of 720ร720, at various aspect ratios (9:16, 16:9, 1:1, etc) ๐ฌ Create videos now on wavespeed.ai : https://wavespeed.ai/models/character-ai/ovi/image-to-video & https://wavespeed.ai/models/character-ai/ovi/text-to-video
: https://wavespeed.ai/models/character-ai/ovi/image-to-video & https://wavespeed.ai/models/character-ai/ovi/text-to-video ๐ฌ Create videos now on HuggingFace : https://huggingface.co/spaces/akhaliq/Ovi
: https://huggingface.co/spaces/akhaliq/Ovi ๐ง ComfyUI Integration (WIP): ComfyUI support is now available via ComfyUI-WanVideoWrapper, related PR.
... continue reading