back to top
HomeSoftwareAI ToolsOvi AI Video + Audio Generator in ComfyUI — Best Open-Source Alternative...

Ovi AI Video + Audio Generator in ComfyUI — Best Open-Source Alternative to Veo 3 & Sora 2

- Advertisement -

File Information

PropertyDetails
NameComfyUI-Ovi
VersionLatest
PlatformWindows, Linux, macOS (via ComfyUI)
File TypeCustom Node Workflow
LicenseOpen Source (GitHub)
RepositoryComfyUI-Ovi
DependenciesPyTorch 2.4+, CUDA 12.x
VRAM Requirement16–24 GB (FP8) or >32 GB (BF16)
CategoryAI Video + Audio Generation Workflow

Description

Experience next-generation AI video and audio generation locally with Ovi in ComfyUI — the most powerful open-source workflow that rivals Google’s Veo 3 and OpenAI’s Sora 2.
With Ovi’s multimodal fusion engine and seamless integration into ComfyUI, you can create AI-generated videos with synchronized sound, all without depending on cloud services.

It’s inspired by Character.AI’s Ovi and integrates seamlessly into the ComfyUI node environment, offering a fully modular, GPU-accelerated, and privacy-friendly experience.

Think of it as a self-hosted alternative to proprietary systems like Veo 3 or Sora 2, giving you total creative freedom and zero cloud dependency.

Features of Ovi: Open Source Veo 3 & Sora 2 Alternative

FeatureDescription
Self-Bootstrapping LoaderAutomatically downloads and manages MMAudio assets and Ovi fusion weights.
Precision ControlChoose between BF16 (for 32 GB + GPUs) or FP8 (for 16–24 GB cards).
Attention SelectorSwitch dynamically between FlashAttention, SDPA, Sage, and more.
Multi-GPU OptimizationTargets specific GPUs in multi-card setups for faster inference.
Component ReuseReuses your existing Wan 2.2 VAE and UMT5 text encoder without duplication.
CPU Offload OptionMoves larger modules to RAM when VRAM is limited.
Automatic Directory SetupPlaces all required files (weights, encoders, VAEs) in proper directories automatically.
Fully Node-BasedIntegrated directly into ComfyUI as custom nodes, accessible under the “Ovi” category.
Fast & Flexible GenerationSupports text-to-video, iDirectory Structure
mage-to-video, video + audio fusion, and custom first-frame prompts.

Screenshots

Generation From Ovi AI Video + Audio Generator

System Requirements

ComponentMinimumRecommended
GPU16 GB (FP8 with offload)32 GB + (BF16 without offload)
CPU8-Core12 + Core
RAM32 GB64 GB + for large projects
Storage30 GB freeSSD preferred
CUDA12.x12.4 +
PyTorch2.4 +Latest Stable
OS SupportWindows, Linux, macOS (via ComfyUI)Windows/Linux preferred for CUDA acceleration

Directory Structure

ComfyUI/
├── models/
│   ├── diffusion_models/
│   │   ├── Ovi-11B-bf16.safetensors
│   │   └── Ovi-11B-fp8.safetensors
│   ├── text_encoders/umt5-xxl-enc-bf16.safetensors
│   └── vae/wan2.2_vae.safetensors
└── custom_nodes/ComfyUI-Ovi/ckpts/MMAudio/ext_weights/...

Available Ovi Nodes

NodeDescription
Ovi Engine LoaderDownloads missing weights, builds the fusion engine, and exposes OVI_ENGINE with selectable precision and device.
Ovi Wan Component LoaderConnects Ovi to existing Wan 2.2 VAE and UMT5 encoders.
Ovi Attention SelectorDynamically changes attention backend (FlashAttention, SDPA, etc.).
Ovi Video GeneratorGenerates AI-based video + audio latents from text prompts.
Ovi Latent DecoderConverts latents into viewable video + audio output.

How to Install Ovi Using ComfyUI

  1. Navigate to your ComfyUI custom nodes folder: cd ComfyUI/custom_nodes
  2. Clone the Ovi repository: git clone https://github.com/snicolast/ComfyUI-Ovi.git cd ComfyUI-Ovi
  3. Install dependencies: pip install -r requirements.txt
  4. Restart ComfyUI
    • Ovi nodes will now appear under the “Ovi” category in ComfyUI’s node search.

Workflow Example

  1. Drop Ovi Engine Loader — select your precision and enable CPU offload if needed.
  2. (Optional) Connect Ovi Wan Component Loader if your encoder/VAE is stored elsewhere.
  3. Add Attention Selector — pick FlashAttention, SDPA, or Auto.
  4. Generate Video — input your prompt (supports <S> speech and <AUDCAP> audio tags).
  5. Decode Latents — feed results into Ovi Latent Decoder for video + audio output.
  6. Export & Save — connect the outputs to your preferred save nodes in ComfyUI.

Troubleshooting & Tips

  • High VRAM after render: Use ComfyUI’s Unload Models button.
  • Missing weights: Place manually in the appropriate folders — loader will skip downloads if found.
  • Switching precision: Change in dropdown; no restart needed.
  • Backend errors: If FlashAttention/xFormers are missing, Ovi automatically falls back to native.

Why Ovi + ComfyUI is the Best Sora 2 & Veo 3 Alternative

Unlike closed-source AI video systems, ComfyUI-Ovi is:

  • 100 % open source and customizable
  • Runs completely offline
  • Uses existing ComfyUI assets (Wan 2.2, MMAudio)
  • Supports multi-GPU rendering
  • Lets you fine-tune, control precision, and select backend performance

Download Ovi AI Video Generator ComfyUI Workflow

Install Ovi AI Video + Audio Generator Best Veo 3 & Sora 2 alternative Directly

If you want to download and install Ovi AI Video Editor Diretly and run it using gradio interface then follow this Ovi Installation Guide, Enjoy!

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisment -
YOU MAY ALSO LIKE
OpenPencil (Design-as-Code) AI-Native Open-Source UI Editor with Prompt-to-UI & Code Generation

OpenPencil (Design-as-Code): AI-Native UI Editor with Prompt-to-UI & Code Generation

0
This OpenPencil feels like it was built by someone who got tired of dragging rectangles around. It doesn’t pretend to be another Figma clone. The whole idea is to describe the UI, and it builds it. You can prompt an entire landing page and watch it take shape on the canvas. Or highlight a few elements and say, "make this tighter, change the spacing, switch the theme." It can even use a screenshot as a reference and rebuild something similar. When the prompt gets complicated, it breaks the job into smaller chunks and handles them in parallel. It feels closer to working in a dev environment that happens to draw your interface as you go.
OpenPencil AI Design Editor Figma Alternative

OpenPencil: Open-Source AI Design Editor & Powerful Figma Alternative

0
OpenPencil is an open-source, AI-native design editor built as a practical alternative to Figma. It opens and exports real .fig files, supports copy-paste between apps, and runs fully on your machine. It’s built with AI as a first-class feature, not an afterthought. You can describe a layout in chat and have it generated directly in your design file. No plugins or vendor lock-in. It’s also fully local. No account required. Your design files stay on your system unless you choose to share them. OpenPencil is still evolving, so it’s better suited for experimentation and forward-looking workflows than critical production work. But if you care about ownership, automation, and long-term control over your design stack, it’s worth paying attention.
stability matrix ai model manager

Stability Matrix – Local AI Model Manager for Image, Video, TTS & Generative Workflows

0
Stability Matrix is a multi-platform package manager and unified launcher built for Stable Diffusion and related AI tools. Instead of manually installing different WebUI builds, setting up Python environments, and managing Git updates yourself, it brings everything into one organized interface. It supports popular environments like Automatic1111, ComfyUI, SD.Next, InvokeAI, Fooocus, and others including WAN GP and additional AI workflows depending on the selected package. You can install, update, and manage these tools from a single dashboard without jumping between folders and terminals. One of the biggest advantages is that Stability Matrix makes it easier to install AI models locally and use them directly on your system. With the built-in Model Browser, you can import models from sources like CivitAI and Hugging Face, automatically place them in the correct folders, and manage previews and metadata. This means you can run image generation, music generation, video generation & even TTS models on your own machine without complex manual setup.

Don’t miss any Tech Story

Subscribe To Firethering NewsLetter

You Can Unsubscribe Anytime! Read more in our privacy policy