Ovi AI Video + Audio Generator in ComfyUI — Best Open-Source Alternative to Veo 3 & Sora 2

File Information

| Property | Details |
|---|---|
| Name | ComfyUI-Ovi |
| Version | Latest |
| Platform | Windows, Linux, macOS (via ComfyUI) |
| File Type | Custom Node Workflow |
| License | Open Source (GitHub) |
| Repository | ComfyUI-Ovi |
| Dependencies | PyTorch 2.4+, CUDA 12.x |
| VRAM Requirement | 16–24 GB (FP8) or >32 GB (BF16) |
| Category | AI Video + Audio Generation Workflow |

Description

Ovi in ComfyUI brings AI video and audio generation to your own machine, as an open-source workflow positioned as an alternative to Google’s Veo 3 and OpenAI’s Sora 2. With Ovi’s multimodal fusion engine you can create AI-generated videos with synchronized sound, all without depending on cloud services.

The ComfyUI-Ovi node pack is inspired by Character.AI’s Ovi and plugs into the ComfyUI node environment, so the pipeline stays modular, GPU-accelerated, and privacy-friendly.

Think of it as a self-hosted alternative to proprietary systems like Veo 3 or Sora 2: generation runs locally, with no cloud dependency.

Features of Ovi: Open Source Veo 3 & Sora 2 Alternative

| Feature | Description |
|---|---|
| Self-Bootstrapping Loader | Automatically downloads and manages MMAudio assets and Ovi fusion weights. |
| Precision Control | Choose between BF16 (for 32 GB+ GPUs) or FP8 (for 16–24 GB cards). |
| Attention Selector | Switch dynamically between FlashAttention, SDPA, Sage, and more. |
| Multi-GPU Optimization | Targets specific GPUs in multi-card setups for faster inference. |
| Component Reuse | Reuses your existing Wan 2.2 VAE and UMT5 text encoder without duplication. |
| CPU Offload Option | Moves larger modules to RAM when VRAM is limited. |
| Automatic Directory Setup | Places all required files (weights, encoders, VAEs) in proper directories automatically. |
| Fully Node-Based | Integrated directly into ComfyUI as custom nodes, accessible under the “Ovi” category. |
| Fast & Flexible Generation | Supports text-to-video, image-to-video, video + audio fusion, and custom first-frame prompts. |

Screenshots

[Screenshot: generation from Ovi AI Video + Audio Generator]

System Requirements

| Component | Minimum | Recommended |
|---|---|---|
| GPU | 16 GB (FP8 with offload) | 32 GB+ (BF16 without offload) |
| CPU | 8-core | 12+ cores |
| RAM | 32 GB | 64 GB+ for large projects |
| Storage | 30 GB free | SSD preferred |
| CUDA | 12.x | 12.4+ |
| PyTorch | 2.4+ | Latest stable |
| OS Support | Windows, Linux, macOS (via ComfyUI) | Windows/Linux preferred for CUDA acceleration |
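
The GPU row above can be read as a simple decision rule. The sketch below is one way to encode it, not part of ComfyUI-Ovi: the function name `suggest_precision` is hypothetical, and the 24 GB cut-off for turning CPU offload on is an assumption (the table only says that 16 GB needs FP8 with offload and 32 GB+ can run BF16 without it).

```python
def suggest_precision(vram_gb: float) -> tuple[str, bool]:
    """Return (precision, cpu_offload) for a GPU with `vram_gb` of memory.

    Thresholds follow the requirements table; the 24 GB boundary for
    disabling offload is an assumption, not a documented limit.
    """
    if vram_gb >= 32:
        return ("bf16", False)  # recommended tier: BF16 without offload
    if vram_gb >= 24:
        return ("fp8", False)   # assumed: upper end of the FP8 range fits without offload
    if vram_gb >= 16:
        return ("fp8", True)    # minimum tier: FP8 with CPU offload
    raise ValueError(f"{vram_gb} GB is below the 16 GB minimum listed above")


if __name__ == "__main__":
    for size in (16, 24, 48):
        print(size, "GB ->", suggest_precision(size))
```

Treat the result as a starting point for the Ovi Engine Loader settings; actual headroom depends on resolution, clip length, and whatever else is loaded in ComfyUI.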

Directory Structure

ComfyUI/
├── models/
│   ├── diffusion_models/
│   │   ├── Ovi-11B-bf16.safetensors
│   │   └── Ovi-11B-fp8.safetensors
│   ├── text_encoders/umt5-xxl-enc-bf16.safetensors
│   └── vae/wan2.2_vae.safetensors
└── custom_nodes/ComfyUI-Ovi/ckpts/MMAudio/ext_weights/...
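
If you place weights manually, a quick check against the layout above can save a failed load. The following is a minimal sketch that only uses the file names shown in the tree; the helper name `find_missing` is hypothetical, the assumption that only the fusion weights for the chosen precision are needed is not confirmed by the article, and the MMAudio files are left out because their path is truncated in the listing.

```python
from pathlib import Path

# Paths relative to the ComfyUI root, copied from the directory tree above.
SHARED_FILES = {
    "UMT5 text encoder": "models/text_encoders/umt5-xxl-enc-bf16.safetensors",
    "Wan 2.2 VAE": "models/vae/wan2.2_vae.safetensors",
}
FUSION_WEIGHTS = {
    "bf16": "models/diffusion_models/Ovi-11B-bf16.safetensors",
    "fp8": "models/diffusion_models/Ovi-11B-fp8.safetensors",
}


def find_missing(comfy_root, precision="fp8"):
    """Return labels of expected files that are absent under `comfy_root`."""
    root = Path(comfy_root)
    wanted = {
        f"Ovi fusion weights ({precision})": FUSION_WEIGHTS[precision],
        **SHARED_FILES,
    }
    return [label for label, rel in wanted.items() if not (root / rel).is_file()]


if __name__ == "__main__":
    for label in find_missing("ComfyUI", precision="fp8"):
        print("missing:", label)
```

An empty result means the files for that precision are in place, so the loader should have nothing left to download for them.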

Available Ovi Nodes

| Node | Description |
|---|---|
| Ovi Engine Loader | Downloads missing weights, builds the fusion engine, and exposes OVI_ENGINE with selectable precision and device. |
| Ovi Wan Component Loader | Connects Ovi to existing Wan 2.2 VAE and UMT5 encoders. |
| Ovi Attention Selector | Dynamically changes attention backend (FlashAttention, SDPA, etc.). |
| Ovi Video Generator | Generates AI-based video + audio latents from text prompts. |
| Ovi Latent Decoder | Converts latents into viewable video + audio output. |

How to Install Ovi Using ComfyUI

  1. Navigate to your ComfyUI custom nodes folder:
     cd ComfyUI/custom_nodes
  2. Clone the Ovi repository and enter it:
     git clone https://github.com/snicolast/ComfyUI-Ovi.git
     cd ComfyUI-Ovi
  3. Install dependencies:
     pip install -r requirements.txt
  4. Restart ComfyUI.
    • Ovi nodes will now appear under the “Ovi” category in ComfyUI’s node search.
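
Before restarting, it can help to confirm that the Python environment ComfyUI uses meets the PyTorch 2.4+ requirement. This is a small sketch rather than anything shipped with ComfyUI-Ovi: `meets_minimum` is a hypothetical helper that compares only the major and minor version numbers, and the script must be run with the same interpreter that launches ComfyUI.

```python
import re


def meets_minimum(version: str, minimum: tuple[int, int] = (2, 4)) -> bool:
    """True if a version string such as '2.4.1+cu124' is at least `minimum`."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return (int(match.group(1)), int(match.group(2))) >= minimum


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed in this environment")
    else:
        print("PyTorch", torch.__version__, "meets 2.4+:", meets_minimum(torch.__version__))
        # torch.version.cuda is None on CPU-only builds.
        print("CUDA build:", torch.version.cuda, "| GPU available:", torch.cuda.is_available())
```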

Workflow Example

  1. Add the Ovi Engine Loader: select your precision and enable CPU offload if needed.
  2. (Optional) Connect the Ovi Wan Component Loader if your encoder/VAE is stored elsewhere.
  3. Add the Ovi Attention Selector: pick FlashAttention, SDPA, or Auto.
  4. Add the Ovi Video Generator: enter your prompt (supports <S> speech and <AUDCAP> audio tags).
  5. Decode the latents: feed the results into the Ovi Latent Decoder for video + audio output.
  6. Export and save: connect the outputs to your preferred save nodes in ComfyUI.
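
The prompt in step 4 mixes a scene description with tagged speech and audio captions. The helper below is a rough sketch of assembling such a prompt; `build_prompt` is a hypothetical function, and the closing tags `<E>` and `<ENDAUDCAP>` come from the upstream Ovi prompt convention rather than from this article, so check them against the repository README before relying on them.

```python
from typing import Optional, Sequence


def build_prompt(
    scene: str,
    speech: Sequence[str] = (),
    audio_caption: Optional[str] = None,
) -> str:
    """Join a scene description with tagged speech lines and an audio caption."""
    parts = [scene.strip()]
    for line in speech:
        # Assumed format: spoken text wrapped as <S>...<E>.
        parts.append(f"<S>{line.strip()}<E>")
    if audio_caption:
        # Assumed format: background audio described as <AUDCAP>...<ENDAUDCAP>.
        parts.append(f"<AUDCAP>{audio_caption.strip()}<ENDAUDCAP>")
    return " ".join(parts)


if __name__ == "__main__":
    print(build_prompt(
        "A chef plates a dish in a busy kitchen.",
        speech=["Dinner is served."],
        audio_caption="Clattering pans and low chatter",
    ))
```

The resulting string goes into the prompt field of the Ovi Video Generator node; appending the speech after the scene is a simplification, since speech tags can also be placed inline in the description.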

Troubleshooting & Tips

  • High VRAM after render: Use ComfyUI’s Unload Models button to free memory.
  • Missing weights: Place the files manually in the appropriate folders; the loader skips the download for any file it finds.
  • Switching precision: Change it in the loader’s dropdown; no restart is needed.
  • Backend errors: If FlashAttention or xFormers is missing, Ovi automatically falls back to the native attention backend.
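
The fallback in the last tip follows a common pattern: try the optional attention packages in order and settle on what is installed. The sketch below illustrates that pattern only and is not Ovi's actual selection code. The preference order and backend labels are hypothetical, the module names are the usual import names of those packages, and "native" is assumed to mean PyTorch's built-in scaled dot-product attention (SDPA).

```python
import importlib.util
from typing import Callable, Sequence, Tuple

# (backend label, import name) in a hypothetical order of preference.
PREFERRED: Sequence[Tuple[str, str]] = (
    ("flash_attention", "flash_attn"),
    ("sage", "sageattention"),
    ("xformers", "xformers"),
)


def module_installed(name: str) -> bool:
    """True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(name) is not None


def pick_backend(
    preferred: Sequence[Tuple[str, str]] = PREFERRED,
    is_installed: Callable[[str], bool] = module_installed,
) -> str:
    """Return the first installed backend, or 'sdpa' as the native fallback."""
    for backend, module in preferred:
        if is_installed(module):
            return backend
    return "sdpa"  # assumed native path: PyTorch's scaled_dot_product_attention


if __name__ == "__main__":
    print("attention backend:", pick_backend())
```

Passing `is_installed` as a parameter keeps the selection logic testable on machines where none of the optional packages are present.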

Why Ovi + ComfyUI is the Best Sora 2 & Veo 3 Alternative

Unlike closed-source AI video systems, ComfyUI-Ovi:

  • Is 100% open source and customizable
  • Runs completely offline once the weights are downloaded
  • Reuses existing ComfyUI assets (Wan 2.2, MMAudio)
  • Supports multi-GPU setups
  • Lets you control precision and choose the attention backend

Install Ovi AI Video + Audio Generator Directly

If you would rather install Ovi directly and run it through a Gradio interface, follow the Ovi Installation Guide.
