Discover Flux and 10 versatile In-Context LoRA models for image generation.

Mochi Edit UnSampling | Video-to-Video

Mochi Edit: Modify Videos Using Text-Based Prompts and Unsampling.

IC-Light | Video Relighting | AnimateDiff

Relight your videos with light maps and prompts

Wan FusionX | T2V+I2V+VACE Complete

Most powerful video generation solution yet! Cinema-grade detail, your personal film studio.

ComfyUI > Nodes > ComfyUI-LatentSyncWrapper

ComfyUI Extension: ComfyUI-LatentSyncWrapper

Repo Name

ComfyUI-LatentSyncWrapper

Author
ShmuelRonen (Account age: 1490 days) Nodes
View all nodes(2) Latest Updated
2025-04-04 Github Stars
0.67K

Github Ask ShmuelRonen Current Questions Past Questions

Table of Content

Description
ComfyUI-LatentSyncWrapper Introduction
How ComfyUI-LatentSyncWrapper Works
ComfyUI-LatentSyncWrapper Features
ComfyUI-LatentSyncWrapper Models
Troubleshooting ComfyUI-LatentSyncWrapper
Learn More about ComfyUI-LatentSyncWrapper
Related Nodes

How to Install ComfyUI-LatentSyncWrapper

Install this extension via the ComfyUI Manager by searching for ComfyUI-LatentSyncWrapper

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-LatentSyncWrapper in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

ComfyUI-LatentSyncWrapper Description

ComfyUI-LatentSyncWrapper integrates ByteDance's LatentSync model into ComfyUI, enabling precise lip-syncing of video lips with audio input for enhanced synchronization.

ComfyUI-LatentSyncWrapper Introduction

ComfyUI-LatentSyncWrapper is an unofficial implementation of ByteDance's LatentSync model, designed to integrate seamlessly with ComfyUI on Windows. This extension provides advanced lip-sync capabilities, allowing you to synchronize the lip movements in a video with an audio input. This is particularly useful for AI artists who want to create realistic or stylized animations where the audio and visual elements are perfectly aligned. By using this extension, you can enhance your creative projects with precise audio-visual synchronization, solving common issues related to mismatched lip movements in video production.

How ComfyUI-LatentSyncWrapper Works

At its core, ComfyUI-LatentSyncWrapper leverages the LatentSync model, which is based on audio-conditioned latent diffusion models. This means it uses audio inputs to guide the generation of lip movements in video frames. The process involves converting audio into embeddings using the Whisper model, which are then used to influence the U-Net model's output through cross-attention layers. This approach allows the extension to model complex audio-visual correlations directly, without relying on intermediate motion representations. The result is a more consistent and accurate lip-sync, achieved by aligning generated frames with ground truth frames using Temporal REPresentation Alignment (TREPA).

ComfyUI-LatentSyncWrapper Features

Lip-Sync Node: The primary feature of this extension is the lip-sync node, which allows you to input a video and an audio file to generate a synchronized output. You can customize the synchronization by setting parameters such as the video path, audio input, and a random seed for reproducibility.
Video Length Adjuster Node: This complementary node helps manage the synchronization of video and audio lengths. It offers several modes:
Normal: Adds padding to video frames to prevent frame loss.
Pingpong: Creates a forward-backward loop of the video sequence.
Loop to Audio: Extends the video by repeating frames to match the audio duration.
Silent Padding: Adjusts video length to match longer audio durations. These features allow for flexible customization, enabling you to tailor the synchronization process to your specific needs.

ComfyUI-LatentSyncWrapper Models

The extension uses two main models:

LatentSync U-Net: This model is responsible for generating the synchronized lip movements based on the audio input.
Whisper Model: Used to convert audio into embeddings that guide the U-Net model. These models can be automatically downloaded from HuggingFace on the first run, or you can manually download them if needed. The choice of model can affect the quality and performance of the lip-sync, with different models offering varying levels of detail and accuracy.

Troubleshooting ComfyUI-LatentSyncWrapper

Here are some common issues and solutions:

mediapipe Installation Issues: Ensure you are using a compatible Python version (3.8-3.11). If you encounter errors, try installing mediapipe separately using pip install mediapipe>=0.10.8.
PYTHONPATH Errors: Make sure Python is added to your system PATH. Running ComfyUI as an administrator may also resolve these issues.
Video Compatibility: The extension works best with clear, frontal face videos at 25 FPS. Ensure the face is visible throughout the video for optimal results.

Learn More about ComfyUI-LatentSyncWrapper

To further explore the capabilities of ComfyUI-LatentSyncWrapper, you can visit the following resources:

LatentSync on GitHub for more technical details about the underlying model.
ComfyUI on GitHub to learn more about the ComfyUI platform and its features.
HuggingFace Repository for downloading model files and exploring additional documentation. These resources provide valuable insights and support for AI artists looking to enhance their projects with advanced lip-sync capabilities.

ComfyUI-LatentSyncWrapper Related Nodes

LatentSync Node

Video Length Adjuster

Table of Content

Description
ComfyUI-LatentSyncWrapper Introduction
How ComfyUI-LatentSyncWrapper Works
ComfyUI-LatentSyncWrapper Features
ComfyUI-LatentSyncWrapper Models
Troubleshooting ComfyUI-LatentSyncWrapper
Learn More about ComfyUI-LatentSyncWrapper
Related Nodes

Wan 2.1 FLF2V | First-Last Frame Video

Generate smooth videos from a start and end frame using Wan 2.1 FLF2V.

Self Forcing | Autoregressive Keyframe-to-Video Generation

SUPER FAST! 5-second video in 45 seconds!

ACE++ Character Consistency

Generate consistent images of your character across poses, angles, and styles from a single photo.

PuLID Flux II | Consistent Character Generation

Generate images with precise character control while preserving artistic style.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.