FLUX Img2Img | Merge Visuals and Prompts

Merge visuals and prompts for stunning, enhanced results.

Wonder3D | ComfyUI 3D Pack

Generate multi-view normal maps and color images for 3D assets.

Hunyuan Image to Video | Breathtaking Motion Creator

Create magnificent movies out of still images through cinematic motion and customizable effects.

Flux Fill | Inpaint and Outpaint

Official Flux Tools - Flux Fill for Inpainting and Outpainting

ComfyUI > Nodes > ComfyUI_EchoMimic

ComfyUI Extension: ComfyUI_EchoMimic

Repo Name

ComfyUI_EchoMimic

Author
smthemex (Account age: 639 days) Nodes
View all nodes(2) Latest Updated
2025-03-09 Github Stars
0.6K

Github Ask smthemex Current Questions Past Questions

Table of Content

Description
How ComfyUI_EchoMimic Works
ComfyUI_EchoMimic Features
ComfyUI_EchoMimic Models
What's New with ComfyUI_EchoMimic
Troubleshooting ComfyUI_EchoMimic
Learn More about ComfyUI_EchoMimic
Related Nodes

How to Install ComfyUI_EchoMimic

Install this extension via the ComfyUI Manager by searching for ComfyUI_EchoMimic

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI_EchoMimic in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

ComfyUI_EchoMimic Description

ComfyUI_EchoMimic enables lifelike audio-driven portrait animations in ComfyUI by utilizing editable landmark conditioning, allowing for realistic and dynamic facial movements synchronized with audio input.

ComfyUI_EchoMimic Introduction

ComfyUI_EchoMimic is an extension designed to bring lifelike, audio-driven portrait animations to your AI art projects. By leveraging advanced landmark conditioning, this tool allows you to create realistic animations from audio inputs. Whether you're an AI artist looking to add dynamic elements to your portraits or a developer seeking to integrate sophisticated animation capabilities into your projects, ComfyUI_EchoMimic offers a powerful and user-friendly solution.

How ComfyUI_EchoMimic Works

ComfyUI_EchoMimic operates by analyzing audio inputs and translating them into corresponding facial movements. This is achieved through a series of models that process the audio, identify key landmarks on the face, and generate animations that mimic natural expressions and movements. Think of it as a digital puppeteer that brings static images to life by synchronizing them with audio cues.

Basic Principles:

Audio Processing: The audio input is first processed to extract relevant features.
Landmark Detection: Key facial landmarks are identified and tracked.
Animation Generation: The detected landmarks are used to animate the portrait, creating a lifelike representation that moves in sync with the audio.

ComfyUI_EchoMimic Features

Audio-Driven Animation

Infer Mode: Choose between "audio_drived" and "audio_drived_acc" to generate animations driven by audio inputs.
Motion Sync: Synchronize animations with video files to create seamless, realistic movements.

Pose-Driven Animation

Pose Mode: Generate animations based on pre-defined pose models ("pose_normal" and "pose_acc").
Face Crop Support: Enhance animations by focusing on specific facial regions.

Customization Options

Save Video: Option to save the generated animations as video files.
Draw Mouse: Experiment with different animation styles.
Length: Control the duration of the animation by setting the number of frames.
Low VRAM Mode: Optimize performance for systems with limited video memory.

ComfyUI_EchoMimic Models

ComfyUI_EchoMimic utilizes several models to achieve its functionality. Each model serves a specific purpose in the animation pipeline:

Denoising Unet: Used for refining the generated animations.
Reference Unet: Helps in maintaining consistency across frames.
Motion Module: Handles the movement dynamics based on audio inputs.
Face Locator: Identifies and tracks facial landmarks.
Audio Processor (Whisper): Processes the audio input to extract features.

Model Variants:

Standard Models: Suitable for general use cases.
Accelerated Models: Optimized for faster performance, ideal for high-demand scenarios.

What's New with ComfyUI_EchoMimic

Latest Updates:

Low VRAM Mode: Added to support users with 6GB or 8GB video memory. Note that this mode may be slower and consume more memory.
Improved Model Loading: VAE models are now loaded from a specific directory to reduce loading times and improve compatibility.
Bug Fixes: Resolved issues related to batch image input errors and motion synchronization.

Previous Updates:

Audio ACC Model Support: Added support for audio ACC models and face crop for pose.
Unified Audio Output: Standardized audio output format for better integration with other tools.

Troubleshooting ComfyUI_EchoMimic

Common Issues and Solutions:

Slow Performance in Low VRAM Mode:

Solution: Ensure that your system meets the minimum requirements and try reducing the animation length.

Model Loading Errors:

Solution: Verify that all model files are placed in the correct directories as specified in the documentation.

Audio Sync Issues:

Solution: Check the audio file format and ensure it is compatible with the extension.

Frequently Asked Questions:

Q: Can I use my own audio files?
A: Yes, you can use any audio file as long as it is in a supported format.
Q: How do I improve animation quality?
A: Use high-quality audio inputs and ensure that the reference images are clear and well-lit.

Learn More about ComfyUI_EchoMimic

For additional resources, tutorials, and community support, visit the following links:

EchoMimic GitHub Repository
ComfyUI_EchoMimic Node
HuggingFace Model
ModelScope Model Explore these resources to get the most out of ComfyUI_EchoMimic and join the community of AI artists pushing the boundaries of digital animation.

ComfyUI_EchoMimic Related Nodes

Echo_LoadModel

Echo_Sampler

Table of Content

Description
How ComfyUI_EchoMimic Works
ComfyUI_EchoMimic Features
ComfyUI_EchoMimic Models
What's New with ComfyUI_EchoMimic
Troubleshooting ComfyUI_EchoMimic
Learn More about ComfyUI_EchoMimic
Related Nodes

DreamO | Unified Multi-Task Image Customization Framework

Perform identity, style, try-on, and multi-condition image generation from 1–3 references

FLUX IPAdapter V2 | XLabs

Explore XLabs FLUX IPAdapter V2 model compared to V1 for your creative goals.

Hunyuan Video | Text to Video

Generates videos from text prompts.

Nvidia Cosmos | Text & Image to Video Creation

Generate videos from text prompts or create frame interpolation between two images with Nvidia's Cosmos.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.