Install this extension via the ComfyUI Manager by searching for ComfyUI_EchoMimic:
1. Click the Manager button in the main menu
2. Click the Custom Nodes Manager button
3. Enter ComfyUI_EchoMimic in the search bar
After installation, click the Restart button to restart ComfyUI. Then manually refresh your browser to clear the cache and load the updated list of nodes.
Visit ComfyUI Online for a ready-to-use ComfyUI environment.
ComfyUI_EchoMimic enables lifelike audio-driven portrait animations in ComfyUI by utilizing editable landmark conditioning, allowing for realistic and dynamic facial movements synchronized with audio input.
ComfyUI_EchoMimic Introduction
ComfyUI_EchoMimic is an extension designed to bring lifelike, audio-driven portrait animations to your AI art projects. By leveraging advanced landmark conditioning, this tool allows you to create realistic animations from audio inputs. Whether you're an AI artist looking to add dynamic elements to your portraits or a developer seeking to integrate sophisticated animation capabilities into your projects, ComfyUI_EchoMimic offers a powerful and user-friendly solution.
How ComfyUI_EchoMimic Works
ComfyUI_EchoMimic operates by analyzing audio inputs and translating them into corresponding facial movements. This is achieved through a series of models that process the audio, identify key landmarks on the face, and generate animations that mimic natural expressions and movements. Think of it as a digital puppeteer that brings static images to life by synchronizing them with audio cues.
Basic Principles:
Audio Processing: The audio input is first processed to extract relevant features.
Landmark Detection: Key facial landmarks are identified and tracked.
Animation Generation: The detected landmarks are used to animate the portrait, creating a lifelike representation that moves in sync with the audio.
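The three stages above can be illustrated with a deliberately simple toy. This is not how ComfyUI_EchoMimic works internally (the extension uses the learned models listed later on this page). As a stand-in, the sketch assumes the audio feature is per-frame loudness and the only "landmark" is a single mouth-opening value between 0 and 1. All function names and the 25 fps frame rate are hypothetical.

```python
import math


def frame_energies(samples, sample_rate, fps):
    """Audio processing (toy): one RMS loudness value per video frame."""
    window = max(1, sample_rate // fps)  # audio samples covered by one frame
    energies = []
    for start in range(0, len(samples), window):
        chunk = samples[start:start + window]
        energies.append(math.sqrt(sum(s * s for s in chunk) / len(chunk)))
    return energies


def mouth_openings(energies):
    """Landmark step (toy): scale loudness to a mouth opening in [0, 1]."""
    peak = max(energies, default=0.0)
    if peak == 0:
        return [0.0 for _ in energies]
    return [e / peak for e in energies]


# Synthetic input: half a second of silence followed by a 220 Hz tone.
sample_rate, fps = 16000, 25
samples = [0.0] * 8000 + [
    math.sin(2 * math.pi * 220 * t / sample_rate) for t in range(8000)
]

# Animation step (toy): one opening value per frame, ready to drive a drawing.
openings = mouth_openings(frame_energies(samples, sample_rate, fps))
print(len(openings), "frames; first:", openings[0], "last:", round(openings[-1], 2))
```

The silent frames map to a closed mouth and the tone frames to an open one, which is the basic idea of keeping the picture in step with the sound.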
ComfyUI_EchoMimic Features
Audio-Driven Animation
Infer Mode: Choose between "audio_drived" and "audio_drived_acc" to generate animations driven by audio inputs.
Motion Sync: Synchronize animations with video files to create seamless, realistic movements.
Pose-Driven Animation
Pose Mode: Generate animations based on pre-defined pose models ("pose_normal" and "pose_acc").
Face Crop Support: Enhance animations by focusing on specific facial regions.
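The four mode names quoted above follow a simple pattern, which a script that prepares workflows could use to validate a setting before queuing it. The helper below is a hypothetical sketch, not part of the extension. It assumes the "_acc" suffix selects the accelerated models and that the prefix tells you whether audio or pose drives the animation.

```python
# Mode strings exactly as they are quoted on this page.
INFER_MODES = ("audio_drived", "audio_drived_acc", "pose_normal", "pose_acc")


def describe_infer_mode(mode):
    """Return what drives the animation and whether the mode is accelerated."""
    if mode not in INFER_MODES:
        raise ValueError(f"unknown infer mode {mode!r}; expected one of {INFER_MODES}")
    return {
        "driver": "audio" if mode.startswith("audio") else "pose",
        "accelerated": mode.endswith("_acc"),  # assumption: _acc = accelerated
    }


for mode in INFER_MODES:
    print(mode, describe_infer_mode(mode))
```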
Customization Options
Save Video: Option to save the generated animations as video files.
Draw Mouse: Experiment with different animation styles.
Length: Control the duration of the animation by setting the number of frames.
Low VRAM Mode: Optimize performance for systems with limited video memory.
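Because Length is expressed in frames, it helps to convert between frames and seconds when matching an animation to an audio clip. The arithmetic below is a small sketch. The frame rate is an assumed value, so use whatever your workflow actually outputs.

```python
import math


def frames_for_duration(seconds, fps):
    """Frames needed to cover `seconds` of audio, rounded up."""
    return math.ceil(seconds * fps)


def duration_for_length(length, fps):
    """Seconds of animation produced by `length` frames."""
    return length / fps


fps = 25  # assumed frame rate, for illustration only
print(frames_for_duration(4.0, fps))  # frames needed for a 4-second clip
print(duration_for_length(120, fps))  # seconds produced by 120 frames
```

Rounding up ensures the last fraction of a second of audio still gets a frame.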
ComfyUI_EchoMimic Models
ComfyUI_EchoMimic utilizes several models to achieve its functionality. Each model serves a specific purpose in the animation pipeline:
Denoising Unet: Used for refining the generated animations.
Reference Unet: Helps in maintaining consistency across frames.
Motion Module: Handles the movement dynamics based on audio inputs.
Face Locator: Identifies and tracks facial landmarks.
Audio Processor (Whisper): Processes the audio input to extract features.
Model Variants:
Standard Models: Suitable for general use cases.
Accelerated Models: Optimized for faster performance, ideal for high-demand scenarios.
What's New with ComfyUI_EchoMimic
Latest Updates:
Low VRAM Mode: Added for GPUs with 6 GB or 8 GB of video memory. Note that this mode is slower and may consume more system memory (RAM).
Improved Model Loading: VAE models are now loaded from a specific directory to reduce loading times and improve compatibility.
Bug Fixes: Resolved issues related to batch image input errors and motion synchronization.
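To decide whether Low VRAM Mode is worth enabling, you can check how much video memory your GPU reports. The sketch below uses PyTorch if it is installed and returns None otherwise. The 8 GB threshold is an assumption taken from the 6 GB and 8 GB figures above, not a rule enforced by the extension, and both function names are hypothetical.

```python
def detect_vram_gb():
    """Return total memory of GPU 0 in GiB, or None if it cannot be determined."""
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    total_bytes = torch.cuda.get_device_properties(0).total_memory
    return total_bytes / (1024 ** 3)


def should_enable_low_vram(vram_gb, threshold_gb=8.0):
    """Suggest Low VRAM Mode for cards at or below the assumed threshold."""
    if vram_gb is None:
        return False  # unknown hardware: leave the setting alone
    return vram_gb <= threshold_gb


vram = detect_vram_gb()
print("Detected VRAM (GiB):", vram)
print("Suggest Low VRAM Mode:", should_enable_low_vram(vram))
```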
Previous Updates:
Audio ACC Model Support: Added support for audio ACC models and for face crop in pose mode.
Unified Audio Output: Standardized audio output format for better integration with other tools.
Troubleshooting ComfyUI_EchoMimic
Common Issues and Solutions:
Slow Performance in Low VRAM Mode:
Solution: Ensure that your system meets the minimum requirements and try reducing the animation length.
Model Loading Errors:
Solution: Verify that all model files are placed in the correct directories as specified in the documentation.
Audio Sync Issues:
Solution: Check the audio file format and ensure it is compatible with the extension.
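For model loading errors, a quick script can report which files are missing from a folder. This is a minimal sketch: the file names below are placeholders based on the model list on this page, and the folder is whatever the extension's documentation specifies. Replace both with the real values before relying on the result.

```python
from pathlib import Path
import tempfile

# Placeholder names: substitute the files listed in the extension's documentation.
EXPECTED_FILES = [
    "denoising_unet.pth",
    "reference_unet.pth",
    "motion_module.pth",
    "face_locator.pth",
]


def missing_model_files(model_dir, expected=EXPECTED_FILES):
    """Return the expected file names that are not present in model_dir."""
    root = Path(model_dir)
    return [name for name in expected if not (root / name).is_file()]


# Demonstration against a temporary folder containing only one of the files.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "denoising_unet.pth").touch()
    print("Missing:", missing_model_files(tmp))
```

An empty list means every expected file was found in that folder.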
Frequently Asked Questions:
Q: Can I use my own audio files?
A: Yes, you can use any audio file as long as it is in a supported format.
Q: How do I improve animation quality?
A: Use high-quality audio inputs and ensure that the reference images are clear and well-lit.
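Before loading your own audio, it can help to inspect the file. This page does not list the supported formats, so the sketch below assumes an uncompressed WAV file and uses only Python's standard wave module. The helper names are hypothetical, and the silent clip exists only to make the example self-contained.

```python
import os
import tempfile
import wave


def describe_wav(path):
    """Read basic properties of a WAV file; raises wave.Error if it is not one."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate": rate,
            "sample_width_bytes": wav.getsampwidth(),
            "duration_seconds": wav.getnframes() / rate,
        }


def write_silent_wav(path, seconds=1, sample_rate=16000):
    """Write a mono, 16-bit silent clip for demonstration purposes."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * (seconds * sample_rate))


with tempfile.TemporaryDirectory() as tmp:
    clip = os.path.join(tmp, "clip.wav")
    write_silent_wav(clip)
    print(describe_wav(clip))
```

The reported duration can be compared with the Length setting to check that the animation covers the whole clip.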
Learn More about ComfyUI_EchoMimic
For additional resources, tutorials, and community support, visit the following links:
ModelScope Model
Explore these resources to get the most out of ComfyUI_EchoMimic and join the community of AI artists pushing the boundaries of digital animation.