Install this extension via the ComfyUI Manager by searching
for ComfyUI-AudioScheduler
1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-AudioScheduler in the search bar
After installation, click the Restart button to
restart ComfyUI. Then, manually
refresh your browser to clear the cache and access
the updated list of nodes.
Visit
ComfyUI Online
for ready-to-use ComfyUI environment
ComfyUI-AudioScheduler enables loading mp3 files to drive animations and prompt scheduling using audio nodes, designed for integration with FizzNodes.
ComfyUI-AudioScheduler Introduction
ComfyUI-AudioScheduler is an extension designed to help AI artists integrate audio data into their creative projects. This extension provides a set of tools (nodes) that allow you to load audio files, analyze their properties, and use this information to control various aspects of your AI-generated animations or visualizations. By leveraging audio data, you can create more dynamic and synchronized multimedia experiences.
Key Features:
Loading Audio Files: Supports MP3 and WAV formats.
Amplitude Analysis: Reads and visualizes the amplitude of audio over time.
Frequency Analysis: Performs Fast Fourier Transforms (FFTs) to analyze frequency components.
Graphical Visualization: Generates graphs to preview amplitude and frequency data.
Dynamic Text Prompts: Creates text prompts driven by audio amplitude changes.
How ComfyUI-AudioScheduler Works
ComfyUI-AudioScheduler works by breaking down audio files into their fundamental components and using this data to influence other media elements. Here’s a simplified explanation:
Loading Audio: The extension can load audio files in MP3 or WAV formats.
Analyzing Audio: It reads the amplitude (volume) and frequency (pitch) information from the audio.
Visualizing Data: The extension can create visual graphs of the amplitude and frequency data.
Controlling Media: This data can then be used to control animations, generate text prompts, or influence other visual elements in your project.
Imagine you have a piece of music and you want certain visual effects to sync with the beats or melody. ComfyUI-AudioScheduler can analyze the music and provide the necessary data to make this synchronization possible.
ComfyUI-AudioScheduler Features
LoadAudio
Input: List of audio file names (MP3, WAV).
Output: Loaded audio data.
Description: Loads audio files from a specified directory and prepares them for further analysis.
AudioToFFTs
Input: Loaded audio data, audio channel, frames per second.
Output: FFT data, total number of frames.
Description: Performs FFT on the audio data to extract frequency information. Useful for analyzing the pitch and tone of the audio.
AudioToAmplitudeGraph
Input: Loaded audio data, audio channel, lower and upper band range.
Output: Graph image of amplitude.
Description: Creates a visual graph of the amplitude within a specified frequency range. Helps in visualizing how the volume changes over time.
BatchAmplitudeSchedule
Input: FFT data, operation (avg, max, sum), lower and upper band range.
Output: Amplitude data.
Description: Calculates amplitude values from FFT data based on specified operations and frequency ranges. Useful for aggregating amplitude data.
ClipAmplitude
Input: Amplitude data, max and optional min amplitude.
Output: Clipped amplitude data.
Description: Clips amplitude values to a specified range. Ensures that the amplitude stays within desired limits.
TransientAmplitudeBasic
Input: Amplitude data, frames for attack, hold, and release.
Output: Adjusted amplitude data.
Description: Adjusts amplitude data with transient characteristics. Controls the attack, hold, and release behavior of the amplitude.
NormalizeAmplitude
Input: Amplitude data.
Output: Normalized amplitude data.
Description: Normalizes amplitude data, optionally inverting the values.
Video Tutorials on YouTube
These resources will help you get the most out of ComfyUI-AudioScheduler and connect with other AI artists who are using the extension in their projects.