LivePortrait | Animate Portraits | Img2Vid

Animate portraits with facial expressions and motion using a single image and reference video.

Flux Consistent Characters | Input Image

Create consistent characters and ensure they look uniform using your images.

Era3D | ComfyUI 3D Pack

Generate 3D content, from multi-view images to detailed meshes.

Wan 2.1 Video Restyle | Consistent Video Style Transform

Transform your video style by applying the restyled first frame using Wan 2.1 video restyle workflow.

ComfyUI > Nodes > MBM's Music Visualizer > Audio Feature Calculator

ComfyUI Node: Audio Feature Calculator

Class Name

mbmAudioFeatureCalculator

Category
MBMnodes/Audio

Author
Sorcerio (Account age: 4532days) Extension
MBM's Music Visualizer Latest Updated
2024-05-23 Github Stars
0.02K

Github Ask Sorcerio Current Questions Past Questions

Table of Content

Description
mbmAudioFeatureCalculator:
mbmAudioFeatureCalculator Input Parameters:
mbmAudioFeatureCalculator Output Parameters:
mbmAudioFeatureCalculator Usage Tips:
mbmAudioFeatureCalculator Common Errors and Solutions:
Related Nodes

How to Install MBM's Music Visualizer

Install this extension via the ComfyUI Manager by searching for MBM's Music Visualizer

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter MBM's Music Visualizer in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Audio Feature Calculator Description

Extracts audio features for visualization and analysis, leveraging advanced processing techniques for tempo, spectrogram, and chroma.

Audio Feature Calculator:

The mbmAudioFeatureCalculator is a powerful node designed to extract and calculate relevant audio features from loaded audio files, making it an essential tool for music visualization and audio analysis. This node leverages advanced audio processing techniques to analyze the audio's tempo, spectrogram, and chroma features, among others, to generate a comprehensive set of feature modifiers. These modifiers can be used to create dynamic visualizations or further audio analysis. The node's primary goal is to transform audio data into meaningful visual or analytical outputs, providing a bridge between audio content and visual representation. By utilizing this node, you can gain insights into the audio's structure and dynamics, which can be particularly beneficial for AI artists looking to create music-driven visual art.

Audio Feature Calculator Input Parameters:

audio

The audio parameter is a tuple that contains the audio data and its sample rate. This is the primary input for the node, as it provides the raw audio information that will be analyzed to extract features.

intensity

The intensity parameter is a float that acts as a multiplier for the audio features, allowing you to increase or decrease the overall effect of the audio features. The default value is 1.0, meaning no change, but you can adjust it to amplify or diminish the impact of the features.

hop_length

The hop_length parameter is an integer that determines the number of audio samples between successive frames. It affects the temporal resolution of the analysis, with a default value of 512. Adjusting this value can influence the granularity of the feature extraction.

fps_target

The fps_target parameter is a float that specifies the desired frames per second for the output. It has a default value of 6, with a range from -1 to 10000. A value of <= 0 will use the audio's natural sampling rate, while positive values will resample the features to match the specified frame rate.

feat_mod_max

The feat_mod_max parameter is a float that sets the maximum value for the feature modifier. The default is 10000.0, with a range from -10000.0 to 10000.0. This parameter ensures that the feature modifiers do not exceed a certain threshold, which can be useful for maintaining consistency in visual outputs.

feat_mod_min

The feat_mod_min parameter is a float that sets the minimum value for the feature modifier. The default is -10000.0, with a range from -10000.0 to 10000.0. Similar to feat_mod_max, this parameter helps in controlling the lower bound of the feature modifiers.

feat_mod_normalize

The feat_mod_normalize parameter is a boolean option that determines whether the feature modifier array should be normalized between 0 and the maximum value in the array. This can be useful for standardizing the output, especially when comparing different audio files.

Audio Feature Calculator Output Parameters:

FEAT_MODS

The FEAT_MODS output is a 1D tensor containing the calculated feature modifiers for each frame of the audio. These modifiers represent the combined effect of various audio features and can be used for visualizing or further processing the audio data.

FEAT_SECONDS

The FEAT_SECONDS output is a float representing the duration of each frame in seconds. This value is derived from the total duration of the audio and the number of frames, providing a temporal context for the feature modifiers.

FPS

The FPS output is a float indicating the frames per second of the output data. This value reflects either the target frame rate specified by the fps_target parameter or the natural frame rate of the audio if no target was set.

CHARTS

The CHARTS output is an image that visualizes the various audio features and their modifiers. This visual representation can help in understanding the dynamics of the audio and the impact of different features on the overall analysis.

Audio Feature Calculator Usage Tips:

To achieve smoother visualizations, consider adjusting the hop_length to a smaller value, which increases the temporal resolution of the feature extraction.
Use the feat_mod_normalize option to ensure consistent feature modifier values across different audio files, which can be particularly useful when comparing or combining multiple audio sources.

Audio Feature Calculator Common Errors and Solutions:

"The maximum feature modifier value must be greater than the minimum feature modifier value."

Explanation: This error occurs when the feat_mod_max is set to a value less than or equal to feat_mod_min.
Solution: Ensure that feat_mod_max is greater than feat_mod_min to avoid this error.

"Invalid audio input format."

Explanation: This error might occur if the audio parameter is not provided as a tuple containing the audio data and sample rate.
Solution: Verify that the audio input is correctly formatted as a tuple with the necessary components.

Audio Feature Calculator Related Nodes

Go back to the extension to check out more related nodes.

MBM's Music Visualizer

Table of Content

Description
mbmAudioFeatureCalculator:
mbmAudioFeatureCalculator Input Parameters:
mbmAudioFeatureCalculator Output Parameters:
mbmAudioFeatureCalculator Usage Tips:
mbmAudioFeatureCalculator Common Errors and Solutions:
Related Nodes

ACE++ Character Consistency

Generate consistent images of your character across poses, angles, and styles from a single photo.

Consistent Style Transfer with Unsampling

Controlling latent noise with Unsampling helps dramatically increase consistency in video style transfer.

MatAnyone Video Matting | Single Mask Removal

Remove video backgrounds with one mask frame for perfect subject isolation.

Product Relighting | Magnific.AI Relight Alternative

Elevate your product photography effortlessly, a top alternative to Magnific.AI Relight.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.