Isolate audio components using advanced machine learning for remixing, karaoke, and analysis.
The AudioSeparation node is designed to isolate different components of an audio track, such as bass, drums, other instruments, and vocals. This node leverages advanced machine learning models to perform source separation, allowing you to extract individual elements from a mixed audio file. This can be particularly useful for remixing, creating karaoke tracks, or analyzing specific parts of a song. The node processes the audio by resampling it to match the model's expected sample rate, normalizing the waveform, and then applying a pre-trained model to separate the sources. The result is a set of isolated audio tracks that can be used independently or recombined as needed.
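The preprocessing steps described above (resampling to the model's expected rate, then normalizing the waveform) can be sketched as follows. This is an illustration only: `resample_linear` and `normalize_peak` are hypothetical helpers written with NumPy for clarity, not the node's actual code, which would typically use a proper polyphase resampler such as torchaudio's.

```python
import numpy as np

def resample_linear(waveform, sr_in, sr_out):
    """Naive linear-interpolation resampler (illustration only; real
    implementations use band-limited polyphase filtering)."""
    n_out = int(round(len(waveform) * sr_out / sr_in))
    x_old = np.linspace(0.0, 1.0, num=len(waveform), endpoint=False)
    x_new = np.linspace(0.0, 1.0, num=n_out, endpoint=False)
    return np.interp(x_new, x_old, waveform)

def normalize_peak(waveform, eps=1e-9):
    """Scale the signal so the loudest sample has magnitude ~1."""
    return waveform / (np.abs(waveform).max() + eps)

# A quiet 440 Hz test tone at 22.05 kHz, prepared for a 44.1 kHz model.
sr_in, sr_out = 22050, 44100
wave = 0.25 * np.sin(2 * np.pi * 440 * np.arange(sr_in) / sr_in)
prepared = normalize_peak(resample_linear(wave, sr_in, sr_out))
```

After this step the waveform matches the model's sample rate and has a consistent peak level, which keeps the separation model's input in the range it was trained on.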
audio
This parameter accepts an audio file in the form of a dictionary containing the waveform and sample rate. The waveform is a tensor representation of the audio signal, and the sample rate is the number of samples per second. This input is required for the node to function.
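A minimal sketch of that dictionary layout is shown below. ComfyUI stores the waveform as a torch tensor shaped (batch, channels, samples); NumPy is used here only to keep the example dependency-free, and the specific sizes are arbitrary.

```python
import numpy as np

# Two seconds of silent stereo audio in the AUDIO-dict layout:
# waveform shaped (batch, channels, samples) plus an integer sample rate.
sample_rate = 44100
samples = sample_rate * 2
waveform = np.zeros((1, 2, samples), dtype=np.float32)

audio = {"waveform": waveform, "sample_rate": sample_rate}

# The duration in seconds falls out of the last axis and the sample rate.
duration = audio["waveform"].shape[-1] / audio["sample_rate"]
```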
chunk_fade_shape
This optional parameter defines the shape of the fade applied to the chunks of audio during processing. The available options are "linear", "half_sine", "logarithmic", and "exponential". The default value is "linear". This parameter affects how smoothly the transitions between chunks are handled, which can impact the quality of the separated audio.
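The four option names suggest fade curves along these lines. The exact formulas the node uses may differ; the versions below are plausible sketches, each rising from 0 to 1 over the fade window.

```python
import numpy as np

def fade_curve(n, shape="linear"):
    """Fade-in curves over n samples, named after the options above.
    (Plausible sketches -- the node's exact formulas may differ.)"""
    t = np.linspace(0.0, 1.0, n)
    if shape == "linear":
        return t                              # constant slope
    if shape == "half_sine":
        return np.sin(t * np.pi / 2)          # quarter of a sine period
    if shape == "logarithmic":
        return np.log1p(t * (np.e - 1))       # fast start, gentle finish
    if shape == "exponential":
        return (np.exp(t) - 1) / (np.e - 1)   # gentle start, fast finish
    raise ValueError(f"unknown fade shape: {shape}")
```

During recombination, a fade-out on one chunk is paired with the fade-in on the next so the overlapping region crossfades instead of clicking.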
chunk_length
This optional parameter specifies the length of each audio chunk in seconds. The default value is 10.0 seconds. Adjusting this parameter can influence the performance and accuracy of the source separation, as shorter chunks may process faster but could result in less accurate separation.
chunk_overlap
This optional parameter determines the overlap between consecutive audio chunks, expressed as a fraction of the chunk length. The default value is 0.1 (10%). Overlapping chunks can help to ensure smoother transitions and reduce artifacts in the separated audio.
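The interaction between chunk length and overlap can be sketched as below: with the defaults, each 10-second chunk advances by 9 seconds, so consecutive chunks share 10% of their samples. This is an illustration of the scheme described above, not the node's exact chunking code.

```python
def chunk_starts(num_samples, sample_rate, chunk_length=10.0, chunk_overlap=0.1):
    """Start indices of overlapping chunks (sketch of the scheme above)."""
    chunk = int(chunk_length * sample_rate)
    if num_samples <= chunk:
        return [0], chunk                      # audio fits in one chunk
    hop = int(chunk * (1.0 - chunk_overlap))   # step between chunk starts
    starts = list(range(0, num_samples - chunk + 1, hop))
    if starts[-1] + chunk < num_samples:
        starts.append(num_samples - chunk)     # cover the tail of the audio
    return starts, chunk

# 30 seconds of 44.1 kHz audio with the default 10 s chunks, 10% overlap.
sr = 44100
starts, chunk = chunk_starts(num_samples=30 * sr, sample_rate=sr)
```

Shortening `chunk_length` increases the number of chunks (more model invocations, less memory per invocation), while raising `chunk_overlap` trades extra computation for smoother seams.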
bass
This output provides the isolated bass component of the input audio. It is one of the four separated sources and can be used independently or in combination with the other outputs.
drums
This output provides the isolated drums component of the input audio. It is one of the four separated sources and can be used independently or in combination with the other outputs.
other
This output provides the isolated "other" instruments component of the input audio. It includes all elements that are not classified as bass, drums, or vocals. It is one of the four separated sources and can be used independently or in combination with the other outputs.
vocals
This output provides the isolated vocals component of the input audio. It is one of the four separated sources and can be used independently or in combination with the other outputs.
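Because the four stems partition the mix, summing all of them approximately reconstructs the original signal, and summing a subset gives a custom mix. The random arrays below stand in for real stem waveforms purely to illustrate the arithmetic.

```python
import numpy as np

# Stand-ins for the four separated stems (random data, illustration only).
rng = np.random.default_rng(0)
bass, drums, other, vocals = (rng.standard_normal(1000) for _ in range(4))

# Dropping vocals and summing the rest gives a karaoke backing track;
# adding vocals back on top approximately restores the full mix.
instrumental = bass + drums + other
full_mix = instrumental + vocals
```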
Experiment with the different chunk_fade_shape options to find the one that produces the smoothest transitions for your specific audio.
Adjust the chunk_length and chunk_overlap parameters to balance between processing time and separation accuracy. Shorter chunks may process faster but could result in less accurate separation.
If the node fails or produces unexpected results, double-check the values of the chunk_length and chunk_overlap parameters.
If you run out of memory, reduce the chunk_length or process the audio on a machine with more GPU memory. Alternatively, try running the node on a CPU if GPU resources are limited.
© Copyright 2024 RunComfy. All Rights Reserved.