Generate high-quality audio from text prompts using advanced machine learning models for creating unique soundscapes and effects.
The AudioLDM2Node generates audio from textual descriptions. It uses a machine learning model to produce audio that matches the given text prompt, so AI artists can create soundscapes and audio effects without recording or editing source material. Alongside the text prompt, configuration settings control characteristics of the generated audio such as its duration, guidance scale, and sample rate. The node is particularly useful for creating audio content for multimedia projects and for experimenting with new sounds inside an existing creative workflow.
This parameter takes a string input that describes the audio you want to generate. The text prompt guides the model in creating audio that matches the description. The default value is an empty string, and it is a required input.
This string input allows you to specify elements that should be avoided in the generated audio. By providing a negative prompt, you can refine the output to better match your desired outcome. The default value is an empty string, and it is a required input.
This integer parameter defines the length of the generated audio in seconds. You can set the duration between 1 and 60 seconds, with a default value of 10 seconds.
This float parameter controls the influence of the text prompt on the generated audio. A higher guidance scale makes the audio more closely match the text description. The value can range from 0.1 to 20.0, with a default of 3.5. Fine-tuning this parameter helps achieve the desired balance between creativity and adherence to the prompt.
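To make the effect of the guidance scale concrete, the sketch below shows classifier-free guidance, the technique many diffusion pipelines use to weight a prompt. Whether this node applies exactly this formula internally is an assumption; the function name `apply_guidance` and the toy vectors are hypothetical and stand in for real model predictions.

```python
def apply_guidance(uncond, cond, guidance_scale):
    """Blend unconditional and prompt-conditioned predictions.

    Classifier-free guidance: start from the unconditional prediction and
    move toward (or past) the conditioned one by `guidance_scale`.
    """
    return [u + guidance_scale * (c - u) for u, c in zip(uncond, cond)]


# Toy noise predictions, not real model output.
uncond = [0.0, 1.0, -1.0]
cond = [1.0, 1.0, 0.0]

print(apply_guidance(uncond, cond, 1.0))  # [1.0, 1.0, 0.0], the conditioned prediction
print(apply_guidance(uncond, cond, 3.5))  # [3.5, 1.0, 2.5], pushed further toward the prompt
```

At a scale of 1.0 the result is simply the conditioned prediction; larger values exaggerate the difference the prompt makes, which is why high settings follow the text more literally at the cost of variety.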
This integer parameter sets the random seed for audio generation, ensuring reproducibility of results. By using the same seed, you can generate identical audio outputs for the same input parameters. The default value is 42.
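The role of the seed can be illustrated with a seeded random number generator. This is a minimal sketch using Python's standard library, not the node's actual generator; `fake_noise` is a hypothetical stand-in for the initial noise a generative model would draw.

```python
import random


def fake_noise(seed, n=4):
    """Hypothetical stand-in for the starting noise of a generation run."""
    rng = random.Random(seed)  # local RNG, so global random state is untouched
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


same_a = fake_noise(42)
same_b = fake_noise(42)
other = fake_noise(43)

print(same_a == same_b)  # identical seeds give identical draws
print(same_a == other)   # a different seed gives different draws
```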
This integer parameter specifies the number of audio candidates to generate. You can choose between 1 and 10 candidates, with a default value of 3. Generating multiple candidates allows you to select the best match for your needs.
This integer parameter determines the sample rate of the generated audio, affecting its quality and file size. The sample rate can be set between 8000 and 48000 Hz, with a default value of 16000 Hz. Higher sample rates can represent more high-frequency content, at the cost of larger files.
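The trade-off between sample rate and file size is simple arithmetic. The sketch below assumes uncompressed 16-bit mono PCM, which is an assumption about the output rather than something the node documentation states; `pcm_size_bytes` is a hypothetical helper.

```python
def pcm_size_bytes(duration_s, sample_rate, bits_per_sample=16, channels=1):
    """Size of raw uncompressed PCM audio data, excluding any container header."""
    return duration_s * sample_rate * (bits_per_sample // 8) * channels


default_size = pcm_size_bytes(10, 16000)   # 10 s at the default 16000 Hz
max_rate_size = pcm_size_bytes(10, 48000)  # 10 s at the maximum 48000 Hz

print(default_size)   # 320000
print(max_rate_size)  # 960000
```

Tripling the sample rate triples the amount of raw audio data for the same duration.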
This parameter allows you to choose the file format for the generated audio. Available options are "wav", "mp3", and "flac", with "wav" as the default. Selecting the appropriate format depends on your specific use case and compatibility requirements.
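The defaults and ranges described above can be collected in one place and checked before a run. The following is a minimal sketch: the class and field names (`AudioSettings`, `duration_s`, `n_candidates`, and so on) are hypothetical and are not the node's real input names; only the defaults and limits come from the descriptions above.

```python
from dataclasses import dataclass

ALLOWED_FORMATS = ("wav", "mp3", "flac")


@dataclass
class AudioSettings:
    # Field names are hypothetical; defaults and ranges follow the text above.
    prompt: str = ""
    negative_prompt: str = ""
    duration_s: int = 10
    guidance_scale: float = 3.5
    seed: int = 42
    n_candidates: int = 3
    sample_rate: int = 16000
    file_format: str = "wav"

    def validate(self):
        """Return a list of problems; an empty list means every value is in range."""
        problems = []
        if not 1 <= self.duration_s <= 60:
            problems.append("duration_s must be between 1 and 60")
        if not 0.1 <= self.guidance_scale <= 20.0:
            problems.append("guidance_scale must be between 0.1 and 20.0")
        if not 1 <= self.n_candidates <= 10:
            problems.append("n_candidates must be between 1 and 10")
        if not 8000 <= self.sample_rate <= 48000:
            problems.append("sample_rate must be between 8000 and 48000")
        if self.file_format not in ALLOWED_FORMATS:
            problems.append("file_format must be one of " + ", ".join(ALLOWED_FORMATS))
        return problems


print(AudioSettings(prompt="rain on a tin roof").validate())  # []
print(AudioSettings(duration_s=90, file_format="ogg").validate())
```

Returning a list of problems rather than raising on the first one lets a caller report every out-of-range value at once.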
This output parameter provides the waveform data of the generated audio. The waveform is a numerical representation of the audio signal, which can be used for further processing or playback.
This output parameter returns the sample rate of the generated audio. It indicates the number of samples per second in the audio file, which is crucial for playback and compatibility with other audio processing tools.
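The two outputs are only meaningful together: the sample rate is what turns a list of samples into a duration and a playable file. The sketch below uses Python's standard `wave` module and assumes the waveform is a flat sequence of mono float samples in the range -1.0 to 1.0, which is an assumption about the node's output layout. The helper names and the synthetic sine tone are hypothetical stand-ins for real generated audio.

```python
import io
import math
import struct
import wave


def duration_seconds(waveform, sample_rate):
    """Length of the audio in seconds: number of samples divided by samples per second."""
    return len(waveform) / sample_rate


def to_wav_bytes(waveform, sample_rate):
    """Encode float samples in [-1.0, 1.0] as a 16-bit mono WAV file in memory."""
    clipped = [max(-1.0, min(1.0, s)) for s in waveform]
    frames = struct.pack("<%dh" % len(clipped), *(int(s * 32767) for s in clipped))
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)            # mono
        wav.setsampwidth(2)            # 2 bytes = 16 bits per sample
        wav.setframerate(sample_rate)  # must match the rate the audio was generated at
        wav.writeframes(frames)
    return buf.getvalue()


# Stand-in waveform: one second of a 440 Hz tone at 16000 Hz.
sample_rate = 16000
waveform = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(sample_rate)]

print(duration_seconds(waveform, sample_rate))  # 1.0
wav_bytes = to_wav_bytes(waveform, sample_rate)
print(len(wav_bytes))  # WAV header plus 32000 bytes of sample data
```

Writing the file with a different sample rate than the one returned by the node would change the playback speed and pitch, which is why the two values should be passed along together.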
© Copyright 2024 RunComfy. All Rights Reserved.