ComfyUI Node: VAEDecodeAudio

Class Name: VAEDecodeAudio
Category: latent/audio
Author: ComfyAnonymous (Account age: 833 days)
Extension: ComfyUI
Latest Updated: 2025-04-05
GitHub Stars: 73.39K

Table of Contents

Description
VAEDecodeAudio:
VAEDecodeAudio Input Parameters:
VAEDecodeAudio Output Parameters:
VAEDecodeAudio Usage Tips:
VAEDecodeAudio Common Errors and Solutions:
Related Nodes

How to Install ComfyUI

Install this extension via the ComfyUI Manager by searching for ComfyUI

1. Click the Manager button in the main menu
2. Select the Custom Nodes Manager button
3. Enter ComfyUI in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

VAEDecodeAudio Description

Convert latent audio representations to audible waveforms using VAE for AI artists working with generative audio models.

VAEDecodeAudio:

The VAEDecodeAudio node converts latent audio representations back into audible waveforms using a Variational Autoencoder (VAE). It is useful for AI artists working with generative audio models, because it turns latent audio data into a waveform that can be played back and processed further. The fidelity and coherence of the result depend on the VAE used for decoding, which makes this node a key step in audio synthesis and manipulation workflows.

VAEDecodeAudio Input Parameters:

samples

The samples parameter is the latent audio data to decode. This data is typically generated by an encoder or another generative model and is a compressed latent representation of the audio rather than the waveform itself. The VAE reconstructs an audio waveform from this representation, so the quality and characteristics of the decoded audio depend heavily on the information contained in these latent samples.

vae

The vae parameter specifies the Variational Autoencoder model that will be used to decode the latent audio samples. The VAE is a crucial component as it contains the learned parameters and architecture necessary to accurately reconstruct the audio from its latent representation. The choice of VAE can significantly impact the quality and style of the decoded audio.
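To illustrate how the two inputs are wired, here is a minimal sketch of this node as an entry in an API-format ComfyUI workflow, written as a Python dict. The node ids ("3", "4", "5"), the output indices, and the helper name vae_decode_audio_node are hypothetical placeholders; only the class name and the input names samples and vae come from this page.

```python
import json

# Hypothetical node ids: "3" stands for whichever node produces the audio
# latent, and "4" for whichever node loads the VAE.
LATENT_SOURCE_ID = "3"
VAE_SOURCE_ID = "4"


def vae_decode_audio_node(latent_id: str, vae_id: str,
                          latent_slot: int = 0, vae_slot: int = 0) -> dict:
    """Build an API-format entry for VAEDecodeAudio.

    Each input is a [source_node_id, output_index] link. The right output
    index depends on the upstream node, so the defaults are assumptions.
    """
    return {
        "class_type": "VAEDecodeAudio",
        "inputs": {
            "samples": [latent_id, latent_slot],
            "vae": [vae_id, vae_slot],
        },
    }


# "5" is an arbitrary id chosen for the decode node itself.
workflow_fragment = {"5": vae_decode_audio_node(LATENT_SOURCE_ID, VAE_SOURCE_ID)}
print(json.dumps(workflow_fragment, indent=2))
```

In a full workflow this fragment would sit alongside the nodes that actually produce the latent and load the VAE; the sketch only shows how the two connections are expressed.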

VAEDecodeAudio Output Parameters:

AUDIO

The AUDIO output provides the decoded audio waveform along with its sample rate. It is a dictionary with two keys: waveform, which holds the audio data as a tensor, and sample_rate, which is set to 44100 Hz. This is a widely used sample rate, so the output is compatible with most audio processing tools and playback devices.
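As a rough illustration of how a downstream script might read this dictionary, the sketch below computes the clip duration from the two keys. It assumes the last dimension of waveform counts samples (for example a [batch, channels, samples] layout), which this page does not state. The describe_audio helper is hypothetical, and a SimpleNamespace stands in for a real tensor so the example runs without any tensor library.

```python
from types import SimpleNamespace


def describe_audio(audio: dict) -> dict:
    """Summarize an AUDIO dictionary without touching the sample data."""
    waveform = audio["waveform"]
    sample_rate = audio["sample_rate"]
    # Assumption: the last dimension of the waveform counts samples.
    num_samples = waveform.shape[-1]
    return {
        "shape": tuple(waveform.shape),
        "sample_rate": sample_rate,
        "duration_seconds": num_samples / sample_rate,
    }


# Stand-in for a real tensor: only the .shape attribute is needed here.
fake_waveform = SimpleNamespace(shape=(1, 2, 88200))
info = describe_audio({"waveform": fake_waveform, "sample_rate": 44100})
print(info["duration_seconds"])  # 2.0
```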

VAEDecodeAudio Usage Tips:

Make sure the latent samples passed to the samples parameter were generated for, and are compatible with, the VAE model passed to the vae parameter.
Use a well-trained VAE model. The performance of the VAE directly affects the fidelity of the decoded audio.
If the decoded audio sounds distorted or unclear, first confirm that the VAE matches the model that produced the latents. If it does, consider retraining the VAE on a more diverse, higher-quality dataset to improve its decoding capabilities.
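When decoded audio sounds distorted, a quick numeric check can help narrow down the cause before any retraining. The sketch below is one possible diagnostic, not part of the node: it assumes one channel of the waveform has been converted to a plain list of floats meant to stay within [-1.0, 1.0], and the waveform_stats helper is hypothetical.

```python
import math


def waveform_stats(samples: list[float]) -> dict:
    """Return peak, RMS, and simple warning flags for one channel of audio."""
    if not samples:
        raise ValueError("no samples to inspect")
    # NaN or infinite values usually point to a numerical problem upstream.
    has_non_finite = any(not math.isfinite(s) for s in samples)
    finite = [s for s in samples if math.isfinite(s)]
    peak = max((abs(s) for s in finite), default=0.0)
    rms = math.sqrt(sum(s * s for s in finite) / len(finite)) if finite else 0.0
    return {
        "peak": peak,
        "rms": rms,
        "has_non_finite": has_non_finite,
        # Assumption: samples are floats meant to stay within [-1.0, 1.0].
        "may_clip": peak > 1.0,
    }


stats = waveform_stats([0.0, 0.5, -1.2, 0.25])
print(stats)
```

A peak above 1.0 or any non-finite value suggests a level or numerical issue in the pipeline, whereas clean statistics with poor sound point back toward the latent and VAE pairing.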

VAEDecodeAudio Common Errors and Solutions:

`Invalid latent samples format`

Explanation: The latent samples provided are not in the expected format or structure.
Solution: Ensure that the latent samples are correctly generated and match the expected input format for the VAE model.

`VAE model not found`

Explanation: The specified VAE model is not available or not properly loaded.
Solution: Verify that the VAE model is correctly specified and loaded into the system. Check for any issues with the model file or its path.

`Decoding failed due to incompatible VAE`

Explanation: The VAE model provided is not compatible with the latent samples.
Solution: Ensure that the latent samples and the VAE model are from the same training setup and are compatible with each other.
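The checks described above can be sketched as a small pre-flight helper. This is an illustrative sketch only: check_decode_inputs is a hypothetical name, its messages are not ComfyUI's own error strings, and it assumes that a latent is a dictionary with a "samples" entry and that the VAE object exposes a decode method.

```python
def check_decode_inputs(latent, vae) -> list[str]:
    """Return human-readable problems; an empty list means none were found."""
    problems = []
    # Assumption: a latent is a dict that carries its data under "samples".
    if not isinstance(latent, dict) or "samples" not in latent:
        problems.append("latent is not a dictionary with a 'samples' entry")
    # Assumption: a usable VAE object exposes a callable decode method.
    if vae is None:
        problems.append("no VAE was provided")
    elif not callable(getattr(vae, "decode", None)):
        problems.append("VAE object has no callable decode method")
    return problems


class DummyVAE:
    """Stand-in for a real VAE so the example runs on its own."""

    def decode(self, samples):
        return samples


print(check_decode_inputs({"samples": [0.0]}, DummyVAE()))  # []
print(check_decode_inputs([0.0], None))
```

These checks only catch structural problems. A latent and a VAE from different training setups can still pass them and fail, or sound wrong, at decode time.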

VAEDecodeAudio Related Nodes

Go back to the extension to check out more related nodes.
