ComfyUI Node: Zonos Emotion

Class Name: ZonosEmotion
Category: audio
Author: BuffMcBigHuge (Account age: 3170 days)
Extension: ComfyUI-Zonos
Last Updated: 2025-03-07
Github Stars: 0.05K

How to Install ComfyUI-Zonos

Install this extension via the ComfyUI Manager by searching for ComfyUI-Zonos:
  1. Click the Manager button in the main menu.
  2. Select the Custom Nodes Manager button.
  3. Enter ComfyUI-Zonos in the search bar.
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Zonos Emotion Description

Create emotion vectors for Zonos TTS by defining and manipulating emotional intensities to ensure balanced and realistic speech expression.

Zonos Emotion:

The ZonosEmotion node creates emotion vectors for Zonos Text-to-Speech (TTS). You set an intensity for each of eight components: happiness, sadness, disgust, fear, surprise, anger, other, and neutral. The node then normalizes these intensities so that they sum to one, which means the output reflects the relative weight of each emotion rather than the absolute values you entered. This lets AI artists and developers add nuanced emotional expression to generated speech.
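The normalization described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the node's actual source: the function name `normalize_emotions` and the choice to raise an error when every intensity is zero are assumptions made for the example. The input values are the defaults documented in the parameter list on this page.

```python
# Emotion names in the order the node lists its inputs.
EMOTIONS = ["happy", "sad", "disgust", "fear", "surprise", "anger", "other", "neutral"]


def normalize_emotions(intensities):
    """Scale raw intensities so that they sum to 1.0 (hypothetical helper)."""
    total = sum(intensities[name] for name in EMOTIONS)
    if total == 0:
        # Assumption: the sketch rejects an all-zero input instead of dividing by zero.
        raise ValueError("at least one emotion intensity must be greater than 0")
    return [intensities[name] / total for name in EMOTIONS]


# Default values documented for the node's inputs.
defaults = {
    "happy": 1.0,
    "sad": 0.05,
    "disgust": 0.05,
    "fear": 0.05,
    "surprise": 0.05,
    "anger": 0.05,
    "other": 0.1,
    "neutral": 0.2,
}

vector = normalize_emotions(defaults)
for name, share in zip(EMOTIONS, vector):
    print(f"{name:>8}: {share:.3f}")
```

The defaults add up to 1.55, so after normalization the happy component is 1.0 / 1.55, roughly 0.645, and the vector as a whole sums to one.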

Zonos Emotion Input Parameters:

happy

This parameter represents the intensity of happiness in the emotion vector. It influences how cheerful or joyful the synthesized speech will sound. The value ranges from 0.0 to 1.0, with a default of 1.0, allowing you to adjust the level of happiness to suit your needs.

sad

The sadness parameter controls the level of sadness in the emotion vector. It affects the melancholic tone of the speech. The value ranges from 0.0 to 1.0, with a default of 0.05, enabling you to fine-tune the degree of sadness expressed.

disgust

This parameter sets the intensity of disgust in the emotion vector, impacting the repulsiveness or aversion conveyed in the speech. The value ranges from 0.0 to 1.0, with a default of 0.05, allowing for subtle adjustments to the disgust level.

fear

The fear parameter determines the intensity of fear in the emotion vector, influencing the anxious or scared tone of the speech. The value ranges from 0.0 to 1.0, with a default of 0.05, providing control over the fearfulness expressed.

surprise

This parameter controls the intensity of surprise in the emotion vector, affecting the astonished or shocked tone of the speech. The value ranges from 0.0 to 1.0, with a default of 0.05, allowing you to adjust the level of surprise.

anger

The anger parameter sets the intensity of anger in the emotion vector, impacting the aggressive or frustrated tone of the speech. The value ranges from 0.0 to 1.0, with a default of 0.05, enabling you to fine-tune the degree of anger expressed.

other

This parameter represents the intensity of other unspecified emotions in the emotion vector. It allows for the inclusion of additional emotional nuances. The value ranges from 0.0 to 1.0, with a default of 0.1, providing flexibility in emotional expression.

neutral

The neutral parameter controls the intensity of neutrality in the emotion vector, affecting the balanced or emotionless tone of the speech. The value ranges from 0.0 to 1.0, with a default of 0.2, allowing you to adjust the level of neutrality.

Zonos Emotion Output Parameters:

EMOTION

The output is an emotion tensor that encapsulates the normalized intensities of the specified emotions. This tensor is crucial for conditioning the TTS model to produce speech with the desired emotional characteristics. By providing a structured representation of emotions, it enables the synthesis of expressive and contextually appropriate audio outputs.

Zonos Emotion Usage Tips:

  • The node automatically normalizes the emotion intensities so that they sum to 1.0, so you do not need to make them add up yourself. What matters is the ratio between values: raising one emotion relative to the others increases its share of the final vector.
  • Experiment with different combinations of emotion intensities to create unique and expressive speech outputs. For instance, combining high levels of surprise and happiness can result in a joyful and astonished tone.
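Because the intensities are normalized, only their proportions matter. The short sketch below illustrates this with a surprise-plus-happiness mix; the `normalize` helper and the specific numbers are illustrative assumptions, not values taken from the extension.

```python
def normalize(values):
    """Divide each value by the total so the result sums to 1.0."""
    total = sum(values)
    return [v / total for v in values]


# Order: happy, sad, disgust, fear, surprise, anger, other, neutral
mix = [0.8, 0.0, 0.0, 0.0, 0.8, 0.0, 0.0, 0.4]
halved = [v / 2 for v in mix]  # same proportions, half the magnitude

a = normalize(mix)
b = normalize(halved)

# Both inputs yield the same normalized vector.
same = all(abs(x - y) < 1e-9 for x, y in zip(a, b))
print(same)
```

In this example happy and surprise each take a 0.4 share and neutral takes 0.2, whether the sliders are set to 0.8/0.8/0.4 or 0.4/0.4/0.2.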

Zonos Emotion Common Errors and Solutions:

Invalid emotion format, using neutral

  • Explanation: This error occurs when the emotion data provided is not in the expected tensor format.
  • Solution: Ensure that the emotion data is correctly formatted as a tensor. If unsure, rely on the default neutral emotion tensor provided by the node.
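The fallback behavior behind this message can be pictured roughly as follows. This is a sketch under stated assumptions: it uses plain Python lists instead of tensors to stay dependency-free, the helper `coerce_emotion` is hypothetical, and the "neutral" vector is assumed to put all weight on the neutral component, which the source does not confirm.

```python
import numbers

# Assumption: "neutral" means all weight on the last (neutral) component.
NEUTRAL = [0.0] * 7 + [1.0]


def coerce_emotion(value, size=8):
    """Return value as a list if it looks like an emotion vector, else neutral."""
    is_valid = (
        isinstance(value, (list, tuple))
        and len(value) == size
        and all(
            isinstance(v, numbers.Real) and not isinstance(v, bool)
            for v in value
        )
    )
    if not is_valid:
        # Mirrors the message documented above.
        print("Invalid emotion format, using neutral")
        return list(NEUTRAL)
    return list(value)


print(coerce_emotion("happy"))     # wrong type, falls back to NEUTRAL
print(coerce_emotion([0.125] * 8)) # valid, returned unchanged
```

In practice, connecting the EMOTION output of this node directly to the downstream input avoids the fallback altogether.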

Voice not found, using main

  • Explanation: This message indicates that the specified voice is not available in the system.
  • Solution: Verify that the voice name is correct and exists in the available voices. If not, the system will default to using the main voice.

Copyright 2025 RunComfy. All Rights Reserved.