Visit ComfyUI Online for ready-to-use ComfyUI environment
Converts text to speech using Microsoft's technology, automates audio file generation with voice options for personalization.
The Text2AutioEdgeTts
node is designed to convert text into speech using Microsoft's text-to-speech technology. This node leverages the capabilities of the edge_tts
library to transform written content into audio files, providing a seamless way to generate speech from text. It is particularly beneficial for AI artists and developers who wish to incorporate voice synthesis into their projects, allowing for the creation of dynamic audio content. The node automates the process of saving the generated audio in a specified directory, ensuring that the output is easily accessible and organized. By offering a range of voice options, it enables users to select the most suitable voice for their needs, enhancing the personalization and effectiveness of the audio output.
The voice
parameter allows you to select the specific voice that will be used for the text-to-speech conversion. It offers a variety of options, primarily focusing on different Chinese dialects and accents, such as zh-CN-XiaoxiaoNeural
, zh-CN-XiaoyiNeural
, and zh-TW-HsiaoChenNeural
, among others. This selection impacts the tone, accent, and overall sound of the generated speech, enabling you to tailor the audio output to your specific requirements. There are no minimum or maximum values, but rather a list of predefined voice options to choose from.
The filename_prefix
parameter is a string that serves as the prefix for the generated audio file's name. By default, it is set to "comfyUI". This parameter helps in organizing and identifying the audio files, especially when multiple files are generated. It allows you to customize the naming convention to suit your project needs, making it easier to manage and locate the files later.
The text
parameter is a multiline string input where you provide the content that you want to convert into speech. This is the core input for the node, as it directly influences the content of the generated audio. The text can be as simple or complex as needed, and the node will process it to produce a corresponding audio file. There are no specific minimum or maximum values, but the length and complexity of the text may affect processing time.
The 音频地址
(audio address) output parameter provides the file path to the generated audio file. This string output is crucial as it allows you to access and utilize the audio file created by the node. The path includes the directory and filename, ensuring that you can easily locate and use the audio in your projects. This output is essential for integrating the generated speech into other applications or for further processing.
voice
parameter is selected based on the desired accent and tone for your project to achieve the best results.filename_prefix
to help organize and identify your audio files, especially when generating multiple outputs.RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.