ComfyUI > Nodes > ComfyUI_Lam > 微软文本转语音

ComfyUI Node: 微软文本转语音

Class Name

Text2AutioEdgeTts

Category
lam
Author
Lam Yan (Account age: 3065days)
Extension
ComfyUI_Lam
Latest Updated
2025-03-06
Github Stars
0.02K

How to Install ComfyUI_Lam

Install this extension via the ComfyUI Manager by searching for ComfyUI_Lam
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI_Lam in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

微软文本转语音 Description

Converts text to speech using Microsoft's technology, automates audio file generation with voice options for personalization.

微软文本转语音:

The Text2AutioEdgeTts node is designed to convert text into speech using Microsoft's text-to-speech technology. This node leverages the capabilities of the edge_tts library to transform written content into audio files, providing a seamless way to generate speech from text. It is particularly beneficial for AI artists and developers who wish to incorporate voice synthesis into their projects, allowing for the creation of dynamic audio content. The node automates the process of saving the generated audio in a specified directory, ensuring that the output is easily accessible and organized. By offering a range of voice options, it enables users to select the most suitable voice for their needs, enhancing the personalization and effectiveness of the audio output.

微软文本转语音 Input Parameters:

voice

The voice parameter allows you to select the specific voice that will be used for the text-to-speech conversion. It offers a variety of options, primarily focusing on different Chinese dialects and accents, such as zh-CN-XiaoxiaoNeural, zh-CN-XiaoyiNeural, and zh-TW-HsiaoChenNeural, among others. This selection impacts the tone, accent, and overall sound of the generated speech, enabling you to tailor the audio output to your specific requirements. There are no minimum or maximum values, but rather a list of predefined voice options to choose from.

filename_prefix

The filename_prefix parameter is a string that serves as the prefix for the generated audio file's name. By default, it is set to "comfyUI". This parameter helps in organizing and identifying the audio files, especially when multiple files are generated. It allows you to customize the naming convention to suit your project needs, making it easier to manage and locate the files later.

text

The text parameter is a multiline string input where you provide the content that you want to convert into speech. This is the core input for the node, as it directly influences the content of the generated audio. The text can be as simple or complex as needed, and the node will process it to produce a corresponding audio file. There are no specific minimum or maximum values, but the length and complexity of the text may affect processing time.

微软文本转语音 Output Parameters:

音频地址

The 音频地址 (audio address) output parameter provides the file path to the generated audio file. This string output is crucial as it allows you to access and utilize the audio file created by the node. The path includes the directory and filename, ensuring that you can easily locate and use the audio in your projects. This output is essential for integrating the generated speech into other applications or for further processing.

微软文本转语音 Usage Tips:

  • Ensure that the voice parameter is selected based on the desired accent and tone for your project to achieve the best results.
  • Use a descriptive filename_prefix to help organize and identify your audio files, especially when generating multiple outputs.
  • When inputting text, consider the length and complexity, as longer texts may take more time to process and convert into audio.

微软文本转语音 Common Errors and Solutions:

Directory not found

  • Explanation: This error occurs if the specified output directory does not exist.
  • Solution: Ensure that the output directory is correctly set up and accessible. The node should automatically create the directory if it doesn't exist, but verify permissions and paths if issues persist.

Invalid voice selection

  • Explanation: This error arises when a voice not listed in the available options is selected.
  • Solution: Double-check the list of available voices and ensure that the selected voice is one of the predefined options provided by the node.

Text input is empty

  • Explanation: This error occurs when no text is provided for conversion.
  • Solution: Make sure to input the text you wish to convert into speech. The text field should not be left empty to avoid this error.

微软文本转语音 Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI_Lam
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.