
ComfyUI Node: 微软文本转语音 (Microsoft Text-to-Speech)

Class Name: Text2AutioEdgeTts
Category: lam
Author: Lam Yan (Account age: 3093 days)
Extension: ComfyUI_Lam
Latest Updated: 2025-04-04
GitHub Stars: 0.03K


Table of Contents

Description
Text2AutioEdgeTts:
Text2AutioEdgeTts Input Parameters:
Text2AutioEdgeTts Output Parameters:
Text2AutioEdgeTts Usage Tips:
Text2AutioEdgeTts Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_Lam

Install this extension through the ComfyUI Manager by searching for ComfyUI_Lam:

1. Click the Manager button in the main menu
2. Select the Custom Nodes Manager button
3. Enter ComfyUI_Lam in the search bar, then install the extension from the results

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.


微软文本转语音 Description

Converts text to speech using Microsoft's text-to-speech technology and saves the result as an audio file, with a choice of voices.

微软文本转语音:

The Text2AutioEdgeTts node converts text into speech using Microsoft's text-to-speech technology through the edge_tts library. It automatically saves the generated audio to a specified directory, so the output is easy to find and reuse. A range of voice options lets you pick the voice that best suits the project, which makes the node useful for AI artists and developers who want to add voice synthesis to their workflows.
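The flow described above can be sketched with the edge_tts package directly. This is a minimal sketch under stated assumptions, not the node's actual source: the helper names `build_output_path` and `synthesize`, the timestamped naming scheme, and the `.mp3` extension are illustrative choices. `edge_tts.Communicate` and its `save` coroutine come from the edge_tts package. Calling `synthesize` requires that package and network access.

```python
import os
import time


def build_output_path(output_dir: str, filename_prefix: str) -> str:
    """Return a timestamped .mp3 path under output_dir (hypothetical naming scheme)."""
    stamp = time.strftime("%Y%m%d_%H%M%S")
    return os.path.join(output_dir, f"{filename_prefix}_{stamp}.mp3")


async def synthesize(text: str, voice: str, output_dir: str,
                     filename_prefix: str = "comfyUI") -> str:
    """Convert text to speech with edge_tts and return the saved file path."""
    # Imported lazily so the path helper above works without the package installed.
    import edge_tts

    os.makedirs(output_dir, exist_ok=True)  # create the target directory if missing
    path = build_output_path(output_dir, filename_prefix)
    communicate = edge_tts.Communicate(text, voice)
    await communicate.save(path)  # streams the audio from the service into the file
    return path


# Example call (needs edge_tts installed and network access):
#   import asyncio
#   asyncio.run(synthesize("你好，世界", "zh-CN-XiaoxiaoNeural", "output/audio"))
```

Keeping the path construction separate from the network call makes the naming logic easy to check on its own.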

微软文本转语音 Input Parameters:

voice

The voice parameter selects the voice used for the text-to-speech conversion. The options focus primarily on Chinese dialects and accents, such as zh-CN-XiaoxiaoNeural, zh-CN-XiaoyiNeural, and zh-TW-HsiaoChenNeural, among others. The choice determines the tone, accent, and overall sound of the generated speech. Because it is a selection from a predefined list, there are no minimum or maximum values.
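Voice names follow a locale-prefixed pattern (language, region, then voice name), so a predefined list can be narrowed by locale. The sketch below is illustrative only: `VOICES` contains just the three voices named above (the node's full list is longer), and `voices_for_locale` is a hypothetical helper.

```python
# Only the three voices named in the text; the node's real list is longer.
VOICES = (
    "zh-CN-XiaoxiaoNeural",
    "zh-CN-XiaoyiNeural",
    "zh-TW-HsiaoChenNeural",
)


def voices_for_locale(locale, voices=VOICES):
    """Return the voices whose name starts with the given locale, e.g. 'zh-CN'."""
    prefix = locale + "-"
    return [v for v in voices if v.startswith(prefix)]


print(voices_for_locale("zh-CN"))
# → ['zh-CN-XiaoxiaoNeural', 'zh-CN-XiaoyiNeural']
```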

filename_prefix

The filename_prefix parameter is a string used as the prefix of the generated audio file's name. The default is "comfyUI". A custom prefix makes files easier to organize and identify, especially when a workflow generates several of them.
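One way a prefix can keep repeated outputs apart is to append a numeric counter to it. The sketch below is an assumption about naming, not the node's confirmed behavior: `next_audio_path`, the five-digit counter, and the `.mp3` extension are all illustrative.

```python
import os
import re


def next_audio_path(output_dir, filename_prefix="comfyUI", ext=".mp3"):
    """Return output_dir/<prefix>_<NNNNN><ext> using the next unused counter."""
    os.makedirs(output_dir, exist_ok=True)
    # Match only files that belong to this prefix, e.g. comfyUI_00003.mp3
    pattern = re.compile(rf"^{re.escape(filename_prefix)}_(\d+){re.escape(ext)}$")
    used = [
        int(m.group(1))
        for name in os.listdir(output_dir)
        if (m := pattern.match(name))
    ]
    counter = max(used, default=0) + 1
    return os.path.join(output_dir, f"{filename_prefix}_{counter:05d}{ext}")
```

With this scheme, each prefix gets its own counter, so files from different projects in the same directory do not collide.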

text

The text parameter is a multiline string containing the content to convert into speech. It is the node's core input, since it determines what the generated audio says. There are no specific minimum or maximum values, but longer or more complex text may take longer to process.

微软文本转语音 Output Parameters:

音频地址

The 音频地址 (audio address) output is a string containing the file path of the generated audio file, including the directory and filename. Use this path to pass the generated speech to other nodes or applications for playback or further processing.
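For readers unfamiliar with how ComfyUI nodes declare their interface, the skeleton below shows how the three inputs and the single string output described above could be expressed using the standard custom-node attributes (`INPUT_TYPES`, `RETURN_TYPES`, `RETURN_NAMES`, `FUNCTION`, `CATEGORY`). It is a sketch, not the ComfyUI_Lam source: the class name `Text2AudioSketch`, the method name `run`, the shortened voice list, and the placeholder body are assumptions.

```python
class Text2AudioSketch:
    """Hypothetical skeleton mirroring the documented inputs and output."""

    # Only the voices named in the text; the real node offers more.
    VOICES = ["zh-CN-XiaoxiaoNeural", "zh-CN-XiaoyiNeural", "zh-TW-HsiaoChenNeural"]

    @classmethod
    def INPUT_TYPES(cls):
        return {
            "required": {
                "voice": (cls.VOICES,),  # a list renders as a dropdown
                "filename_prefix": ("STRING", {"default": "comfyUI"}),
                "text": ("STRING", {"multiline": True}),
            }
        }

    RETURN_TYPES = ("STRING",)
    RETURN_NAMES = ("音频地址",)  # "audio address": path of the saved file
    FUNCTION = "run"
    CATEGORY = "lam"

    def run(self, voice, filename_prefix, text):
        # Placeholder: a real node would synthesize speech and save it here.
        path = f"{filename_prefix}.mp3"
        return (path,)  # ComfyUI expects a tuple matching RETURN_TYPES
```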

微软文本转语音 Usage Tips:

Choose the voice parameter based on the accent and tone your project needs.
Use a descriptive filename_prefix to help organize and identify your audio files, especially when generating multiple outputs.
Keep the length and complexity of the text in mind, as longer texts may take more time to process and convert into audio.

微软文本转语音 Common Errors and Solutions:

Directory not found

Explanation: This error occurs if the specified output directory does not exist.
Solution: The node should create the directory automatically if it does not exist. If the error persists, check that the path is correct and that ComfyUI has permission to write to it.

Invalid voice selection

Explanation: This error arises when a voice not listed in the available options is selected.
Solution: Double-check the list of available voices and ensure that the selected voice is one of the predefined options provided by the node.

Text input is empty

Explanation: This error occurs when no text is provided for conversion.
Solution: Make sure to input the text you wish to convert into speech. The text field should not be left empty to avoid this error.
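The three error conditions above can be checked before any speech is generated. The following is a minimal sketch of such a pre-flight check. The function name `preflight`, the `KNOWN_VOICES` set (limited to the voices named on this page), and the message strings are assumptions for illustration and may not match the node's actual error text.

```python
import os

# Only the voices named on this page; the node's real list is longer.
KNOWN_VOICES = frozenset({
    "zh-CN-XiaoxiaoNeural",
    "zh-CN-XiaoyiNeural",
    "zh-TW-HsiaoChenNeural",
})


def preflight(text, voice, output_dir, voices=KNOWN_VOICES):
    """Return a list of problems; an empty list means the inputs look usable."""
    problems = []
    if not text or not text.strip():
        problems.append("Text input is empty")
    if voice not in voices:
        problems.append(f"Invalid voice selection: {voice}")
    try:
        # Create the directory (and parents) if missing; no error if it exists.
        os.makedirs(output_dir, exist_ok=True)
    except OSError as exc:
        problems.append(f"Directory not found or not writable: {exc}")
    return problems
```

Collecting every problem in one pass, rather than stopping at the first, lets a user fix all inputs before re-running the workflow.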

微软文本转语音 Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_Lam
