ComfyUI Extension: ComfyUI-OpenAI-FM

Repo Name: ComfyUI-OpenAI-FM
Author: fairy-root (Account age: 2256 days)
Nodes: 1
Latest Updated: 2025-05-09
GitHub Stars: 0.03K

How to Install ComfyUI-OpenAI-FM

Install this extension via the ComfyUI Manager by searching for ComfyUI-OpenAI-FM:

  1. Click the Manager button in the main menu.
  2. Click the Custom Nodes Manager button.
  3. Enter ComfyUI-OpenAI-FM in the search bar and install the extension from the results.

After installation, click the Restart button to restart ComfyUI. Then manually refresh your browser to clear the cache and load the updated list of nodes.

ComfyUI-OpenAI-FM Description

ComfyUI-OpenAI-FM integrates OpenAI FM Text-to-Speech into ComfyUI, converting text to speech with a choice of voices and emotional styles for use in audio workflows.

ComfyUI-OpenAI-FM Introduction

The ComfyUI-OpenAI-FM extension connects ComfyUI to the OpenAI FM Text-to-Speech (TTS) service. It converts written text into speech using a variety of voices and emotional styles, all within the ComfyUI environment. Typical uses include voiceovers for animations, narrated audio content, and experiments with different vocal performances.

How ComfyUI-OpenAI-FM Works

At its core, the ComfyUI-OpenAI-FM extension connects to the OpenAI FM API, a service that transforms text into speech. The extension sends your input text to the API, which processes it and returns audio. That audio is then exposed as an AUDIO output within ComfyUI, so downstream nodes can process or save it. The extension also lets you select voices and emotional tones, which are pre-configured in JSON files, so you can match the speech output to the mood or style of your project.
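
As a rough illustration of the final step, the sketch below decodes audio bytes into a structure shaped like ComfyUI's AUDIO type. It assumes the service returns 16-bit PCM WAV data, which the description above does not confirm, and the names `wav_bytes_to_audio` and `make_silent_wav` are hypothetical. In ComfyUI itself the waveform is a torch tensor of shape [batch, channels, samples]; nested lists stand in for it here so the example needs only the standard library.

```python
import io
import wave
from array import array


def wav_bytes_to_audio(wav_bytes: bytes) -> dict:
    """Decode 16-bit PCM WAV bytes into an AUDIO-like dictionary.

    Assumption: the input is 16-bit PCM WAV. Other formats are rejected.
    """
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM WAV")
        channels = wav.getnchannels()
        sample_rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())

    # Samples arrive interleaved: ch0, ch1, ch0, ch1, ...
    samples = array("h")
    samples.frombytes(frames)

    # De-interleave per channel and scale integers to floats in [-1.0, 1.0).
    per_channel = [
        [s / 32768.0 for s in samples[ch::channels]] for ch in range(channels)
    ]
    # The outer list plays the role of the batch dimension.
    return {"waveform": [per_channel], "sample_rate": sample_rate}


def make_silent_wav(sample_rate: int = 24000, num_samples: int = 100) -> bytes:
    """Build a short mono WAV in memory, standing in for an API response."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(sample_rate)
        wav.writeframes(array("h", [0] * num_samples).tobytes())
    return buffer.getvalue()


if __name__ == "__main__":
    audio = wav_bytes_to_audio(make_silent_wav())
    print(audio["sample_rate"], len(audio["waveform"][0][0]))
```

Keeping the decode step separate from the network call makes it possible to test the conversion without contacting the service.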

ComfyUI-OpenAI-FM Features

  • Text-to-Speech Conversion: The extension uses the OpenAI FM API to convert your input text into speech, which suits voiceovers and narration.
  • Voice Selection: You choose a voice from a dropdown menu. The available voices are listed in voices.json, which you can edit to add or change voice options.
  • Vibe Control: Selecting a "vibe" from a dropdown menu sets the emotional tone of the speech. The vibes are defined in vibes.json.
  • ComfyUI AUDIO Output: The generated speech is output as an AUDIO signal compatible with ComfyUI's audio processing pipeline, so it plugs into existing workflows.
  • Audio File Saving: The extension automatically saves each generated audio file to a designated output directory.
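
The two JSON files could feed the dropdowns roughly as follows. This is a minimal sketch, not the extension's actual code: it assumes voices.json holds a flat list of voice names and vibes.json maps each vibe name to a description string, and every function name and sample value here is hypothetical. Check the files shipped with the extension for the real layout before editing them.

```python
import json
import tempfile
from pathlib import Path
from typing import Dict, List

NO_VIBE = "---"  # placeholder entry meaning "no vibe selected"


def load_voice_options(path) -> List[str]:
    """Read voice names from a JSON list (assumed schema)."""
    voices = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(voices, list) or not voices:
        raise ValueError(f"{path} must contain a non-empty JSON list")
    return [str(name) for name in voices]


def load_vibes(path) -> Dict[str, str]:
    """Read a name -> description mapping from a JSON object (assumed schema)."""
    vibes = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(vibes, dict):
        raise ValueError(f"{path} must contain a JSON object")
    return {str(name): str(text) for name, text in vibes.items()}


def vibe_dropdown_options(vibes: Dict[str, str]) -> List[str]:
    """Put the placeholder first so the dropdown defaults to 'no vibe'."""
    return [NO_VIBE] + list(vibes)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        voices_file = Path(tmp) / "voices.json"
        vibes_file = Path(tmp) / "vibes.json"
        # Sample contents invented for the demonstration.
        voices_file.write_text(json.dumps(["voice_a", "voice_b"]), encoding="utf-8")
        vibes_file.write_text(
            json.dumps({"Calm": "Speak slowly and evenly."}), encoding="utf-8"
        )
        print(load_voice_options(voices_file))
        print(vibe_dropdown_options(load_vibes(vibes_file)))
```

Validating the top-level JSON type up front turns a malformed edit into a clear error message instead of a broken dropdown.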

ComfyUI-OpenAI-FM Models

The extension does not mention selectable models. Variation comes instead from its range of voices and emotional styles, which can be thought of as "models" of speech output. Different combinations of voice and vibe produce different vocal performances, each suited to a different context or artistic vision.

What's New with ComfyUI-OpenAI-FM

Recent updates to the extension have improved its usability and functionality:

  1. The "vibe" dropdown now defaults to "---", which appears empty or disabled by default. This change helps prevent accidental selection of a vibe when none is desired.
  2. The logic for generating speech has been refined. If you provide specific text for the vibe, it is used. If not, the extension checks the dropdown selection. If no vibe is selected there either, it falls back to a "Calm" vibe to prevent errors.

These updates make the controls more intuitive and reduce the likelihood of errors during speech generation.
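
The fallback order described in item 2 can be sketched as a small function. This is an illustration under stated assumptions rather than the extension's source: the function name `resolve_vibe`, its parameters, and the idea that vibes are stored as a name-to-text mapping are all hypothetical. Only the "---" placeholder and the "Calm" default come from the update notes.

```python
from typing import Mapping

NO_VIBE = "---"        # dropdown placeholder meaning "nothing selected"
DEFAULT_VIBE = "Calm"  # fallback named in the update notes


def resolve_vibe(custom_text: str, dropdown_choice: str,
                 vibes: Mapping[str, str]) -> str:
    """Pick the vibe text: custom text, then dropdown, then the default."""
    # 1. Custom text wins when it contains anything other than whitespace.
    if custom_text and custom_text.strip():
        return custom_text.strip()
    # 2. Otherwise use the dropdown, unless it still shows the placeholder.
    if dropdown_choice != NO_VIBE and dropdown_choice in vibes:
        return vibes[dropdown_choice]
    # 3. Fall back to the default vibe (or its bare name if it is undefined).
    return vibes.get(DEFAULT_VIBE, DEFAULT_VIBE)


if __name__ == "__main__":
    # Sample vibe texts invented for the demonstration.
    sample_vibes = {"Calm": "Speak slowly and evenly.", "Excited": "Speak with energy."}
    print(resolve_vibe("", "---", sample_vibes))
    print(resolve_vibe("", "Excited", sample_vibes))
    print(resolve_vibe("Whisper softly.", "Excited", sample_vibes))
```

Treating whitespace-only custom text as empty avoids sending a blank instruction when a stray space is left in the field.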

Troubleshooting ComfyUI-OpenAI-FM

If you encounter issues while using the extension, here are some common problems and solutions:

  • Problem: The audio output is not generated.
  • Solution: Ensure that the text input is not empty and that a voice is selected. Check your internet connection, as the extension relies on the OpenAI FM API.
  • Problem: The selected vibe does not apply to the speech.
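  • Solution: Verify that the vibe is correctly selected from the dropdown menu, and that no custom vibe text is provided, since custom text takes precedence over the dropdown. If the dropdown shows "---" and no custom text is given, the extension falls back to the "Calm" vibe. Choose a specific vibe to apply it.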
  • Problem: Audio files are not saving.
  • Solution: Check the output directory permissions. Ensure that the directory is writable. If issues persist, try changing the save location to a different directory.
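
To check the output directory outside of ComfyUI, a short script along these lines can help. It is a generic sketch: the function name `check_output_dir` is hypothetical, and the path you pass in must be your own ComfyUI output location, which this page does not specify.

```python
import tempfile
from pathlib import Path
from typing import Tuple


def check_output_dir(directory) -> Tuple[bool, str]:
    """Return (ok, message) after trying to create and write to a directory."""
    path = Path(directory)
    try:
        # Create the directory if it is missing; fails if the path is a file.
        path.mkdir(parents=True, exist_ok=True)
        # Writing a real temporary file is more reliable than reading
        # permission bits, which can mislead on network or mounted drives.
        with tempfile.NamedTemporaryFile(dir=path, suffix=".tmp") as probe:
            probe.write(b"probe")
    except OSError as exc:
        return False, f"cannot write to {path}: {exc}"
    return True, f"{path} is writable"


if __name__ == "__main__":
    # Demonstration against a throwaway directory; substitute your own path.
    with tempfile.TemporaryDirectory() as tmp:
        ok, message = check_output_dir(Path(tmp) / "audio")
        print(ok, message)
```

The probe file is deleted automatically when the `with` block exits, so the check leaves nothing behind in the output directory.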

Learn More about ComfyUI-OpenAI-FM

To explore the extension further, visit the OpenAI FM Text-to-Speech website (https://www.openai.fm/) for more information on the underlying technology. The ComfyUI community, through forums or social media, is also a source of practical advice and support from fellow AI artists.
