Install this extension via the ComfyUI Manager by searching
for Image-Captioning-in-ComfyUI
1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter Image-Captioning-in-ComfyUI in the search bar
After installation, click the Restart button to
restart ComfyUI. Then, manually
refresh your browser to clear the cache and access
the updated list of nodes.
Visit
ComfyUI Online
for ready-to-use ComfyUI environment
Image-Captioning-in-ComfyUI provides custom nodes for captioning images, preparing them for LoRA training.
Image-Captioning-in-ComfyUI Introduction
Welcome to Image-Captioning-in-ComfyUI! This extension is designed to help AI artists like you easily generate captions for images, making them ready for LoRA (Low-Rank Adaptation) training. By using this extension, you can streamline the process of preparing your image datasets, saving you time and effort. The main features of this extension include custom nodes that integrate seamlessly with ComfyUI, allowing you to load images, generate captions using the WD14 Tagger, and save the captions in a format suitable for training.
How Image-Captioning-in-ComfyUI Works
At its core, Image-Captioning-in-ComfyUI works by automating the process of captioning images. Here’s a simple breakdown of how it functions:
Image Loading: The extension starts by loading images from a specified folder. This is done using the "LoRA Caption Load" node.
Caption Generation: The loaded images are then passed to the WD14 Tagger, which is responsible for generating captions. Think of the WD14 Tagger as a tool that "looks" at each image and describes it in words.
Caption Saving: Finally, the generated captions are saved as text files using the "LoRA Caption Save" node. Each text file is named after the corresponding image and contains the description generated by the tagger.
By following these steps, the extension ensures that your images are properly captioned and ready for LoRA training.
Image-Captioning-in-ComfyUI Features
LoRA Caption Load Node
Function: Loads images from a specified folder and retrieves their filenames.
Customization: You can specify the folder path where your images are stored. Ensure all images are in PNG format (this requirement may change in future versions).
WD14 Tagger
Function: Generates captions for the images. This is a mandatory component as it performs the actual captioning.
Customization: You need to download a model for the tagger. More information can be found on the WD14 Tagger's GitHub page.
LoRA Caption Save Node
Function: Saves the generated captions as text files, with each file named after the corresponding image.
Customization: You can add a common prefix to the captions using a widget. This is useful for creating trigger words for your LoRA training. Ensure the prefix ends with a comma.
Optional Jjk Custom Nodes
Function: Allows you to see the names of the files being processed. This is optional but can be helpful for verification.
Image-Captioning-in-ComfyUI Models
Currently, the extension relies on the WD14 Tagger model for generating captions. This model is particularly effective for anime images, though it may also work for realistic images. The author notes that while the tagger is optimized for anime, it should still perform reasonably well with other types of images.
Troubleshooting Image-Captioning-in-ComfyUI
Here are some common issues you might encounter and how to resolve them:
Out of Range Error
Problem: This error occurs if the text files already exist.
Solution: Ensure that the text files do not already exist in the folder. If they do, delete them and relaunch the workflow.
Image Format Issues
Problem: The extension currently requires images to be in PNG format.
Solution: Convert your images to PNG format before loading them into the extension.
Prefix Widget Issues
Problem: The prefix widget requires the prefix to end with a comma.
Solution: Make sure to add a comma at the end of your prefix in the widget.
Learn More about Image-Captioning-in-ComfyUI
To further enhance your experience with Image-Captioning-in-ComfyUI, here are some additional resources:
ComfyUI Documentation: To understand more about how ComfyUI works and how to integrate other custom nodes.
Community Forums: Join forums and communities where you can ask questions, share your experiences, and get support from other AI artists.
By leveraging these resources, you can make the most out of Image-Captioning-in-ComfyUI and enhance your AI art projects. Happy captioning!