Visit ComfyUI Online for ready-to-use ComfyUI environment
Converts images to text prompts using AI, leveraging CLIP Interrogator for accurate descriptions, with adjustable speed and accuracy modes.
The easy imageInterrogator node is designed to convert images into descriptive text prompts using advanced AI models. This node leverages the power of the CLIP Interrogator to analyze the content of an image and generate a textual description that captures its essence. This can be particularly useful for AI artists who want to generate prompts for text-to-image models or for those who need to catalog and describe their image collections. The node offers various modes to balance between speed and accuracy, making it adaptable to different needs and computational resources.
This parameter accepts the image that you want to convert into a text prompt. The image should be in a format that the node can process, typically a tensor representation of the image. The quality and content of the image will directly impact the generated prompt.
This parameter determines the method used to generate the text prompt. The available options are fast
, classic
, best
, and negative
.
fast
: Prioritizes speed over accuracy, useful for quick results.classic
: Balances between speed and accuracy, providing a good middle ground.best
: Focuses on generating the most accurate and detailed prompts, but may take longer to process.negative
: Generates a prompt that describes what the image is not, useful for certain types of image analysis.
There is no default value, so you must choose one of the options.This boolean parameter allows you to enable or disable low VRAM mode. When set to True
, the node will use less GPU memory, which can be useful if you are working with limited resources. The default value is True
.
This output parameter provides the generated text prompt as a string. The prompt is a descriptive text that captures the essence of the input image, based on the selected mode. This can be used for various purposes, such as generating new images, cataloging, or further analysis.
fast
mode, but be aware that the generated prompts may be less detailed.best
mode, but ensure you have sufficient computational resources as it may take longer to process.negative
mode to generate prompts that describe what the image is not, which can be useful for certain types of image analysis or filtering.use_lowvram
if you are working on a machine with limited GPU memory to avoid running into memory issues.<mode>
fast
, classic
, best
, or negative
.use_lowvram
option to reduce GPU memory usage, or try processing smaller images.clip_interrogator
package or check the model paths.© Copyright 2024 RunComfy. All Rights Reserved.