Visit ComfyUI Online for ready-to-use ComfyUI environment
Facilitates interaction between batch of images and OpenAI language models for generating text responses.
The DataSet_OpenAIChatImageBatch
node is designed to facilitate the interaction between a batch of images and OpenAI's language models, such as GPT-4 and GPT-3.5-turbo. This node allows you to send multiple images along with a textual prompt to the OpenAI API, which then generates responses based on the provided inputs. This can be particularly useful for AI artists who want to generate descriptive text, captions, or any other form of textual content related to a set of images. The node processes each image individually, converts it to a base64-encoded string, and sends it to the OpenAI model specified by the user. The responses are then collected and returned as a list of strings, making it easy to handle multiple outputs in a single operation.
This parameter accepts a list of images that you want to process. Each image in the list will be converted to a base64-encoded string and sent to the OpenAI API for analysis. The quality and content of the images can significantly impact the generated responses.
This parameter specifies the level of detail for the image analysis. It can be set to either "low" or "high", with "high" being the default. Higher detail levels may provide more nuanced and detailed responses but could also require more processing time.
This is a textual prompt that will be sent along with each image to the OpenAI API. The prompt can be multiline and is used to guide the model in generating the desired response. The default value is an empty string, allowing you to customize the prompt as needed.
This parameter allows you to select the OpenAI model to be used for generating responses. Available options include "gpt-4o", "gpt-4", "gpt-4-32k", "gpt-3.5-turbo", "gpt-4-0125-preview", "gpt-4-turbo-preview", "gpt-4-1106-preview", and "gpt-4-0613". The default model is "gpt-4o".
This is the URL of the OpenAI API endpoint. The default value is "https://api.openai.com/v1". You can change this if you are using a different endpoint or a custom deployment of the OpenAI API.
This parameter is your OpenAI API key, which is required to authenticate and authorize your requests to the OpenAI API. It is a mandatory field and must be provided for the node to function.
This parameter specifies the maximum number of tokens (words or word pieces) that the model should generate in the response. The default value is 1024, but you can adjust this based on your needs and the complexity of the responses you expect.
The output is a list of strings, where each string is a response generated by the OpenAI model for the corresponding image and prompt. These responses can be used for various purposes, such as generating captions, descriptions, or any other textual content related to the images.
<specific error message>
© Copyright 2024 RunComfy. All Rights Reserved.