ComfyUI Node: Gemini flash

Class Name

Gemini_Flash

Category
Gemini flash

Author
ShmuelRonen (Account age: 1490 days)

Extension
ComfyUI_Gemini_Flash

Latest Updated
2025-03-20

Github Stars
0.03K

Table of Contents

Description
Gemini flash:
Gemini flash Input Parameters:
Gemini flash Output Parameters:
Gemini flash Usage Tips:
Gemini flash Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_Gemini_Flash

Install this extension via the ComfyUI Manager by searching for ComfyUI_Gemini_Flash:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI_Gemini_Flash in the search bar.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Gemini flash Description

A node that uses the Gemini generative AI model to produce detailed text prompts from an image or from text alone.

Gemini flash:

Gemini_Flash generates detailed text prompts from image analysis or from purely textual input. It passes the prompt, and optionally an image, to the Gemini generative AI model and returns the model's response as text. The result can be used for creative and analytical purposes, such as detailed image descriptions or txt2img prompts. The node requires minimal configuration: a prompt, an API key, and an image when vision is enabled.

Gemini flash Input Parameters:

prompt

The prompt parameter is a string input that serves as the initial text or query for the generative model and sets the context for its output. The default value is "Analyze the image and make a txt2img detailed prompt. no prefix!". Multiline input is supported, so longer and more detailed prompts can be entered.

vision

The vision parameter is a boolean input that determines whether the node should consider visual data (images) in addition to the text prompt. When set to True, the node expects an image input and uses it to generate a more contextually rich prompt. The default value is True. This parameter significantly impacts the node's execution, as enabling vision allows for image analysis, while disabling it makes the node function purely as a text-based generative model.

api_key

The api_key parameter is a string input used to authenticate with the Gemini API. The default value is an empty string; if it is left empty, the node attempts to use the API key from the configuration file. A valid key must be available from one of these two sources for the node to function.
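The fallback from the input parameter to the configuration file can be pictured with a minimal Python sketch. It is not the extension's actual code: the function name resolve_api_key, the JSON file format, and the GEMINI_API_KEY field name are all assumptions made for illustration.

```python
import json
from pathlib import Path


def resolve_api_key(input_key: str, config_path: Path) -> str:
    """Return the key from the node input, else from a JSON config file.

    Hypothetical helper: the config layout {"GEMINI_API_KEY": "..."} is an
    assumption, not taken from the extension.
    """
    key = (input_key or "").strip()
    if key:
        # The node input takes precedence over the configuration file.
        return key
    if config_path.is_file():
        with config_path.open(encoding="utf-8") as fh:
            config = json.load(fh)
        key = str(config.get("GEMINI_API_KEY", "")).strip()
    if not key:
        # Mirrors the documented "API key is required" error.
        raise ValueError("API key is required")
    return key
```

With this ordering, a key typed into the node always wins, and the configuration file is only consulted when the input is empty.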

proxy

The proxy parameter is a string input that specifies the proxy server to be used for network requests. If not provided, the node will use the proxy settings from the configuration file. This parameter is useful for users who need to route their API requests through a specific network path. The default value is an empty string.
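One common way to route a Python process's HTTP traffic through a proxy is to set the HTTP_PROXY and HTTPS_PROXY environment variables, which many HTTP clients honor. The sketch below assumes that approach; whether the node applies the proxy this way is not confirmed by this page, and the helper name apply_proxy and the example address are hypothetical.

```python
import os


def apply_proxy(input_proxy: str, config_proxy: str = "") -> str:
    """Pick the proxy from the node input, else from the config value.

    Hypothetical helper: setting environment variables is an assumed
    mechanism, not necessarily what the extension does.
    """
    proxy = (input_proxy or "").strip() or (config_proxy or "").strip()
    if proxy:
        # Many Python HTTP clients read these variables for outgoing requests.
        os.environ["HTTP_PROXY"] = proxy
        os.environ["HTTPS_PROXY"] = proxy
    return proxy


if __name__ == "__main__":
    # Example address only; replace with your own proxy.
    print(apply_proxy("http://127.0.0.1:7890"))
```

An empty input and an empty config value leave the environment untouched, so requests go out directly.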

image

The image parameter is an optional input that accepts an image tensor. This image is analyzed by the generative model when the vision parameter is set to True. Providing an image allows the node to generate prompts that are highly relevant to the visual content, enhancing the quality and specificity of the output.
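For readers who want to see how inputs like these are usually declared, here is a minimal sketch of a ComfyUI custom node skeleton using the standard INPUT_TYPES convention. It is inferred from the parameter descriptions above and is not the extension's source: the class name GeminiFlashSketch, the method name generate, and the placement of each input under "required" or "optional" are assumptions.

```python
class GeminiFlashSketch:
    """Illustrative node skeleton; not the extension's actual source."""

    @classmethod
    def INPUT_TYPES(cls):
        return {
            "required": {
                # Multiline string with the documented default prompt.
                "prompt": ("STRING", {
                    "default": "Analyze the image and make a txt2img detailed prompt. no prefix!",
                    "multiline": True,
                }),
                "vision": ("BOOLEAN", {"default": True}),
                "api_key": ("STRING", {"default": ""}),
                "proxy": ("STRING", {"default": ""}),
            },
            "optional": {
                # Image tensor, only needed when vision is True.
                "image": ("IMAGE",),
            },
        }

    RETURN_TYPES = ("STRING",)
    RETURN_NAMES = ("text",)
    FUNCTION = "generate"
    CATEGORY = "Gemini flash"

    def generate(self, prompt, vision, api_key, proxy, image=None):
        # Placeholder: a real node would call the Gemini API here.
        return (f"[no API call made] {prompt}",)
```

Declaring image as optional is what lets the same node run in text-only mode when vision is set to False.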

Gemini flash Output Parameters:

text

The text output parameter is a string containing the content the generative model produced from the provided prompt and, when vision is enabled, the image. It is typically a detailed prompt or description and can be used directly in creative or analytical applications that accept text.

Gemini flash Usage Tips:

Ensure that your api_key is correctly set either through the input parameter or the configuration file to avoid authentication issues.
When the vision parameter is set to True, provide an image that is relevant to your prompt; the quality of the generated prompt depends on it.
Experiment with different prompts to see how the generative model responds and to find the most effective way to get the desired output.
If you need to use a proxy, make sure the proxy parameter is correctly configured to ensure smooth network communication.

Gemini flash Common Errors and Solutions:

Error: API key is required

Explanation: This error occurs when the api_key parameter is not set, and the node cannot authenticate with the Gemini API.
Solution: Provide a valid API key either through the input parameter or ensure it is correctly set in the configuration file.

Error: gemini-1.5-flash needs image

Explanation: This error occurs when the vision parameter is set to True, but no image is provided.
Solution: Ensure that an image tensor is provided when the vision parameter is enabled.
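The condition behind this error can be expressed as a small check. The sketch below is an illustration only: the function name check_vision_input is hypothetical, and whether the node raises an exception or returns the message as its text output is not stated here, so the sketch simply returns the message.

```python
from typing import Any, Optional


def check_vision_input(vision: bool, image: Optional[Any]) -> Optional[str]:
    """Return the documented error message if vision is on but no image is given.

    Hypothetical helper; returns None when the inputs are consistent.
    """
    if vision and image is None:
        return "gemini-1.5-flash needs image"
    return None
```

Setting vision to False skips the check entirely, which is the other way to resolve the error when no image is available.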

Error: [specific error message from the generative model]

Explanation: This error occurs due to various issues such as network problems, invalid input, or model-specific errors.
Solution: Check the specific error message for details, ensure your inputs are valid, and verify your network connection and proxy settings if applicable.
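A generic pattern for surfacing such errors is to catch the exception around the model call and pass its message through as text. The following is a minimal sketch of that pattern, assuming the call is wrapped in a function; run_safely and call_model are hypothetical names, and the extension may handle errors differently.

```python
from typing import Callable, Tuple


def run_safely(call_model: Callable[[], str]) -> Tuple[str]:
    """Run the model call and turn any exception into an error string.

    Hypothetical wrapper illustrating the pattern, not the node's own code.
    """
    try:
        return (call_model(),)
    except Exception as exc:
        # The underlying message is preserved so it can be inspected.
        return (f"Error: {exc}",)
```

Because the original message is kept after the "Error:" prefix, network failures, invalid inputs, and model-side problems can be told apart by reading the output text.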

Gemini flash Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_Gemini_Flash
