Consistent Style Transfer with Unsampling

Controlling latent noise with Unsampling helps dramatically increase consistency in video style transfer.

MV-Adapter | High-Resolution Multi-view Generator

Generate 360-degree views of anything from a single image or description.

Dance Video Transform | Scene Customization & Face Swap

Transform dance videos with scene editing, face-swapping, and motion preservation.

Insert Anything | Reference-Based Image Editing

Insert any subject into images with mask or text guidance.

ComfyUI > Nodes > GeminiOllama ComfyUI Extension > Gemini API

ComfyUI Node: Gemini API

Class Name

GeminiAPI

Category
AI API/Gemini

Author
al-swaiti (Account age: 1214days) Extension
GeminiOllama ComfyUI Extension Latest Updated
2025-04-02 Github Stars
0.05K

Github Ask al-swaiti Current Questions Past Questions

Table of Content

Description
GeminiAPI:
GeminiAPI Input Parameters:
GeminiAPI Output Parameters:
GeminiAPI Usage Tips:
GeminiAPI Common Errors and Solutions:
Related Nodes

How to Install GeminiOllama ComfyUI Extension

Install this extension via the ComfyUI Manager by searching for GeminiOllama ComfyUI Extension

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter GeminiOllama ComfyUI Extension in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Gemini API Description

Facilitates interaction with Gemini generative AI models for creative content generation based on textual prompts and optional image inputs.

Gemini API:

The GeminiAPI node is designed to facilitate interaction with the Gemini generative AI models, allowing you to generate creative content based on textual prompts. This node is particularly beneficial for AI artists and creators who wish to leverage advanced AI models to produce text-based outputs, potentially enhanced with image inputs. By integrating with the Gemini models, the node provides a streamlined way to access powerful AI capabilities, enabling the generation of diverse and contextually rich content. The node's primary function is to process input prompts and optionally images, using specified Gemini models to produce coherent and meaningful text outputs. This makes it an essential tool for those looking to explore AI-driven content creation without needing deep technical expertise.

Gemini API Input Parameters:

prompt

The prompt parameter is a string input that serves as the primary text input for the Gemini model. It is the basis for the content generation process, where the model interprets and expands upon the given text. This parameter can be a single line or multiline text, allowing for detailed and complex prompts. The default value is "What is the meaning of life?", but you can customize it to suit your creative needs. The prompt's content significantly influences the generated output, making it crucial to craft it thoughtfully to achieve the desired results.

gemini_model

The gemini_model parameter allows you to select from a list of available Gemini models, each with unique capabilities and characteristics. The options include "gemini-1.5-pro-latest", "gemini-1.5-pro-exp-0801", "gemini-1.5-flash", "gemini-1.5-flash-exp-0827", and "gemini-1.5-flash-8b-exp-0827". Choosing the right model can impact the style and quality of the generated content, so it's important to select a model that aligns with your creative goals.

stream

The stream parameter is a boolean option that determines whether the content generation should be streamed. When set to True, the output is generated in chunks, which can be useful for real-time applications or when dealing with large outputs. The default value is False, meaning the content is generated in a single batch. Streaming can enhance performance and responsiveness, especially in interactive settings.

image

The image parameter is an optional input that allows you to include an image as part of the content generation process. This can be particularly useful for tasks that require visual context or when you want the generated text to relate to a specific image. The image is processed and converted from a tensor to a format compatible with the Gemini model, adding an extra layer of creativity to the output.

Gemini API Output Parameters:

text

The text output parameter is the generated content produced by the Gemini model based on the provided inputs. It is a string that encapsulates the model's interpretation and expansion of the input prompt, potentially influenced by the optional image input. This output is the primary result of the node's operation, offering a creative and contextually relevant text that can be used in various artistic and content creation applications.

Gemini API Usage Tips:

Experiment with different gemini_model options to find the one that best suits your creative needs, as each model may produce different styles and qualities of text.
Utilize the stream parameter for applications that require real-time feedback or when working with large text outputs to improve performance and responsiveness.
Consider including an image input to add visual context to your prompts, which can lead to more nuanced and contextually rich text outputs.

Gemini API Common Errors and Solutions:

Error: Gemini API key is required

Explanation: This error occurs when the Gemini API key is not provided or is invalid, preventing the node from accessing the Gemini models.
Solution: Ensure that your config.json file contains a valid GEMINI_API_KEY. If the key is missing, obtain it from the Gemini API provider and update the configuration file accordingly.

Error: Invalid model selection

Explanation: This error may arise if an unsupported or incorrect model name is specified in the gemini_model parameter.
Solution: Verify that the model name is one of the supported options: "gemini-1.5-pro-latest", "gemini-1.5-pro-exp-0801", "gemini-1.5-flash", "gemini-1.5-flash-exp-0827", or "gemini-1.5-flash-8b-exp-0827". Correct any typos or unsupported names in the input.

Gemini API Related Nodes

Go back to the extension to check out more related nodes.

GeminiOllama ComfyUI Extension

Table of Content

Description
GeminiAPI:
GeminiAPI Input Parameters:
GeminiAPI Output Parameters:
GeminiAPI Usage Tips:
GeminiAPI Common Errors and Solutions:
Related Nodes

ComfyUI Vid2Vid Dance Transfer

Transfers the motion and style from a source video onto a target image or object.

MMAudio | Video-to-Audio

MMAudio: Advanced video-to-audio model for high-quality audio generation.

Audioreactive Dancers Evolved

Transform your subject with an audioreactive background made of intricate geometries.

AP Workflow 12.0 | Ready-to-Use Complete AI Media Suite

Pre-set all-in-one system for image & video generation, enhancement, and manipulation. Zero setup required.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.