IC-Light | Video Relighting | AnimateDiff

Relight your videos with light maps and prompts

Pyramid Flow | Video Generation

Including both text-to-video and image-to-video mode.

Wan 2.1 FLF2V | First-Last Frame Video

Generate smooth videos from a start and end frame using Wan 2.1 FLF2V.

Step1X-Edit | AI Image Editing Tool

Perform 11 editing operations with natural language in Step1X-Edit.

ComfyUI > Nodes > Bjornulf_custom_nodes > 🦙👁 Ollama Vision

ComfyUI Node: 🦙👁 Ollama Vision

Class Name

Bjornulf_OllamaImageVision

Category
Bjornulf

Author
justUmen (Account age: 3073days) Extension
Bjornulf_custom_nodes Latest Updated
2025-03-30 Github Stars
0.29K

Github Ask justUmen Current Questions Past Questions

Table of Content

Description
Bjornulf_OllamaImageVision:
Bjornulf_OllamaImageVision Input Parameters:
Bjornulf_OllamaImageVision Output Parameters:
Bjornulf_OllamaImageVision Usage Tips:
Bjornulf_OllamaImageVision Common Errors and Solutions:
Related Nodes

How to Install Bjornulf_custom_nodes

Install this extension via the ComfyUI Manager by searching for Bjornulf_custom_nodes

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter Bjornulf_custom_nodes in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

🦙👁 Ollama Vision Description

Image analysis node for detailed visual insights and scene interpretation using advanced processing techniques.

🦙👁 Ollama Vision:

The Bjornulf_OllamaImageVision node is designed to provide a comprehensive analysis of images by extracting and summarizing various aspects such as the main content, intricate details, characters, objects, and the overall semantic context. This node is particularly beneficial for AI artists and designers who wish to gain deeper insights into the visual elements of an image, enabling them to understand and interpret the scene more effectively. By leveraging advanced image processing techniques, the node can describe scenes thoroughly, capturing colors, textures, and significant actions, as well as analyzing the mood, environment, and implied meanings. This makes it an invaluable tool for those looking to enhance their creative projects with detailed visual analysis.

🦙👁 Ollama Vision Input Parameters:

IMAGE

The IMAGE parameter is the primary input for the node, representing the image that will be analyzed. This parameter is crucial as it determines the content that the node will process and describe. The image should be provided in a compatible format, and its quality and resolution can impact the accuracy and detail of the analysis.

OLLAMA_CONFIG

The OLLAMA_CONFIG parameter allows you to specify configuration settings for the image processing. This optional parameter can be used to customize the behavior of the node, such as adjusting the level of detail in the analysis or focusing on specific aspects of the image. The exact options available depend on the implementation details, which are not specified in the provided context.

output_selection

The output_selection parameter determines which aspects of the image analysis will be included in the output. It is an integer value, with a default setting of 6, that allows you to select specific types of information to be extracted, such as basic summaries, detailed descriptions, or semantic analysis. Adjusting this parameter can help tailor the output to your specific needs.

process_below_output_selection

The process_below_output_selection parameter is a boolean flag that indicates whether the node should process additional information below the selected output level. When set to True, the node will include more detailed analysis beyond the primary output selection, providing a more comprehensive understanding of the image.

🦙👁 Ollama Vision Output Parameters:

output_image

The output_image parameter is the processed image data that results from the node's analysis. This output provides a visual representation of the image with any applied transformations or enhancements, allowing you to see the effects of the node's processing.

output_mask

The output_mask parameter is a mask that highlights specific areas of interest within the image. This output is useful for identifying and isolating particular elements or regions, such as characters or objects, based on the node's analysis. The mask can be used in further image processing or editing tasks.

🦙👁 Ollama Vision Usage Tips:

To achieve the best results, ensure that the input image is of high quality and resolution, as this will enhance the accuracy of the analysis.
Experiment with the output_selection parameter to focus on different aspects of the image, such as detailed descriptions or semantic analysis, depending on your project needs.
Utilize the process_below_output_selection parameter to gain a more in-depth understanding of the image by including additional analysis beyond the primary output.

🦙👁 Ollama Vision Common Errors and Solutions:

Image format not supported

Explanation: The input image is in a format that is not supported by the node, such as MPO.
Solution: Convert the image to a supported format like JPEG or PNG before processing it with the node.

Inconsistent image dimensions

Explanation: The input image sequence contains frames with varying dimensions, which can cause processing issues.
Solution: Ensure that all frames in the image sequence have consistent dimensions before inputting them into the node.

Missing alpha channel

Explanation: The input image lacks an alpha channel, which is required for certain types of analysis.
Solution: Add an alpha channel to the image or use a different image that includes one.

🦙👁 Ollama Vision Related Nodes

Go back to the extension to check out more related nodes.

Bjornulf_custom_nodes

Table of Content

Description
Bjornulf_OllamaImageVision:
Bjornulf_OllamaImageVision Input Parameters:
Bjornulf_OllamaImageVision Output Parameters:
Bjornulf_OllamaImageVision Usage Tips:
Bjornulf_OllamaImageVision Common Errors and Solutions:
Related Nodes

CatVTON | Amazing Virtual Try-On

CatVTON for easy and accurate virtual try-on.

AnimateDiff + ControlNet + AutoMask | Comic Style

Effortlessly restyle videos, converting realistic characters into anime while keeping the original backgrounds intact.

AP Workflow 12.0 | Ready-to-Use Complete AI Media Suite

Pre-set all-in-one system for image & video generation, enhancement, and manipulation. Zero setup required.

ACE++ Face Swap ｜ Image Editing

Swap faces in images with natural language instructions while preserving style and context.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.