UNO | Consistent Subject & Object Generation

Create stable and consistent images from subject and object references.

Wan 2.1 FLF2V | First-Last Frame Video

Generate smooth videos from a start and end frame using Wan 2.1 FLF2V.

Trellis | Image to 3D

Trellis is an advanced Image-to-3D model for high-quality 3D assets generation.

VACE 14B: All-in-One Video Creation & Editing

Create, edit and transform videos with the powerful VACE Wan2.1 14B.

ComfyUI > Nodes > ComfyUI-Molmo > Molmo 7B D bnb 4bit

ComfyUI Node: Molmo 7B D bnb 4bit

Class Name

Molmo7BDbnb

Category
Molmo

Author
CY-CHENYUE (Account age: 482days) Extension
ComfyUI-Molmo Latest Updated
2024-10-14 Github Stars
0.12K

Github Ask CY-CHENYUE Current Questions Past Questions

Table of Content

Description
Molmo7BDbnb:
Molmo7BDbnb Input Parameters:
Molmo7BDbnb Output Parameters:
Molmo7BDbnb Usage Tips:
Molmo7BDbnb Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-Molmo

Install this extension via the ComfyUI Manager by searching for ComfyUI-Molmo

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-Molmo in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Molmo 7B D bnb 4bit Description

AI image description and analysis generator with customizable outputs for creative projects.

Molmo7BDbnb:

The Molmo7BDbnb node is designed to generate detailed descriptions or analyses of images using advanced AI models. It leverages a sophisticated model repository, cyan2k/molmo-7B-D-bnb-4bit, to process images and produce textual outputs that can either describe the image in detail or provide a comprehensive analysis. This node is particularly beneficial for AI artists and creators who wish to automate the process of generating descriptive content for their visual works. By utilizing this node, you can enhance your creative projects with rich, AI-generated narratives that capture the essence and intricacies of your images. The node is designed to be user-friendly, with customizable parameters that allow you to tailor the output to your specific needs, making it a versatile tool in the realm of AI-driven art and content creation.

Molmo7BDbnb Input Parameters:

image

This parameter represents the image that you want to analyze or describe. It is a required input and serves as the primary subject for the node's processing capabilities.

prompt_type

This parameter allows you to choose between predefined prompts: "Describe" or "Detailed Analysis". "Describe" generates a general description of the image, while "Detailed Analysis" provides a more in-depth examination. This selection influences the style and depth of the generated text.

custom_prompt

A string input that allows you to provide a custom prompt. If specified, this will override the prompt_type selection, giving you full control over the direction and focus of the generated content. It supports multiline text and has a default empty value.

seed

An integer value used to initialize the random number generator, ensuring reproducibility of results. The default is 0, with a range from 0 to 2³² - 1. Adjusting this can lead to different outputs for the same input, providing variability in the generated text.

max_new_tokens

This integer parameter sets the maximum number of new tokens (words or word pieces) to generate. It defaults to 350, with a minimum of 1 and a maximum of 1000. This controls the length of the generated text, allowing you to produce concise or detailed outputs.

temperature

A float value that influences the randomness of the generated text. With a default of 0.6, it ranges from 0.1 to 1.0. Lower values make the output more deterministic, while higher values introduce more variability and creativity.

top_k

An integer that limits the sampling pool to the top k most probable tokens. The default is 40, with a range from 1 to 100. This parameter helps in controlling the diversity of the generated text by focusing on the most likely options.

top_p

A float parameter that implements nucleus sampling, where only the most probable tokens with a cumulative probability of p are considered. It defaults to 0.9, with a range from 0.1 to 1.0. This allows for a balance between diversity and coherence in the output.

unload_model_after_generation

A boolean parameter that determines whether the model should be unloaded from memory after generating the text. The default is True, which helps in managing system resources efficiently, especially in environments with limited memory.

Molmo7BDbnb Output Parameters:

STRING

The output is a string that contains the generated text based on the input image and selected or custom prompt. This text can be a description or a detailed analysis, depending on the input parameters. It serves as a narrative or analytical content that can be used in various creative or documentation contexts.

Molmo7BDbnb Usage Tips:

To achieve a more creative and varied output, consider increasing the temperature parameter while keeping top_k and top_p at moderate levels.
Use the custom_prompt to guide the model towards specific themes or styles that align with your artistic vision, especially when the predefined prompts do not fully capture your intent.
If you are working with a series of images and need consistent outputs, set a fixed seed value to ensure reproducibility across different runs.

Molmo7BDbnb Common Errors and Solutions:

Model and processor have been unloaded, and CUDA cache has been cleared.

Explanation: This message indicates that the model and processor have been successfully unloaded from memory, and the CUDA cache has been cleared to free up resources.
Solution: This is an informational message, not an error. If you encounter issues with model loading afterward, ensure that your system has sufficient resources and that the model is correctly reloaded before the next operation.

CUDA out of memory

Explanation: This error occurs when the GPU does not have enough memory to load the model or process the image.
Solution: Try reducing the image size or complexity, or consider using a system with more GPU memory. Alternatively, ensure that unload_model_after_generation is set to True to free up memory after each operation.

Molmo 7B D bnb 4bit Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-Molmo

Table of Content

Description
Molmo7BDbnb:
Molmo7BDbnb Input Parameters:
Molmo7BDbnb Output Parameters:
Molmo7BDbnb Usage Tips:
Molmo7BDbnb Common Errors and Solutions:
Related Nodes

FLUX Dev ControlNet | Multi-Condition ControlNet

Controlled FLUX Dev image generation with Pose, Depth, Canny, and ReColor

LTX Video | Image+Text to Video

Generates videos from image+text prompts.

ReActor | Fast Face Swap

Professional face swapping toolkit for ComfyUI that enables natural face replacement and enhancement.

ACE++ Face Swap ｜ Image Editing

Swap faces in images with natural language instructions while preserving style and context.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.