ComfyUI > Nodes > ComfyUI-PixtralLlamaMolmoVision > Generate Text with Molmo

ComfyUI Node: Generate Text with Molmo

Class Name

MolmoGenerateText

Category
PixtralLlamaVision/Molmo
Author
SeanScripts (Account age: 1678days)
Extension
ComfyUI-PixtralLlamaMolmoVision
Latest Updated
2024-10-05
Github Stars
0.06K

How to Install ComfyUI-PixtralLlamaMolmoVision

Install this extension via the ComfyUI Manager by searching for ComfyUI-PixtralLlamaMolmoVision
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-PixtralLlamaMolmoVision in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • High-speed GPU machines
  • 200+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 50+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

Generate Text with Molmo Description

Generate text using Molmo model, integrating visual cues for coherent and contextually relevant output.

Generate Text with Molmo:

MolmoGenerateText is a powerful node designed to generate text using the Molmo model, which is particularly adept at processing visual and textual inputs. This node allows you to input a series of images along with a textual prompt, enabling the model to generate coherent and contextually relevant text based on the visual content provided. The primary benefit of using MolmoGenerateText is its ability to seamlessly integrate visual cues into text generation, making it ideal for applications that require a deep understanding of both image and text data. This node is especially useful for creative projects where you want to describe images or generate narratives that are informed by visual elements. By leveraging advanced text generation techniques, MolmoGenerateText ensures that the output is not only relevant but also engaging and insightful.

Generate Text with Molmo Input Parameters:

molmo_model

This parameter specifies the vision model to be used for text generation. It is crucial as it determines the model's ability to interpret and generate text based on the provided images and prompts.

images

A list of images that the model will use as input. The number of images should match the number of [IMG] tokens in the prompt. These images provide the visual context necessary for generating relevant text.

system_prompt

A string that serves as an initial prompt or context for the model. It can be multiline and is used to set the stage for the text generation process. The default value is an empty string.

prompt

This is the main textual input that guides the text generation. It should include [IMG] tokens corresponding to the images provided. The default prompt is "Describe this image."

max_new_tokens

An integer that sets the maximum number of new tokens the model can generate. This controls the length of the generated text, with a default of 256 and a range from 1 to 4096.

do_sample

A boolean that determines whether sampling is used during text generation. When set to true, the model will generate more diverse outputs. The default value is true.

temperature

A float that influences the randomness of the text generation. Higher values result in more random outputs, while lower values make the output more deterministic. The default is 0.3, with a minimum of 0.

top_p

A float that sets the cumulative probability threshold for token selection. It helps in controlling the diversity of the generated text. The default value is 0.9, with a range from 0.0 to 1.0.

top_k

An integer that limits the number of highest probability tokens to consider during generation. This parameter helps in focusing the output. The default is 40, with a minimum of 1.

stop_strings

A string that specifies the stopping criteria for text generation. The model will stop generating text when it encounters this string. The default is `

Generate Text with Molmo Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI-PixtralLlamaMolmoVision
RunComfy

© Copyright 2024 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals.