ComfyUI > Nodes > florence_dw > Florence2

ComfyUI Node: Florence2

Class Name

Florence2

Category
Florence2
Author
yiwangsimple (Account age: 781days)
Extension
florence_dw
Latest Updated
2024-10-21
Github Stars
0.04K

How to Install florence_dw

Install this extension via the ComfyUI Manager by searching for florence_dw
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter florence_dw in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • High-speed GPU machines
  • 200+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 50+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

Florence2 Description

Sophisticated image processing node with advanced AI models for blending visual and textual creativity in AI art projects.

Florence2:

Florence2 is a sophisticated node designed to process images using advanced AI models, specifically tailored for tasks that involve image analysis and text input. It leverages the Florence2 model, which is known for its robust capabilities in understanding and generating visual content. This node is particularly beneficial for AI artists who wish to integrate complex image processing tasks with textual data, enabling a seamless blend of visual and textual creativity. The primary goal of Florence2 is to provide a flexible and powerful tool that can handle a variety of tasks, from generating new image content to enhancing existing images based on textual descriptions. By utilizing this node, you can achieve high-quality results in your creative projects, making it an essential component in your AI art toolkit.

Florence2 Input Parameters:

FLORENCE2

This parameter represents the Florence2 model and its associated components, including the model itself, the processor, the version, and the device on which it runs. It is crucial for the node's operation as it dictates the model's behavior and performance.

image

The image parameter is the input image that you wish to process. It should be provided in a format compatible with the node, typically as a tensor. This image serves as the base for any transformations or analyses performed by the node.

task

The task parameter specifies the type of operation you want the node to perform on the input image. It offers various options, each tailored to different image processing needs, allowing you to customize the node's functionality to suit your specific project requirements.

text_input

This parameter allows you to input a string of text that can guide or influence the image processing task. It is particularly useful for tasks that involve generating or modifying images based on textual descriptions, providing a way to integrate language into your visual projects.

max_new_tokens

max_new_tokens determines the maximum number of new tokens that can be generated during the processing task. It has a default value of 1024 and can be adjusted to control the extent of content generation, impacting the detail and complexity of the output.

num_beams

The num_beams parameter controls the number of beams used in the beam search algorithm during processing. It has a default value of 3, with a minimum of 1, and affects the diversity and quality of the generated content by exploring multiple possibilities.

do_sample

This boolean parameter, with a default value of False, indicates whether sampling should be used during the generation process. Enabling sampling can introduce variability and creativity into the output, making it more dynamic and less deterministic.

fill_mask

Another boolean parameter, fill_mask, determines whether masked tokens in the input should be filled during processing. With a default value of False, enabling this option allows for more complex and context-aware content generation.

Florence2 Output Parameters:

IMAGE

The IMAGE output is the processed image resulting from the node's operations. It reflects the transformations and enhancements applied based on the input parameters, providing a visual representation of the node's capabilities.

STRING

This output provides a string representation of the results, offering a textual summary or description of the processing task's outcome. It can be useful for understanding the context or details of the generated content.

F_BBOXES

F_BBOXES represents the bounding boxes or other relevant data structures that may be generated during the image processing task. This output is essential for tasks that involve object detection or spatial analysis within the image.

Florence2 Usage Tips:

  • Experiment with different task options to explore the full range of capabilities offered by the Florence2 node, tailoring the output to your specific creative needs.
  • Adjust the num_beams and do_sample parameters to balance between deterministic and creative outputs, depending on whether you prefer consistency or variability in your results.
  • Utilize the text_input parameter to guide the image processing task with descriptive language, enhancing the node's ability to generate contextually relevant content.

Florence2 Common Errors and Solutions:

Model not found

  • Explanation: This error occurs when the specified Florence2 model version is not available or has not been downloaded.
  • Solution: Ensure that the correct model version is specified and that it has been successfully downloaded to the appropriate directory.

CUDA out of memory

  • Explanation: This error indicates that the GPU does not have enough memory to process the current task.
  • Solution: Try reducing the image size or complexity, or adjust the max_new_tokens and num_beams parameters to lower values to decrease memory usage.

Invalid input type

  • Explanation: This error arises when the input parameters do not match the expected types or formats.
  • Solution: Verify that all input parameters are correctly formatted and compatible with the node's requirements, particularly the image and text_input parameters.

Florence2 Related Nodes

Go back to the extension to check out more related nodes.
florence_dw
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.