Detect bounding boxes in images from text descriptions using the CLIPSeg model, for precise object segmentation and highlighting.
The CLIPSegDetectorProvider node is designed to detect bounding boxes in images based on textual descriptions. It leverages the CLIPSeg model, which combines CLIP (Contrastive Language-Image Pre-training) with segmentation techniques to identify and segment objects within an image as specified by a text prompt. Given a text description, the node detects and highlights the relevant areas of the image, making it a powerful tool for AI artists who want to automate the process of identifying and isolating specific elements in their artwork. The node also allows fine-tuning through the blur, threshold, and dilation_factor parameters, enabling precise and customized results.
The text parameter is a string that describes the object or area you want to detect in the image. This prompt guides the CLIPSeg model in identifying relevant regions, so keep it concise and clear to ensure accurate detection. This parameter has no default value and must be provided by the user.
The blur parameter is a float that controls the amount of blur applied to the image before segmentation. Blurring smooths out noise and can improve segmentation accuracy. The value ranges from 0 to 15, with a default of 7; adjust it to refine detection results based on the complexity and noise level of the image.
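To illustrate why smoothing helps before segmentation, here is a minimal NumPy sketch of a pre-segmentation blur step. The box_blur helper and the interpretation of blur as a kernel radius are assumptions made for illustration; the node's internal kernel may differ (for example, a Gaussian).

```python
import numpy as np

def box_blur(mask: np.ndarray, blur: float) -> np.ndarray:
    """Smooth a 2-D map with a separable box filter whose radius grows
    with the blur setting (illustrative; not the node's actual kernel)."""
    radius = int(round(blur))
    if radius <= 0:
        return mask.astype(float)
    size = 2 * radius + 1
    padded = np.pad(mask.astype(float), radius, mode="edge")
    kernel = np.ones(size) / size
    # separable box filter: average along rows, then along columns
    rows = np.apply_along_axis(
        lambda r: np.convolve(r, kernel, mode="valid"), 1, padded)
    return np.apply_along_axis(
        lambda c: np.convolve(c, kernel, mode="valid"), 0, rows)

noisy = np.zeros((9, 9))
noisy[4, 4] = 1.0            # a single bright pixel ("noise")
smoothed = box_blur(noisy, blur=2)
print(smoothed.max())        # 0.04: the spike is spread over a 5x5 patch
```

Isolated noise spikes are flattened this way, so a later confidence threshold is less likely to fire on them.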
The threshold parameter is a float that sets the confidence level required for a region to be considered part of the detected object. It ranges from 0 to 1, with a default of 0.4. A higher threshold includes only regions with higher confidence scores, which reduces false positives but may miss some relevant areas.
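The thresholding step amounts to comparing each pixel's confidence score against the cutoff. The apply_threshold helper below is a hypothetical sketch, not the node's actual implementation:

```python
import numpy as np

def apply_threshold(heatmap: np.ndarray, threshold: float = 0.4) -> np.ndarray:
    """Keep only pixels whose confidence meets the threshold,
    producing a binary mask."""
    return (heatmap >= threshold).astype(np.uint8)

scores = np.array([[0.10, 0.50],
                   [0.39, 0.90]])
print(apply_threshold(scores, threshold=0.4))
# [[0 1]
#  [0 1]]  -- 0.39 falls just below the 0.4 default and is dropped
```

Raising the threshold to 0.9 here would keep only the single most confident pixel, which illustrates the false-positive/recall trade-off described above.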
The dilation_factor parameter is an integer that specifies the amount of dilation applied to the detected regions. Dilation expands the detected areas, making them more prominent. The value ranges from 0 to 10, with a default of 4; increase it to cover more area around the detected regions and ensure the entire object is included.
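Morphological dilation can be sketched in plain NumPy as repeatedly growing a binary mask into its 3x3 neighborhood, once per unit of dilation_factor. This is an illustrative assumption about the step; the node may use a different structuring element:

```python
import numpy as np

def dilate(mask: np.ndarray, dilation_factor: int = 4) -> np.ndarray:
    """Expand a binary mask by dilation_factor pixels using a 3x3
    neighborhood (illustrative sketch of morphological dilation)."""
    out = mask.astype(bool)
    h, w = mask.shape
    for _ in range(dilation_factor):
        padded = np.pad(out, 1)
        out = np.zeros_like(out)
        # a pixel turns on if it or any of its 8 neighbors was on
        for dy in (-1, 0, 1):
            for dx in (-1, 0, 1):
                out |= padded[1 + dy:1 + dy + h, 1 + dx:1 + dx + w]
    return out.astype(np.uint8)

m = np.zeros((7, 7), dtype=np.uint8)
m[3, 3] = 1
print(dilate(m, 1).sum())   # 9: the single pixel grows into a 3x3 block
```

Each extra unit of dilation pads roughly one more pixel around the mask, which is why a larger dilation_factor helps when the raw detection clips the edges of an object.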
The BBOX_DETECTOR output is a bounding box detector object containing the regions detected from the text prompt and input parameters. It can be passed to subsequent nodes to further analyze, manipulate, or visualize the detected areas, and it provides structured access to the coordinates and properties of each region, making it easy to integrate with other tools and workflows.
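Conceptually, a bounding box can be derived from the final binary mask by taking the extent of its foreground pixels. The mask_to_bboxes helper below is hypothetical, and it returns a single box over all foreground pixels; the actual BBOX_DETECTOR interface is defined by the Impact Pack and may separate connected components:

```python
import numpy as np

def mask_to_bboxes(mask: np.ndarray):
    """Derive an axis-aligned bounding box (x1, y1, x2, y2) from a
    binary mask. Hypothetical helper for illustration only."""
    ys, xs = np.nonzero(mask)
    if len(xs) == 0:
        return []  # nothing detected
    return [(int(xs.min()), int(ys.min()), int(xs.max()), int(ys.max()))]

m = np.zeros((10, 10), dtype=np.uint8)
m[2:5, 3:8] = 1              # a 3x5 detected region
print(mask_to_bboxes(m))     # [(3, 2, 7, 4)]
```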
Experiment with the blur parameter to find the optimal level of smoothing for your images, especially if they contain a lot of noise. Adjust the threshold parameter to balance between detecting all relevant regions and minimizing false positives. Use dilation_factor to expand the detected regions if you find that the initial detection is too tight around the objects.

Source code: https://github.com/biegert/ComfyUI-CLIPSeg/raw/main/custom_nodes/clipseg.py

An error occurs if the text parameter is not provided or is empty; supply a valid text prompt. An error also occurs if a parameter (blur, threshold, or dilation_factor) is set to a value outside its allowed range: blur should be between 0 and 15, threshold between 0 and 1, and dilation_factor between 0 and 10.

© Copyright 2024 RunComfy. All Rights Reserved.