Load pre-trained BLIP models for image captioning and VQA tasks, simplifying integration into AI art projects.
The BLIP Model Loader node is designed to load pre-trained BLIP (Bootstrapping Language-Image Pre-training) models, which are essential for tasks such as image captioning and visual question answering (VQA). This node simplifies the process of integrating these models into your AI art projects by handling the model loading and configuration, allowing you to focus on creative aspects rather than technical details. By leveraging the BLIP Model Loader, you can easily access state-of-the-art models for generating descriptive captions for images or answering questions based on visual content, enhancing the interactivity and descriptiveness of your AI-generated art.
blip_model: This parameter specifies the identifier of the BLIP model to be loaded for image captioning. The default value is Salesforce/blip-image-captioning-base, a pre-trained model provided by Salesforce. You can also specify other model identifiers if you have different models available. This parameter is crucial because it determines the model's capability to generate descriptive captions for images.
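As a rough illustration, a model with this identifier could be loaded through the Hugging Face transformers library as sketched below; the node performs the equivalent loading internally, and its exact implementation may differ.

```python
# Sketch only: load the default captioning model by its identifier using
# Hugging Face transformers. The node's own loader may work differently.
from transformers import BlipProcessor, BlipForConditionalGeneration

blip_model = "Salesforce/blip-image-captioning-base"  # the node's default value
processor = BlipProcessor.from_pretrained(blip_model)
model = BlipForConditionalGeneration.from_pretrained(blip_model)
```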
vqa_model_id: This parameter defines the identifier of the BLIP model to be used for visual question answering (VQA). The default value is Salesforce/blip-vqa-base, another pre-trained model from Salesforce. As with the blip_model parameter, you can specify other model identifiers if needed. This parameter is essential for enabling the model to answer questions based on the visual content of images.
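A similar hedged sketch for the VQA model, again assuming the Hugging Face transformers API rather than the node's internal code:

```python
# Sketch only: load the default VQA model by its identifier.
from transformers import BlipProcessor, BlipForQuestionAnswering

vqa_model_id = "Salesforce/blip-vqa-base"  # the node's default value
vqa_processor = BlipProcessor.from_pretrained(vqa_model_id)
vqa_model = BlipForQuestionAnswering.from_pretrained(vqa_model_id)
```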
device: This parameter indicates the device on which the model will be loaded and executed. The available options are cuda and cpu. Using cuda leverages GPU acceleration, which can significantly speed up model inference, while cpu uses the central processing unit. The choice of device affects the performance and speed of the model, with cuda being preferable for faster processing if a compatible GPU is available.
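A minimal sketch of the usual device-selection pattern, assuming PyTorch and transformers are installed; the node makes this choice for you based on the device parameter:

```python
# Sketch only: pick cuda when a compatible GPU is available, otherwise cpu,
# and move the model to that device before inference.
import torch
from transformers import BlipForConditionalGeneration

device = "cuda" if torch.cuda.is_available() else "cpu"
model = BlipForConditionalGeneration.from_pretrained(
    "Salesforce/blip-image-captioning-base"
).to(device)
```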
BLIP_MODEL: The output of this node is a loaded BLIP model, encapsulated in a BLIP_MODEL object. This object contains the pre-trained model ready for tasks such as image captioning and visual question answering. The BLIP_MODEL output is essential for subsequent nodes that will utilize the model to generate captions or answer questions based on images, providing a seamless integration into your AI art workflow.
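To illustrate what downstream nodes do with this output, the following sketch generates a caption and answers a question using transformers directly; the image path and question are hypothetical placeholders, and the actual nodes wrap these steps for you.

```python
# Sketch only: use loaded BLIP models for captioning and VQA.
# "example.jpg" and the question are hypothetical placeholders.
import torch
from PIL import Image
from transformers import (
    BlipProcessor,
    BlipForConditionalGeneration,
    BlipForQuestionAnswering,
)

device = "cuda" if torch.cuda.is_available() else "cpu"
image = Image.open("example.jpg").convert("RGB")

# Image captioning
cap_processor = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
cap_model = BlipForConditionalGeneration.from_pretrained(
    "Salesforce/blip-image-captioning-base"
).to(device)
cap_inputs = cap_processor(images=image, return_tensors="pt").to(device)
caption_ids = cap_model.generate(**cap_inputs, max_new_tokens=30)
print(cap_processor.decode(caption_ids[0], skip_special_tokens=True))

# Visual question answering
vqa_processor = BlipProcessor.from_pretrained("Salesforce/blip-vqa-base")
vqa_model = BlipForQuestionAnswering.from_pretrained("Salesforce/blip-vqa-base").to(device)
vqa_inputs = vqa_processor(
    images=image, text="What is in the picture?", return_tensors="pt"
).to(device)
answer_ids = vqa_model.generate(**vqa_inputs)
print(vqa_processor.decode(answer_ids[0], skip_special_tokens=True))
```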
Use the cuda device if you have a compatible GPU, as it will significantly speed up the model's processing time. Experiment with different blip_model and vqa_model_id parameters to find the best fit for your specific use case, whether it be image captioning or visual question answering.

Common errors include RuntimeError: checkpoint url or path is invalid, AssertionError: len(msg.missing_keys)==0, and RuntimeError: CUDA out of memory. The first two typically point to a model identifier or checkpoint path that does not resolve to a valid, matching BLIP checkpoint, so verify the blip_model and vqa_model_id values. For the out-of-memory error, switch to the cpu device if GPU memory is insufficient. Alternatively, try freeing up GPU memory by closing other applications or processes that are using the GPU.
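Outside of ComfyUI, a simple way to guard against the out-of-memory error in your own scripts is to attempt GPU placement and fall back to the cpu device; a hedged sketch:

```python
# Sketch only: try to place the model on the GPU and fall back to the CPU
# if CUDA reports insufficient memory (raised as a RuntimeError).
from transformers import BlipForConditionalGeneration

model = BlipForConditionalGeneration.from_pretrained(
    "Salesforce/blip-image-captioning-base"
)
try:
    model = model.to("cuda")
except RuntimeError:
    model = model.to("cpu")
```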