Facilitates loading local language models for AI tasks, enabling faster response times and greater control over data.
The LLM_local node is designed to facilitate the loading and utilization of local language models for various AI-driven tasks. This node is particularly beneficial for AI artists who want to leverage the power of large language models (LLMs) without relying on external APIs or cloud services. By using LLM_local, you can load pre-trained models directly from your local environment, ensuring faster response times and greater control over your data. The node supports various configurations, allowing you to fine-tune the model's performance based on your specific needs. Whether you're generating text, creating conversational agents, or performing other language-related tasks, LLM_local provides a robust and flexible solution.
The ckpt_path parameter specifies the file path to the pre-trained model checkpoint that you want to load. This path should point to a valid model file on your local system. The correct model file is essential for the node to function properly, as it contains the necessary data for the language model to operate.
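The input names closely mirror the Llama constructor in llama-cpp-python, so the node plausibly wraps that library; a minimal loading sketch under that assumption (the GGUF path is hypothetical):

```python
from llama_cpp import Llama

# ckpt_path would map to model_path: a local GGUF checkpoint on disk.
llm = Llama(model_path="models/llm/mistral-7b-instruct.Q4_K_M.gguf")
```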
The max_ctx parameter defines the maximum context length that the model can handle. This value determines how much previous text the model will consider when generating new text. A higher value allows the model to take more context into account, which can improve the coherence of the generated text but may also require more computational resources. The default value is typically set to a reasonable balance between performance and resource usage.
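To see what the context window costs in practice, here is a sketch, again assuming a llama-cpp-python backend where max_ctx maps to n_ctx, that measures how much of the window a prompt consumes:

```python
from llama_cpp import Llama

llm = Llama(model_path="models/llm/model.gguf", n_ctx=4096)  # max_ctx -> n_ctx (assumed)

prompt = "Describe a surreal neon landscape in vivid detail."
n_prompt_tokens = len(llm.tokenize(prompt.encode("utf-8")))

# Prompt tokens and generated tokens share the same window, so the
# completion budget is whatever the prompt leaves over.
completion_budget = 4096 - n_prompt_tokens
print(f"{n_prompt_tokens} prompt tokens, {completion_budget} tokens left for output")
```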
The gpu_layers parameter indicates the number of layers in the model that should be processed on the GPU. Utilizing the GPU can significantly speed up the model's performance, especially for larger models. However, the number of layers you can offload to the GPU depends on your hardware capabilities. Adjust this parameter based on your system's GPU memory and processing power.
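A sketch of partial GPU offload under the same llama-cpp-python assumption, where gpu_layers would map to n_gpu_layers:

```python
from llama_cpp import Llama

# n_gpu_layers controls how many transformer layers run on the GPU:
# 0 keeps inference entirely on the CPU, -1 offloads every layer.
llm = Llama(
    model_path="models/llm/model.gguf",
    n_gpu_layers=20,  # partial offload for GPUs with limited VRAM
)
```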
The n_threads parameter sets the number of CPU threads to use for model inference. More threads can improve the model's performance by parallelizing the computation, but it also increases the CPU load. The default value is 8, with a minimum of 1 and a maximum of 100. Adjust this parameter based on your system's CPU capabilities and the desired performance.
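One way to pick a sensible value is to start from the machine's core count and clamp it to the node's documented 1-100 range; a small sketch:

```python
import os

from llama_cpp import Llama

# Start from the number of available cores, clamped to the node's 1-100 range.
n_threads = max(1, min(os.cpu_count() or 1, 100))

llm = Llama(model_path="models/llm/model.gguf", n_threads=n_threads)
```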
The clip_path parameter specifies the file path to the CLIP model, which is used for handling chat formats. This path should point to a valid CLIP model file on your local system. The CLIP model helps in processing and understanding the context of the conversation, enhancing the overall performance of the language model.
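In llama-cpp-python, a CLIP projector file is wired in through a chat handler (as for LLaVA-style checkpoints); if the node follows that pattern, clip_path would be passed roughly like this (both paths are hypothetical):

```python
from llama_cpp import Llama
from llama_cpp.llama_chat_format import Llava15ChatHandler

# The CLIP projector lets the model interpret images referenced in chat messages.
chat_handler = Llava15ChatHandler(clip_model_path="models/llm/mmproj.gguf")

llm = Llama(
    model_path="models/llm/llava-v1.5-7b.Q4_K_M.gguf",
    chat_handler=chat_handler,
    n_ctx=2048,  # multimodal prompts tend to need a larger window
)
```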
The model output parameter returns the loaded language model. This model can be used for various language-related tasks, such as text generation, conversation, and more. The returned model is ready for inference and can be integrated into your AI applications.
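Assuming the output is a llama-cpp-python Llama instance, downstream inference would look something like this:

```python
from llama_cpp import Llama

llm = Llama(model_path="models/llm/model.gguf")  # stand-in for the node's output

# The loaded model handles chat-style generation directly.
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Write a one-line caption for a neon cityscape."}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```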
To get the most out of LLM_local, keep the following tips in mind:
- Ensure the ckpt_path and clip_path parameters point to valid and accessible files on your local system to avoid loading errors.
- Adjust the max_ctx parameter based on the complexity of your tasks. For longer and more coherent text generation, a higher context length is beneficial.
- Use the gpu_layers parameter to offload as many layers as your GPU can handle, improving performance without overloading your system.
- Experiment with the n_threads parameter to find the optimal balance between performance and CPU load, especially if you are running multiple models or tasks simultaneously.

Common causes of errors include:
- The ckpt_path does not point to a valid model file.
- The clip_path does not point to a valid CLIP model file.
- The number of gpu_layers specified exceeds the available GPU memory.
- The n_threads value specified is too high, causing excessive CPU load.
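Most of these failure modes can be caught before the model is loaded. A hypothetical pre-flight helper (not part of the node) that validates the inputs against the constraints described above:

```python
import os

def preflight(ckpt_path: str, clip_path: str, n_threads: int) -> None:
    """Raise early for the most common LLM_local misconfigurations."""
    if not os.path.isfile(ckpt_path):
        raise FileNotFoundError(f"ckpt_path is not a valid model file: {ckpt_path}")
    if clip_path and not os.path.isfile(clip_path):
        raise FileNotFoundError(f"clip_path is not a valid CLIP model file: {clip_path}")
    if not 1 <= n_threads <= 100:
        raise ValueError(f"n_threads must be between 1 and 100, got {n_threads}")
    # gpu_layers vs. available VRAM cannot be checked portably here;
    # if loading fails with out-of-memory errors, lower gpu_layers.
```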