Encode textual input for AI models using CLIP technology for nuanced conditioning and content generation.
The smZ CLIPTextEncode node is designed to encode textual input into a format that can be used for conditioning in AI models, particularly those built on CLIP (Contrastive Language-Image Pre-Training) technology. The node leverages CLIP to transform text into tokens that guide image generation and other AI-driven tasks. By encoding text this way, it enables nuanced, contextually rich conditioning, improving a model's ability to understand and generate content from textual descriptions. It is particularly useful for AI artists who want to integrate complex textual prompts into their workflows, giving more precise and creative control over the output.
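To make the node's role concrete, here is a hedged sketch of how it might appear in a ComfyUI API-format workflow, written as a Python dictionary. The class name matches the node title; the input names mirror the parameters described below but should be treated as assumptions rather than a verified schema.

```python
# Illustrative sketch only: input names and value formats are assumptions
# based on the parameter descriptions, not a verified workflow export.
node = {
    "class_type": "smZ CLIPTextEncode",
    "inputs": {
        "clip": ["4", 1],  # link to another node's CLIP output: (node id, slot)
        "text": "a watercolor fox, (detailed fur:1.2)",
        "parser": "A1111",
        "mean_normalization": True,
        "multi_conditioning": True,
        "use_old_emphasis_implementation": False,
        "with_SDXL": False,
    },
}

print(node["class_type"])  # smZ CLIPTextEncode
```

The `("node id", slot)` link format is how ComfyUI API workflows reference outputs of upstream nodes, such as the checkpoint loader that provides the CLIP model.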
The clip parameter specifies the CLIP model used to encode the text. It is essential, as it defines the model that will process the textual input and convert it into tokens. The CLIP model is pre-trained and provides the framework for understanding and encoding the text.
The text parameter is the main textual input you want to encode. It can be any string describing the content or context you wish to condition your AI model with. The quality and specificity of the text directly affect the effectiveness of the encoding and the resulting output.
The parser parameter selects how the text input is processed before encoding. It ensures the text is in the correct format and may handle tasks such as tokenization or other preprocessing steps needed for the CLIP model to understand the input.
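As an illustration of the kind of preprocessing a parser performs, the toy function below extracts A1111-style emphasis weights such as "(a cat:1.3)" from a prompt. This is a simplified sketch only, not the actual smZ parser, which supports several parser modes with richer syntax.

```python
import re

def parse_emphasis(text):
    """Toy parser for A1111-style "(phrase:weight)" emphasis syntax.

    Illustrative sketch only; the real smZ parser handles nesting,
    escapes, and multiple parser dialects.
    """
    tokens = []
    pattern = re.compile(r"\(([^:()]+):([\d.]+)\)")
    pos = 0
    for m in pattern.finditer(text):
        # Any plain text before the weighted group gets weight 1.0.
        plain = text[pos:m.start()].strip()
        if plain:
            tokens.append((plain, 1.0))
        tokens.append((m.group(1), float(m.group(2))))
        pos = m.end()
    tail = text[pos:].strip()
    if tail:
        tokens.append((tail, 1.0))
    return tokens

print(parse_emphasis("a photo of (a cat:1.3) in the snow"))
# [('a photo of', 1.0), ('a cat', 1.3), ('in the snow', 1.0)]
```

The resulting (phrase, weight) pairs are what a real implementation would use to scale the corresponding token embeddings before they reach the model.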
The mean_normalization parameter determines whether mean normalization is applied to the encoded tokens. Mean normalization helps standardize token values, which can be beneficial for certain models or tasks.
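One common form of mean normalization, used by A1111-style encoders, rescales the weighted embeddings so their overall mean matches the unweighted mean, preventing emphasis weights from shifting the embedding magnitude. The sketch below illustrates the idea with plain Python lists standing in for CLIP hidden states; it is an assumption about the technique, not the exact smZ code.

```python
def apply_weights_with_mean_norm(embeddings, weights):
    """Scale each token embedding by its emphasis weight, then rescale
    everything so the overall mean matches the unweighted mean.

    Illustrative sketch: lists of floats stand in for real tensors.
    """
    flat = [x for row in embeddings for x in row]
    original_mean = sum(flat) / len(flat)
    # Apply one emphasis weight per token embedding.
    weighted = [[x * w for x in row] for row, w in zip(embeddings, weights)]
    flat_w = [x for row in weighted for x in row]
    scale = original_mean / (sum(flat_w) / len(flat_w))
    return [[x * scale for x in row] for row in weighted]

emb = [[1.0, 2.0], [3.0, 4.0]]   # two token embeddings, 2 dims each
out = apply_weights_with_mean_norm(emb, [1.0, 1.5])
```

After the rescale, the mean of `out` equals the mean of `emb` (2.5 here), even though the second token was emphasized.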
The multi_conditioning parameter allows for multiple conditioning inputs. This is useful when you want to encode and use several different pieces of text simultaneously to guide the AI model.
The use_old_emphasis_implementation parameter specifies whether to use an older implementation of emphasis in the text encoding process. This can matter for compatibility with older models or for specific use cases where the older method is preferred.
The with_SDXL parameter indicates whether to use the SDXL (Stable Diffusion XL) variant of the CLIP model. SDXL is a more advanced version that may offer better performance or additional features.
The ascore parameter is a floating-point aesthetic score. This score can influence the encoding, potentially guiding the model to favor certain aesthetic qualities in the output. The default value is 6.0, with a minimum of 0.0 and a maximum of 1000.0.
The width parameter specifies the width dimension for the encoded output, which matters for models that require specific input dimensions. The default value is 1024, with a minimum of 0 and a maximum defined by the model's capabilities.
The height parameter specifies the height dimension for the encoded output. Like the width, it ensures the encoded tokens fit the input dimensions the model requires. The default value is 1024, with a minimum of 0 and a maximum defined by the model's capabilities.
The crop_w parameter allows for cropping the width of the encoded output. This can be useful for focusing on specific parts of the image or adjusting the output to fit certain requirements.
The crop_h parameter allows for cropping the height of the encoded output. This can help refine the focus of the conditioning or adjust the output dimensions.
The target_width parameter sets the target width for the final encoded output, ensuring the output matches the desired dimensions, which can be crucial for certain applications.
The target_height parameter sets the target height for the final encoded output, ensuring the output matches the desired dimensions, which can be crucial for certain applications.
The text_g parameter is an additional textual input used for global conditioning. It allows for more complex, layered conditioning by providing another piece of text to guide the model.
The text_l parameter is another textual input, used for local conditioning. It can be used in conjunction with text_g to provide more detailed and nuanced conditioning input.
The smZ_steps parameter defines the number of steps used in the encoding process, which can affect the granularity and detail of the encoded output. The default value is 1.
The output of the smZ CLIPTextEncode node is a CONDITIONING value. It represents the encoded textual input in a format that can be used to condition AI models. The conditioning output includes the encoded tokens along with additional information such as pooled outputs or aesthetic scores, which guide the model's behavior and output.
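ComfyUI represents a CONDITIONING value as a list of [tensor, options] pairs, where the options dictionary carries the extras mentioned above. The sketch below uses plain Python lists in place of real torch tensors; the "aesthetic_score" key is an assumption modeled on the ascore parameter.

```python
# Stand-ins for the real torch tensors produced by the encoder.
cond = [[0.1, 0.2, 0.3]]   # token embeddings
pooled = [0.4, 0.5]        # pooled output

# ComfyUI CONDITIONING shape: a list of [tensor, options] pairs.
# "aesthetic_score" here is an assumed key mirroring the ascore input.
conditioning = [
    [cond, {"pooled_output": pooled, "aesthetic_score": 6.0}],
]

# Downstream nodes unpack each pair like this:
tensor, opts = conditioning[0]
print(opts["aesthetic_score"])  # 6.0
```

Because the outer structure is a list, multi_conditioning can append further [tensor, options] pairs, each guiding the sampler alongside the others.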
Experiment with the ascore parameter to see how different aesthetic scores influence the output. Use the multi_conditioning parameter to combine multiple pieces of text for more complex conditioning. Adjust the width and height parameters to match the requirements of your AI model. If the node produces errors, check whether the text parameter is empty or not provided.
© Copyright 2024 RunComfy. All Rights Reserved.