Virtual try-on creating realistic results by capturing garment details and style.

LivePortrait | Animate Portraits | Img2Vid

Animate portraits with facial expressions and motion using a single image and reference video.

Janus-Pro | T2I + I2T Model

Janus-Pro: Advanced Text-to-Image and Image-to-Text generation.

Flux Upscaler - Ultimate 32k | Image Upscaler

Flux Upscaler – Achieve 4k, 8k, 16k, and Ultimate 32k Resolution!

ComfyUI > Nodes > ComfyUI CogVideoX Wrapper

ComfyUI Extension: ComfyUI CogVideoX Wrapper

Repo Name

ComfyUI-CogVideoXWrapper

Author
kijai (Account age: 2467 days) Nodes
View all nodes(12) Latest Updated
2025-02-17 Github Stars
1.46K

Github Ask kijai Current Questions Past Questions

Table of Content

Description
How ComfyUI CogVideoX Wrapper Works
ComfyUI CogVideoX Wrapper Features
ComfyUI CogVideoX Wrapper Models
What's New with ComfyUI CogVideoX Wrapper
Troubleshooting ComfyUI CogVideoX Wrapper
Learn More about ComfyUI CogVideoX Wrapper
Related Nodes

How to Install ComfyUI CogVideoX Wrapper

Install this extension via the ComfyUI Manager by searching for ComfyUI CogVideoX Wrapper

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI CogVideoX Wrapper in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

ComfyUI CogVideoX Wrapper Description

ComfyUI CogVideoX Wrapper integrates CogVideoX models with Diffusers, enabling seamless video generation and manipulation within the ComfyUI framework.

ComfyUI CogVideoX Wrapper Introduction

ComfyUI-CogVideoXWrapper is an extension designed to integrate the powerful capabilities of CogVideoX into the ComfyUI framework. This extension allows AI artists to generate high-quality videos from text prompts, images, or even other videos, making it a versatile tool for creative projects. By leveraging the advanced features of CogVideoX, this wrapper simplifies the process of video generation, enabling users to create stunning visual content with minimal technical effort.

How ComfyUI CogVideoX Wrapper Works

At its core, ComfyUI-CogVideoXWrapper utilizes the CogVideoX models to transform text descriptions, images, or video inputs into new video content. The process involves several key steps:

Input Processing: The extension takes in various forms of input, such as text prompts, images, or existing videos.
Model Application: Depending on the input type, the appropriate CogVideoX model is applied. For instance, text-to-video (T2V) models generate videos from text descriptions, while image-to-video (I2V) models create videos based on a single image.
Video Generation: The selected model processes the input and generates a sequence of video frames, which are then compiled into a final video output. This process is streamlined within the ComfyUI interface, making it accessible even to users without a deep technical background.

ComfyUI CogVideoX Wrapper Features

Text-to-Video (T2V)

Description: Generate videos from text prompts.
Customization: Adjust parameters like prompt, negative prompt, guidance scale, and seed to influence the video output.
Example: A prompt like "A cat playing with a ball of yarn" can generate a short video depicting this scene.

Image-to-Video (I2V)

Description: Create videos starting from a single image.
Customization: Combine the image with text prompts to guide the video generation.
Example: An image of a sunset can be used to generate a video showing the transition from day to night.

Video-to-Video (V2V)

Description: Transform existing videos into new content.
Customization: Apply different styles or effects to the original video.
Example: A simple animation can be enhanced with artistic effects to create a more visually appealing video.

Temporal Tiling

Description: Generate endless videos by seamlessly looping the content.
Customization: Adjust the tiling parameters to control the looping effect.
Example: Create a continuous video of waves crashing on a beach.

ComfyUI CogVideoX Wrapper Models

CogVideoX-2B

Description: A smaller model suitable for generating videos at lower resolutions.
Use Case: Ideal for quick previews or when working with limited computational resources.
Example: Generate a 512x512 resolution video of a dog running in a park.

CogVideoX-5B

Description: A larger model capable of producing high-resolution videos.
Use Case: Best for final outputs where high quality is essential.
Example: Create a 1024x1024 resolution video of a cityscape at night.

CogVideoX-5B-I2V

Description: Specialized for image-to-video tasks.
Use Case: Perfect for transforming static images into dynamic video content.
Example: Turn a photograph of a mountain into a video showing changing weather conditions.

What's New with ComfyUI CogVideoX Wrapper

Update 4

Feature: Initial support for the official I2V version of CogVideoX.
Benefit: Enhanced capabilities for generating videos from images.
Link: CogVideoX-5b-I2V

Update 3

Feature: Added support for CogVideoX-Fun.
Benefit: More flexibility in video generation, including experimental image-to-video capabilities.
Link: CogVideoX-Fun

Update 2

Feature: Experimental support for onediff, reducing sampling time by ~40%.
Benefit: Faster video generation on compatible systems.
Requirements: Linux, torch 2.4.0, onediff, and nexfort.

Initial Update

Feature: Support for the 5b model for basic text-to-video tasks.
Benefit: High-quality video generation from text prompts.
Link: CogVideoX-5b

Troubleshooting ComfyUI CogVideoX Wrapper

Common Issues and Solutions

Issue: Video generation is slow.

Solution: Ensure your system meets the recommended hardware requirements. Consider using the onediff feature for faster sampling.

Issue: Generated videos are not as expected.

Solution: Experiment with different prompts, guidance scales, and seeds. Fine-tuning these parameters can significantly impact the output.

Issue: High VRAM usage.

Solution: Offload some processes to the CPU if possible, and ensure other applications are not consuming excessive VRAM.

Frequently Asked Questions

Q: Can I use this extension on Windows?
A: Yes, the extension supports both Windows and Linux.
Q: What resolutions are supported?
A: The models support resolutions from 256x256 to 1024x1024.
Q: How long can the generated videos be?
A: The videos can be up to 6 seconds long with 8 frames per second.

Learn More about ComfyUI CogVideoX Wrapper

Additional Resources

Tutorials: Explore step-by-step guides on using ComfyUI-CogVideoXWrapper for various tasks.
Documentation: Detailed documentation on all features and settings.
Community Forums: Join discussions with other AI artists and developers to share tips and get support. For more information, visit the ComfyUI-CogVideoXWrapper GitHub repository and the CogVideoX-Fun GitHub repository.

ComfyUI CogVideoX Wrapper Related Nodes

CogVideo Decode

CogVideo DualTextEncode

CogVideo ImageEncode

CogVideo PABConfig

CogVideo Sampler

CogVideo TextEncode

CogVideo TextEncode Combine

CogVideo TransformerEdit

CogVideoXFun Sampler

CogVideoXFun Vid2Vid Sampler

(Down)load CogVideo GGUF Model

(Down)load CogVideo Model

Table of Content

Description
How ComfyUI CogVideoX Wrapper Works
ComfyUI CogVideoX Wrapper Features
ComfyUI CogVideoX Wrapper Models
What's New with ComfyUI CogVideoX Wrapper
Troubleshooting ComfyUI CogVideoX Wrapper
Learn More about ComfyUI CogVideoX Wrapper
Related Nodes

Wan 2.1 Video Restyle | Consistent Video Style Transform

Transform your video style by applying the restyled first frame using Wan 2.1 video restyle workflow.

FLUX IPAdapter V2 | XLabs

Explore XLabs FLUX IPAdapter V2 model compared to V1 for your creative goals.

ComfyUI Vid2Vid Dance Transfer

Transfers the motion and style from a source video onto a target image or object.

LatentSync| Lip Sync Model

Advanced audio-driven lip sync technology.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.