Achieve better control with FLUX-ControlNet-Depth & FLUX-ControlNet-Canny for FLUX.1 [dev].

Janus-Pro | T2I + I2T Model

Janus-Pro: Advanced Text-to-Image and Image-to-Text generation.

Hunyuan Video | Image-Prompt to Video

Convert an image and a text prompt into a dynamic video.

Era3D | ComfyUI 3D Pack

Generate 3D content, from multi-view images to detailed meshes.

ComfyUI > Nodes > ComfyUI-DSD > DSD Image Generator

ComfyUI Node: DSD Image Generator

Class Name

DSDImageGenerator

Category
DSD

Author
irreveloper (Account age: 4039days) Extension
ComfyUI-DSD Latest Updated
2025-03-15 Github Stars
0.04K

Github Ask irreveloper Current Questions Past Questions

Table of Content

Description
DSDImageGenerator:
DSDImageGenerator Input Parameters:
DSDImageGenerator Output Parameters:
DSDImageGenerator Usage Tips:
DSDImageGenerator Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-DSD

Install this extension via the ComfyUI Manager by searching for ComfyUI-DSD

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-DSD in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

DSD Image Generator Description

Generate high-quality visuals from textual prompts using DSD model for AI artists.

DSD Image Generator:

The DSDImageGenerator is a powerful node designed to create images using the DSD model, a sophisticated tool for generating high-quality visuals based on textual prompts. This node is particularly beneficial for AI artists looking to transform their creative ideas into visual art by leveraging advanced deep learning techniques. The DSDImageGenerator allows you to input a variety of parameters, including textual prompts and images, to guide the generation process, ensuring that the output aligns closely with your artistic vision. By adjusting parameters such as guidance scales and inference steps, you can fine-tune the balance between creativity and adherence to the input prompts, resulting in unique and personalized artwork. The node's ability to handle both positive and negative prompts, along with its support for image resizing and guidance, makes it a versatile tool for a wide range of artistic applications.

DSD Image Generator Input Parameters:

dsd_model

The dsd_model parameter specifies the DSD model to be used for image generation. This model is the core engine that interprets the prompts and generates the corresponding images. Selecting the appropriate model is crucial as it directly influences the style and quality of the output.

image

The image parameter allows you to input an initial image that the DSD model will use as a reference or starting point for the generation process. This can help in maintaining certain visual elements or styles from the input image in the generated output.

prompt

The prompt parameter is a multiline string that serves as the primary textual input guiding the image generation. It describes the desired content or theme of the image, and the model uses this information to create visuals that align with the provided description.

negative_prompt

The negative_prompt parameter is an optional multiline string that specifies elements or themes to avoid in the generated image. By providing a negative prompt, you can steer the model away from certain features, ensuring the output better matches your artistic intent. The default value is an empty string.

seed

The seed parameter is an integer that initializes the random number generator, ensuring reproducibility of the generated images. By using the same seed, you can produce identical outputs across different runs. The default value is 0, with a range from 0 to 2147483647.

guidance_scale

The guidance_scale parameter is a float that controls the strength of adherence to the prompt. A higher value results in images that more closely follow the prompt, while a lower value allows for more creative freedom. The default is 3.5, with a range from 0.0 to 20.0.

image_guidance_scale

The image_guidance_scale parameter is a float that determines how much the initial image influences the final output. A higher value means the generated image will retain more features from the input image. The default is 1.0, with a range from 0.0 to 20.0.

text_guidance_scale

The text_guidance_scale parameter is a float that adjusts the influence of the textual prompt on the image generation. Similar to guidance_scale, it balances between following the prompt and allowing creative variations. The default is 1.0, with a range from 0.0 to 20.0.

num_inference_steps

The num_inference_steps parameter is an integer that sets the number of steps the model takes to generate the image. More steps can lead to higher quality images but require more computation time. The default is 28, with a range from 1 to 100.

width

The width parameter is an integer that defines the width of the generated image in pixels. It allows you to specify the desired resolution, with a default of 1024 pixels and a range from 512 to 2048 pixels, adjustable in steps of 64.

height

The height parameter is an integer that sets the height of the generated image in pixels. Like the width, it determines the resolution of the output, with a default of 512 pixels and a range from 512 to 2048 pixels, adjustable in steps of 64.

use_gemini_prompt

The use_gemini_prompt parameter is a boolean that indicates whether to use the Gemini prompt enhancement feature. When enabled, it can enhance the prompt to potentially improve the quality or creativity of the generated image. The default value is False.

resize_params

The resize_params parameter is optional and allows you to specify how the input image should be resized before processing. This can include methods like cropping or padding, and it helps ensure the input image is compatible with the model's requirements.

DSD Image Generator Output Parameters:

image

The image output is the final generated image that results from the DSD model's processing of the input parameters. This image reflects the influence of the prompt, initial image, and various guidance scales, providing a visual representation of your artistic vision.

reference_image

The reference_image output is an optional image that serves as a reference or comparison point. It may include elements from the initial input image or intermediate stages of the generation process, offering insights into how the final image was derived.

seed

The seed output is the integer seed value used during the image generation process. It allows you to reproduce the same image by reusing this seed in future runs, ensuring consistency and repeatability in your creative workflow.

DSD Image Generator Usage Tips:

Experiment with different guidance_scale values to find the right balance between creativity and adherence to the prompt.
Use the seed parameter to reproduce specific images or explore variations by changing the seed value.
Adjust the image_guidance_scale to control how much the initial image influences the final output, especially when working with complex compositions.

DSD Image Generator Common Errors and Solutions:

"Invalid model selected"

Explanation: This error occurs when the specified dsd_model is not recognized or available.
Solution: Ensure that the correct model name is provided and that it is properly loaded in the system.

"Image dimensions out of range"

Explanation: This error indicates that the specified width or height is outside the allowed range.
Solution: Adjust the width and height parameters to fall within the specified limits of 512 to 2048 pixels.

"Negative prompt not applied"

Explanation: This warning suggests that the negative_prompt was not effectively used in the generation process.
Solution: Review the negative_prompt content to ensure it is clear and relevant to the desired output.

DSD Image Generator Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-DSD

Table of Content

Description
DSDImageGenerator:
DSDImageGenerator Input Parameters:
DSDImageGenerator Output Parameters:
DSDImageGenerator Usage Tips:
DSDImageGenerator Common Errors and Solutions:
Related Nodes

FLUX LoRA (RealismLoRA) | Photorealistic Images

Blend FLUX-1 model with FLUX-RealismLoRA for photorealistic AI images

LTX Video | Image+Text to Video

Generates videos from image+text prompts.

Self Forcing | Autoregressive Keyframe-to-Video Generation

SUPER FAST! 5-second video in 45 seconds!

Flux Fill | Inpaint and Outpaint

Official Flux Tools - Flux Fill for Inpainting and Outpainting

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.