ComfyUI PhotoMakerV2 | Create Realistic Photos

ComfyUI PhotoMakerV2 is a powerful text-to-image generation tool that enables users to create realistic personalized photos efficiently. By inputting identity images and a text prompt, PhotoMakerV2 preserves the likeness of the individuals while allowing flexible control over context, style, and attributes. This latest version offers improved identity fidelity compared to its predecessor. Discover the creative possibilities of generating photorealistic images in different settings, stylizing appearances, and even merging identities.

ComfyUI PhotoMakerV2 Description

What is PhotoMakerV2

PhotoMakerV2, an upgrade from PhotoMaker, offers an efficient method for personalized text-to-image generation. It synthesizes realistic photos of individuals using a few input identity images and a text prompt.

Some key features of PhotoMakerV2 include:

High efficiency: Quickly generates personalized photos.
Excellent identity preservation: Maintains the likeness of input identities.
Flexible text control: Allows specifying context, style, attributes, etc., in the prompt.
Improved identity fidelity: Enhanced compared to PhotoMaker V1.

PhotoMakerV2 generates photorealistic images of a person in various contexts, stylizes appearances, changes attributes like age and gender, merges identities, and modernizes people from old photos or artwork. It unlocks numerous creative possibilities.

How PhotoMakerV2 Works

PhotoMakerV2 encodes one or more input identity images into a "stacked ID embedding," serving as a unified representation encapsulating identity information.

This embedding, combined with a text prompt, feeds into a text-to-image diffusion model. The model then produces an image depicting the embedded identity in the context described by the prompt.

Some key aspects of how it works under the hood:

Uses an identity encoder to extract identity information from input face images
Improves identity preservation by leveraging an external face recognition model (InsightFace)
Encodes multiple identity images into a stacked embedding to capture identity comprehensively
Feeds the stacked ID embedding into the diffusion model's cross-attention layers
Guides generation with the text prompt while adaptively merging the identity information
Trained with an identity-oriented dataset to improve identification capabilities
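
The stacking and injection steps above can be illustrated with a small toy sketch. It is not PhotoMakerV2's actual implementation: the `fuse` function is a plain average standing in for the learned fusion layers, the `<id>` trigger token is hypothetical, and the two-dimensional vectors are made up for readability. It only shows the idea that each identity image contributes one embedding, and that the stack of those embeddings takes the place of the trigger token in the prompt's embedding sequence.

```python
from typing import List

Vector = List[float]


def fuse(id_vec: Vector, trigger_vec: Vector) -> Vector:
    """Stand-in for the learned fusion step: a plain element-wise mean (assumption)."""
    return [(a + b) / 2.0 for a, b in zip(id_vec, trigger_vec)]


def inject_identity(
    tokens: List[str],
    token_vecs: List[Vector],
    trigger: str,
    id_vecs: List[Vector],
) -> List[Vector]:
    """Replace the trigger token's embedding with one fused embedding per identity image."""
    if trigger not in tokens:
        raise ValueError(f"trigger word {trigger!r} not found in prompt")
    pos = tokens.index(trigger)
    # One fused vector per input image: this list is the toy "stacked ID embedding".
    stacked = [fuse(v, token_vecs[pos]) for v in id_vecs]
    return token_vecs[:pos] + stacked + token_vecs[pos + 1:]


# Hypothetical prompt with a placeholder trigger token and made-up embeddings.
tokens = ["photo", "of", "<id>", "smiling"]
token_vecs = [[1.0, 1.0], [2.0, 2.0], [0.0, 1.0], [3.0, 3.0]]
id_vecs = [[1.0, 0.0], [2.0, 2.0], [4.0, 3.0]]  # three identity images

sequence = inject_identity(tokens, token_vecs, "<id>", id_vecs)
print(len(sequence))  # 6: four prompt tokens, minus the trigger, plus three ID vectors
```

In the real model the resulting sequence is what the cross-attention layers attend to, which is why adding more identity images gives the model more identity evidence without changing the prompt text.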

How to Use ComfyUI PhotoMakerV2

To use PhotoMakerV2 in ComfyUI, you work mainly with the "PhotoMaker Encode Plus" node (PhotoMakerEncodePlus). A typical workflow involves the following steps:

Load the PhotoMakerV2 model using the "PhotoMaker Loader Plus" node.
Load one or more identity images using the "Prepare Images For CLIP Vision" node.
Load the InsightFace model required by PhotoMakerV2 using the "PhotoMaker InsightFace Loader" node.
Connect the outputs of these nodes to the corresponding inputs of the "PhotoMaker Encode Plus" node.
In the "PhotoMaker Encode Plus" node, enter the prompt describing the desired image. Place the special trigger word in the prompt where the identity should appear.
Connect the conditioning output of "PhotoMaker Encode Plus" to a "KSampler" node to generate the image.
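
The wiring described in these steps can be sketched as a small graph in the style of ComfyUI's API-format workflows, where each node has a `class_type` and `inputs`, and a link is written as `[source_node_id, output_index]`. Treat everything specific here as an assumption: apart from PhotoMakerEncodePlus and KSampler, the class names are guessed from the display names above, and the input names, file names, and `<trigger>` placeholder are hypothetical. The graph is also deliberately incomplete (no checkpoint, negative prompt, or latent image), so it illustrates the connections rather than a workflow you can submit as-is.

```python
from typing import Any, Dict, List, Set

Graph = Dict[str, Dict[str, Any]]

# Illustrative wiring only; class and input names are assumptions, not a verified node API.
graph: Graph = {
    "1": {"class_type": "PhotoMakerLoaderPlus",          # assumed class name
          "inputs": {"model_name": "photomaker-v2.bin"}},  # hypothetical input and file
    "2": {"class_type": "PrepImagesForClipVision",       # assumed class name
          "inputs": {"image": "identity_01.png"}},        # hypothetical input and file
    "3": {"class_type": "PhotoMakerInsightFaceLoader",   # assumed class name
          "inputs": {}},
    "4": {"class_type": "PhotoMakerEncodePlus",
          "inputs": {
              "photomaker": ["1", 0],    # hypothetical input names
              "image": ["2", 0],
              "insightface": ["3", 0],
              "text": "portrait photo of a woman <trigger> in a garden",
          }},
    "5": {"class_type": "KSampler",
          "inputs": {"positive": ["4", 0]}},  # other KSampler inputs omitted
}


def is_link(value: Any) -> bool:
    """A link is a two-item list: [source node id, output index]."""
    return (isinstance(value, list) and len(value) == 2
            and isinstance(value[0], str) and isinstance(value[1], int))


def dangling_links(g: Graph) -> List[str]:
    """Describe every link that points at a node id missing from the graph."""
    return [f"{node_id}.{name} -> {value[0]}"
            for node_id, node in g.items()
            for name, value in node["inputs"].items()
            if is_link(value) and value[0] not in g]


def upstream(g: Graph, node_id: str) -> Set[str]:
    """Collect the ids of all nodes that feed, directly or indirectly, into node_id."""
    seen: Set[str] = set()
    stack = [node_id]
    while stack:
        current = stack.pop()
        for value in g[current]["inputs"].values():
            if is_link(value) and value[0] in g and value[0] not in seen:
                seen.add(value[0])
                stack.append(value[0])
    return seen


print(dangling_links(graph))           # an empty list means every link resolves
print(sorted(upstream(graph, "5")))    # every node the sampler depends on
```

Checking `upstream(graph, "5")` is a quick way to confirm that the sampler really depends on the model loader, the prepared images, and the InsightFace loader through the encode node, which is the dependency chain the steps above describe.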

All credit goes to the original authors of PhotoMakerV2 and of the ComfyUI nodes used in this workflow.
