PhotoMaker V2

by TencentARC

Open source · 5k downloads · 154 likes

Rating: 2.7 (154 reviews) · Image · API & Local
About

PhotoMaker V2 is an AI model that customizes images from one or more face photos and a text description. It generates realistic or stylized portraits in a range of artistic and photographic styles without additional training. The model can be adapted to SDXL-based base models or combined with other LoRA modules, which makes it flexible to deploy. Use cases include personalized content, image editing, and artwork generation, though customization quality degrades on Asian male faces and the model still struggles to render hands accurately. PhotoMaker V2 is aimed at creators who want to transform portraits with precision and creative control.

Documentation

PhotoMaker V2 Model Card

Project Page | Paper (ArXiv) | Code

🤗 Gradio demo

Introduction

Users can input one or a few face photos, along with a text prompt, to receive a customized photo or painting within seconds (no training required!). Additionally, this model can be adapted to any base model based on SDXL or used in conjunction with other LoRA modules.

Realistic results

[Two example images: realistic results]

Stylization results

[Two example images: stylization results]

More results can be found on our project page.

Model Details

The checkpoint mainly contains two parts, corresponding to two keys in the loaded state dict:

  1. id_encoder includes finetuned OpenCLIP-ViT-H-14 and a few fuse layers.

  2. lora_weights applies to all attention layers in the UNet, and the rank is set to 64.
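
As a rough illustration of the two-part layout described above, the sketch below summarizes a checkpoint by its top-level keys and shows what a rank of 64 means for the size of one LoRA pair. It assumes each top-level key maps to a nested dictionary of parameters, which this page does not confirm. The helper names, the inner parameter names, and the 1280-wide layer are hypothetical and chosen only for the example.

```python
from collections.abc import Mapping


def summarize_checkpoint(state_dict: Mapping) -> dict:
    """Return the number of entries stored under each top-level key."""
    summary = {}
    for key, value in state_dict.items():
        # Assumption: each top-level value is a nested mapping of parameter
        # names to tensors; anything else is counted as a single entry.
        summary[key] = len(value) if isinstance(value, Mapping) else 1
    return summary


def lora_param_count(d_in: int, d_out: int, rank: int = 64) -> int:
    """Parameters added by one LoRA pair: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


# Stand-in for the real checkpoint; the inner parameter names are made up.
fake_ckpt = {
    "id_encoder": {"layer.weight": None, "layer.bias": None},
    "lora_weights": {"attn.lora_A": None, "attn.lora_B": None, "attn2.lora_A": None},
}

print(summarize_checkpoint(fake_ckpt))  # {'id_encoder': 2, 'lora_weights': 3}
print(lora_param_count(1280, 1280))     # 163840, versus 1638400 for the full 1280 x 1280 matrix
```

With the real file, the same helper could be applied to the result of `torch.load(photomaker_ckpt, map_location="cpu")`, assuming the file loads as a plain dictionary.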

Usage

You can download the model directly from this repository, or from a Python script:

Python
from huggingface_hub import hf_hub_download

# Download the checkpoint into the local Hugging Face cache and return its file path.
photomaker_ckpt = hf_hub_download(repo_id="TencentARC/PhotoMaker-V2", filename="photomaker-v2.bin", repo_type="model")

Then, please follow the instructions in our GitHub repository.

Limitations

  • The model's customization performance degrades on Asian male faces.
  • The model still struggles with accurately rendering human hands.

Bias

While the capabilities of image generation models are impressive, they can also reinforce or exacerbate social biases.

Citation

BibTeX:

Bibtex
@inproceedings{li2023photomaker,
  title={PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding},
  author={Li, Zhen and Cao, Mingdeng and Wang, Xintao and Qi, Zhongang and Cheng, Ming-Ming and Shan, Ying},
  booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2024}
}

Capabilities & Tags

diffusers · text-to-image · en

Specifications

Category: Image
Access: API & Local
License: Open Source
Pricing: Open Source
Rating: 2.7
