by OzzyGT
Open source · 4k downloads · 8 likes
The SDXL IP Adapter is an AI model designed to generate images from text descriptions while incorporating precise visual references. It allows users to adapt the style, details, or elements of a source image to influence the final output, providing greater control over the creation of realistic or stylized images. Its key capabilities include generating cinematic portraits, detailed compositions, or complex scenes by combining text and input images. This model stands out for its compatibility with SDXL and its modular approach, particularly with the multi-image composition feature for finer adjustments. It is especially useful for digital artists, designers, or content creators looking to refine their visuals using external visual references.
This is just a clone from the original repository with just the SDXL vit-h models and the corresponding image encoder.
Update: Added IP Adapter for Composition to be able to use multi ip adapters (they need to be in the same repository). Original model: https://huggingface.co/ostris/ip-composition-adapter
Made to be used with Image Artisan XL.
Here's some experiments:
prompt: cinematic portrait photo of a woman, against a white background, 4k, highly detailed
| source | 0% noise | 25% noise | 50% noise | 85% noise | 100% noise |
|---|---|---|---|---|---|
![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
| source | 0% noise | 25% noise | 50% noise | 85% noise | 100% noise |
|---|---|---|---|---|---|
![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
prompt: cinematic portrait photo of a woman, against a white background, half body shot, closeup, 4k, highly detailed
| source | 0% noise | 25% noise | 50% noise | 85% noise | 100% noise |
|---|---|---|---|---|---|
![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
| w1 | w2 | w3 | w4 | w5 | bg |
|---|---|---|---|---|---|
![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
| w1(1.0) + w2(1.0) | w1(0.5) + w2(1.0) | w1 (1.0) + w2(0.5) | w1(0.5) + w2(0.5) | w1(1.0) + w2(1.0) + bg(1.0) | w1(0.5) + w2(0.5) + bg(1.0) | w1(0.5) + w2(0.5) + bg(0.5) |
|---|---|---|---|---|---|---|
![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
| w1+w2+w3 | w1+w2+w3+w4 | w1+w2+w3+w4+w5 | w1+w2+w3+w4+w5+bg |
|---|---|---|---|
![]() | ![]() | ![]() | ![]() |