Examples - InfluxOW/Stable-Diffusion-Text-To-Person GitHub Wiki

Before we go any further, it makes sense to show what kind of results can be achieved.


Danila Poperechnij | More

Karina Istomina | More

Lana Del Rey | More

Skryptonite | More


Note that these results were generated by the previous generation of text-to-image neural networks, based on Stable Diffusion 1.5 (SD1.5). The current generation, based on Stable Diffusion XL (SDXL), delivers a completely different level of quality. However, SDXL-based networks are much more hardware-demanding, and their training and image generation algorithms are still far from optimal. For now, I can recommend delving into them only to enthusiasts. Nevertheless, it won't hurt to show what they are capable of.

Karina Istomina | More

These results were obtained using the base SDXL checkpoint and a LoRA model trained on it. Meanwhile, the base SD1.5 checkpoint produces something completely inappropriate.

It's frightening to imagine what will come next, considering that less than a year passed between SD1.5 and SDXL. However, training models on SDXL is fundamentally no different from training models on SD1.5, so everything discussed from here on applies to both.


Next - Dataset Preparation
