Model Training ‐ Introduction - InfluxOW/Stable-Diffusion-Text-To-Person GitHub Wiki
So, we will train a LoRA model. In essence, a LoRA adds small trainable low-rank matrices alongside the existing layers of the neural network, and during image generation their output is added to the output of those layers.
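A minimal NumPy sketch of this idea, assuming the common formulation in which a LoRA contributes a low-rank update `B @ A` on top of a frozen layer weight `W`. The sizes, the `forward` helper, and the `lora_scale` parameter are illustrative assumptions, not values taken from any real Stable Diffusion checkpoint or library.

```python
import numpy as np

rng = np.random.default_rng(0)

d_out, d_in, rank = 64, 64, 4  # toy sizes; real layers are far larger

# Frozen weight of one layer in the base checkpoint.
W = rng.normal(size=(d_out, d_in))

# The LoRA itself: two small matrices that would be learned in training.
# They are random here purely for illustration.
A = rng.normal(size=(rank, d_in))
B = rng.normal(size=(d_out, rank))


def forward(x, lora_scale=0.0):
    """Layer output with the LoRA update blended in at `lora_scale`."""
    return W @ x + lora_scale * (B @ (A @ x))


x = rng.normal(size=d_in)
base = forward(x)                     # LoRA switched off: the base output
shifted = forward(x, lora_scale=1.0)  # LoRA applied: the output moves

print("output changed:", not np.allclose(base, shifted))
print(A.size + B.size, "LoRA parameters vs", W.size, "in the layer")
```

With `lora_scale=0.0` the layer behaves exactly like the base model; raising the scale pushes the output further toward what the LoRA was trained on, while the LoRA itself stays much smaller than the layer it modifies.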
We can generate an image of a random person; this is the base image. If we add a LoRA to the generation, the base image shifts in a new direction: for example, the image style changes, or the person's appearance changes to the one the LoRA was trained on.
Here is the base image and the same image with the Lana Del Rey LoRA.
An important advantage, one we will benefit from greatly, is that we can train a LoRA on any checkpoint and then apply it to every checkpoint built on the same base model. That is, a LoRA trained on any SD1.5-based checkpoint can be applied to any other SD1.5-based checkpoint. The same holds for SDXL and others. It may not perform well everywhere, but technically it will work. Finding the best checkpoint to train on will be a large part of the comparison that follows.
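The reason a LoRA transfers between checkpoints of the same family can be sketched with toy data: the update only has to match the layer names and shapes of the architecture, not the particular weight values. Everything below is a hypothetical stand-in. The `make_checkpoint` and `apply_lora` helpers, the layer name `attn.to_q`, and the shapes are assumptions for illustration and do not reflect real SD1.5 or SDXL layouts.

```python
import numpy as np

rng = np.random.default_rng(1)


def make_checkpoint(shapes):
    """Hypothetical stand-in for a checkpoint: layer name -> weight matrix."""
    return {name: rng.normal(size=shape) for name, shape in shapes.items()}


def apply_lora(checkpoint, lora, scale=1.0):
    """Return a copy of `checkpoint` with each LoRA update merged in."""
    merged = dict(checkpoint)
    for name, (B, A) in lora.items():
        W = checkpoint[name]
        delta = B @ A  # low-rank update, same shape as the target layer
        if delta.shape != W.shape:
            raise ValueError(
                f"{name}: LoRA shape {delta.shape} != layer shape {W.shape}"
            )
        merged[name] = W + scale * delta
    return merged


# Toy layer shapes standing in for two different architectures.
ARCH_A = {"attn.to_q": (16, 16)}
ARCH_B = {"attn.to_q": (32, 32)}

ckpt_1 = make_checkpoint(ARCH_A)      # checkpoint the LoRA was trained on
ckpt_2 = make_checkpoint(ARCH_A)      # different weights, same architecture
ckpt_other = make_checkpoint(ARCH_B)  # different architecture

# A rank-4 LoRA sized for ARCH_A (random values, for illustration only).
lora = {"attn.to_q": (rng.normal(size=(16, 4)), rng.normal(size=(4, 16)))}

apply_lora(ckpt_1, lora)  # works
apply_lora(ckpt_2, lora)  # also works: the shapes match
try:
    apply_lora(ckpt_other, lora)
except ValueError as err:
    print("incompatible:", err)
```

The sketch only shows that the update fits; it says nothing about quality, which is why the choice of training checkpoint still has to be compared empirically.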
Next - Model Training ‐ Basics