LoRA - AshokBhat/ml GitHub Wiki

About

LoRA (Low-Rank Adaptation)
A technique in natural language processing that reduces the number of trainable parameters and GPU memory requirements
By introducing trainable rank decomposition matrices into each layer of the Transformer architecture
Allowing for efficient adaptation of pre-trained language models to specific tasks or domains.

See also