| char_gpt.ipynb |
Creating GPT-2 from scratch and conducting autoregressive learning for python script (predict the next char based on context) |
| gpt2.ipynb |
Usage of open-souce GPT-2 model |
| lora_tutorial.ipynb |
Implementation of LoRA (simple version) and usage of LoRA in peft |
| gpt2_lora.ipynb |
SFT (supervised fine-tuning) of GPT-2 by using LoRA (Note: This script is NOT optimal) |
| gpt2_lora_optimum.ipynb |
More elegant SFT on GPT-2 by using LoRA |
| gpt2_reward_modeling.ipynb |
Reward modeling on GPT-2 by using LoRA |