about AI-Master-Book
AI Master Book
LLM MASTER BOOK
LLMs

Powered by GitBook

On this page

Contents
References:

LLMs

Hugging Face

Huggingface Fine-tuning

Contents

Transformer Fine-tuning
PEFT: Fine-tuning Basic
PEFT: Fine-tuning with QLoRA
PEFT: Fine-tuning Phi-2 with QLoRA
Axoltl Fine-tning with QLoRA
TRL: RLHF Alignment Fine-tuning
TRL: DPO Fine-tuning with Phi-3-4k-instruct
TRL: ORPO Fine-tuning with Llama3-8B
Convert GGUF gemma-2b with llama.cpp
Apple Silicon Fine-tuning Gemma-2B with MLX
LLM Mergekit

References:

Huggingface 공식 문서: https://huggingface.co/docs
Toward Data Science: https://towardsdatascience.com
DeepLearning.ai 튜토리얼: https://www.deeplearning.ai/short-courses/
mlabonee Blog: https://mlabonne.github.io/blog/
Mergekit github: https://github.com/arcee-ai/mergekit
Alex Ishida github: https://github.com/alexweberk
Llama.cpp github: https://github.com/ggerganov/llama.cpp
Axotle github: https://github.com/OpenAccess-AI-Collective/axolotl

PreviousOptimum-Intel NextTransformer Fine-tuning

Last updated 4 months ago