Hugging Face Optimization
References:
Hugging Face official documentation: https://huggingface.co/docs
Towards Data Science: https://towardsdatascience.com
ONNX Runtime: https://onnxruntime.ai
NVIDIA TensorRT: https://github.com/huggingface/optimum-nvidia
Intel OpenVINO: https://github.com/huggingface/optimum-intel
DeepLearning.AI tutorials: https://www.deeplearning.ai/short-courses/
FlashAttention GitHub: https://github.com/Dao-AILab/flash-attention
bitsandbytes GitHub: https://github.com/TimDettmers/bitsandbytes