useful sharded checkpoints for users to run inference / fine-tuning on a Google colab without having to deal with CPU OOM issues.
Younes B
ybelkada
AI & ML interests
Large Language Models, Quantization, Vision, Multimodality, Diffusion models
Recent Activity
authored
a paper
about 24 hours ago
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers
upvoted
a
paper
1 day ago
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers