tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.5 Text Generation • 8B • Updated Jun 25, 2025 • 36.6k • • 17
Tiny Language Model Datasets Collection Collection of Synthetic Datasets that can be used in pretraining of any the Tiny Language Model • 14 items • Updated Sep 21, 2025 • 29