Datasets
updated
shayekh/perplexity__aya_dataset__train
Viewer
• Updated • 540k • 28
• 1
argilla/magpie-ultra-v0.1
Viewer
• Updated • 50k • 722
• 221
Magpie-Align/Magpie-Qwen2-Pro-1M-v0.1
Viewer
• Updated • 1M • 129
• 14
HuggingFaceTB/smollm-corpus
Viewer
• Updated • 237M • 36.9k
• 445
Viewer
• Updated • 100k • 7.34k
• 266
BanglaLLM/bangla-alpaca-orca
Viewer
• Updated • 172k • 50
• 4
AhmadMustafa/Urdu-Instruct-News-Article-Generation
Viewer
• Updated • 112k • 25
• 4
AhmadMustafa/Urdu-Instruct-News-Headline-Generation
Viewer
• Updated • 112k • 14
AhmadMustafa/Urdu-Instruct-News-Category-Classification
Viewer
• Updated • 112k • 20
Viewer
• Updated • 10k • 280
• 54
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft
Viewer
• Updated • 6.37M • 106
• 2
CohereLabs/aya_collection_language_split
Viewer
• Updated • 514M • 8.38k
• 114
Viewer
• Updated • 63k • 162
• 35
Viewer
• Updated • 21.9M • 2.18k
• 701
convaiinnovations/Nadi_Indic466k_Instruct
Viewer
• Updated • 466k • 6
• 2
ai4bharat/indic-instruct-data-v0.1
Viewer
• Updated • 404k • 316
• 25
Viewer
• Updated • 9.97k • 25
• 2
MarkrAI/KoCommercial-Dataset
Viewer
• Updated • 175k • 482
• 165