Datasets
updated
shayekh/perplexity__aya_dataset__train
Viewer
• Updated • 540k • 28
• 1
argilla/magpie-ultra-v0.1
Viewer
• Updated • 50k • 694
• 221
Magpie-Align/Magpie-Qwen2-Pro-1M-v0.1
Viewer
• Updated • 1M • 117
• 14
HuggingFaceTB/smollm-corpus
Viewer
• Updated • 237M • 36.7k
• 445
Viewer
• Updated • 100k • 7.35k
• 265
BanglaLLM/bangla-alpaca-orca
Viewer
• Updated • 172k • 47
• 4
AhmadMustafa/Urdu-Instruct-News-Article-Generation
Viewer
• Updated • 112k • 25
• 4
AhmadMustafa/Urdu-Instruct-News-Headline-Generation
Viewer
• Updated • 112k • 11
AhmadMustafa/Urdu-Instruct-News-Category-Classification
Viewer
• Updated • 112k • 28
Viewer
• Updated • 10k • 304
• 54
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft
Viewer
• Updated • 6.37M • 93
• 2
CohereLabs/aya_collection_language_split
Viewer
• Updated • 514M • 7.75k
• 114
Viewer
• Updated • 63k • 167
• 35
Viewer
• Updated • 21.9M • 2.13k
• 700
convaiinnovations/Nadi_Indic466k_Instruct
Viewer
• Updated • 466k • 6
• 2
ai4bharat/indic-instruct-data-v0.1
Viewer
• Updated • 404k • 314
• 25
Viewer
• Updated • 9.97k • 27
• 2
MarkrAI/KoCommercial-Dataset
Viewer
• Updated • 175k • 620
• 165