Coding-Models YOYO-AI/Qwen3-30B-A3B-CoderThinking-YOYO-linear Text Generation • 31B • Updated Aug 6, 2025 • 27 • 8 ServiceNow-AI/Apriel-1.6-15b-Thinker Image-Text-to-Text • Updated Dec 22, 2025 • 414 • 300
models robowaifudev/megatron-gpt2-345m Text Generation • 0.4B • Updated Apr 8, 2023 • 3.51k • 9 openai-community/gpt2-medium Text Generation • 0.4B • Updated Feb 19, 2024 • 827k • 200 distilbert/distilbert-base-uncased Fill-Mask • 67M • Updated May 6, 2024 • 13.6M • • 872
Datasets ytzi/the-stack-dedup-python-filtered-gpt2 Viewer • Updated Mar 29, 2024 • 12.7M • 22 • 1 allenai/c4 Viewer • Updated Jan 9, 2024 • 10.4B • 760k • 562 P1ayer-1/books-3-textbooks Viewer • Updated Jul 29, 2023 • 5.44k • 223 • 13 databricks/databricks-dolly-15k Viewer • Updated Jun 30, 2023 • 15k • 30.3k • 955
Coding-Models YOYO-AI/Qwen3-30B-A3B-CoderThinking-YOYO-linear Text Generation • 31B • Updated Aug 6, 2025 • 27 • 8 ServiceNow-AI/Apriel-1.6-15b-Thinker Image-Text-to-Text • Updated Dec 22, 2025 • 414 • 300
Datasets ytzi/the-stack-dedup-python-filtered-gpt2 Viewer • Updated Mar 29, 2024 • 12.7M • 22 • 1 allenai/c4 Viewer • Updated Jan 9, 2024 • 10.4B • 760k • 562 P1ayer-1/books-3-textbooks Viewer • Updated Jul 29, 2023 • 5.44k • 223 • 13 databricks/databricks-dolly-15k Viewer • Updated Jun 30, 2023 • 15k • 30.3k • 955
models robowaifudev/megatron-gpt2-345m Text Generation • 0.4B • Updated Apr 8, 2023 • 3.51k • 9 openai-community/gpt2-medium Text Generation • 0.4B • Updated Feb 19, 2024 • 827k • 200 distilbert/distilbert-base-uncased Fill-Mask • 67M • Updated May 6, 2024 • 13.6M • • 872