bigatuna/Qwen3.5-9b-Sushi-Coder-RL-GGUF Text Generation • 9B • Updated 10 days ago • 13.8k • 41
unsloth/Nemotron-3-Nano-30B-A3B-GGUF Text Generation • 32B • Updated Dec 31, 2025 • 220k • 289
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated 27 days ago • 1.38M • 707
Nanbeige/Nanbeige4-3B-Thinking-2511 Text Generation • 4B • Updated Dec 17, 2025 • 1.88k • 204
view post Post 4438 At the close of the National Holiday🇨🇳, Antgroup drops a new SoTA model.Ling-1T 🔥 the trillion-parameter flagship of the Ling 2.0 series. inclusionAI/Ling-1T✨1T total / 50B active params per token ✨20T+ reasoning-dense tokens (Evo-CoT)✨128K context via YaRN ✨FP8 training: 15%+ faster, same precision as BF16 ✨Hybrid Syntax-Function-Aesthetics reward for front-end & visual generation See translation 1 reply · 🔥 8 8 + Reply