Running 59 Stick To Your Role! Leaderboard 🎠59 Benchmarking LLMs on the stability of simulated populations
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • Updated May 22, 2025 • 200k • • 1.22k
Running 593 Scaling test-time compute 📈 593 Boost LLM answers with search‑guided test‑time compute
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 5 days ago • 38
meta-llama/Meta-Llama-3-8B-Instruct Text Generation • 8B • Updated Jun 18, 2025 • 1.37M • • 4.38k