smol-training-playbook Running on CPU Upgrade Featured 3.12k The Smol Training Playbook ๐ 3.12k The secrets to building world-class LLMs LLM-in-Sandbox Elicits General Agentic Intelligence Paper โข 2601.16206 โข Published Jan 22 โข 86 EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper โข 2601.15876 โข Published Jan 22 โข 92 BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper โข 2510.08697 โข Published Oct 9, 2025 โข 39
Running on CPU Upgrade Featured 3.12k The Smol Training Playbook ๐ 3.12k The secrets to building world-class LLMs
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper โข 2601.15876 โข Published Jan 22 โข 92
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper โข 2510.08697 โข Published Oct 9, 2025 โข 39
smol-training-playbook Running on CPU Upgrade Featured 3.12k The Smol Training Playbook ๐ 3.12k The secrets to building world-class LLMs LLM-in-Sandbox Elicits General Agentic Intelligence Paper โข 2601.16206 โข Published Jan 22 โข 86 EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper โข 2601.15876 โข Published Jan 22 โข 92 BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper โข 2510.08697 โข Published Oct 9, 2025 โข 39
Running on CPU Upgrade Featured 3.12k The Smol Training Playbook ๐ 3.12k The secrets to building world-class LLMs
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper โข 2601.15876 โข Published Jan 22 โข 92
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper โข 2510.08697 โข Published Oct 9, 2025 โข 39