An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models Paper • 2408.00724 • Published Aug 1, 2024 • 2
Learn Hard Problems During RL with Reference Guided Fine-tuning Paper • 2603.01223 • Published Mar 1 • 13
BenchEvolver: Frontier Task Synthesis via Solution-Centric Evolution Paper • 2606.01286 • Published 7 days ago • 5
BenchEvolver: Frontier Task Synthesis via Solution-Centric Evolution Paper • 2606.01286 • Published 7 days ago • 5
BenchEvolver: Frontier Task Synthesis via Solution-Centric Evolution Paper • 2606.01286 • Published 7 days ago • 5
Reliable Fine-Grained Evaluation of Natural Language Math Proofs Paper • 2510.13888 • Published Oct 14, 2025 • 2