ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Paper • 2512.05111 • Published 5 days ago • 45
ExpertQA: Expert-Curated Questions and Attributed Answers Paper • 2309.07852 • Published Sep 14, 2023 • 2
DebugBench: Evaluating Debugging Capability of Large Language Models Paper • 2401.04621 • Published Jan 9, 2024 • 2
Meta Llama 3 Collection • This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 872
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models Paper • 2404.07839 • Published Apr 11, 2024 • 47