GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 9 days ago • 347
Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models? Paper • 2603.22582 • Published 19 days ago • 7