F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking Paper • 2605.12995 • Published 7 days ago • 2
DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation Paper • 2604.20841 • Published 28 days ago • 24
MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval Paper • 2604.18584 • Published 30 days ago • 15
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
AgentGL: Towards Agentic Graph Learning with LLMs via Reinforcement Learning Paper • 2604.05846 • Published Apr 7 • 10
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 628