OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents Paper • 2605.28158 • Published 5 days ago • 4
Same Architecture, Different Capacity: Optimizer-Induced Spectral Scaling Laws Paper • 2605.21803 • Published 12 days ago • 4
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL Paper • 2605.18703 • Published 14 days ago • 48
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 29 days ago • 166
DiagramBank: A Large-scale Dataset of Diagram Design Exemplars with Paper Metadata for Retrieval-Augmented Generation Paper • 2604.20857 • Published Feb 28 • 3
Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents Paper • 2604.04979 • Published Apr 4 • 10
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 630