iVGR: Internalizing Visually Grounded Reasoning for MLLMs with Reinforcement Learning Paper • 2605.31096 • Published 10 days ago • 7
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs Paper • 2605.30611 • Published 11 days ago • 192
MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale Paper • 2605.27235 • Published 13 days ago • 8
Multi-view Consistent 3D Gaussian Head Avatars 'without' Multi-view Generation Paper • 2605.25220 • Published 15 days ago • 7
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 19 days ago • 204
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published May 6 • 102
UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards Paper • 2604.14967 • Published Apr 16 • 15
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published Apr 13 • 102
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published Apr 9 • 292