Submitted by Wenqi Shi 9 Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs Eigen AI 2