MMSkills: Towards Multimodal Skills for General Visual Agents Paper • 2605.13527 • Published 8 days ago • 116
Running on Zero Agents Featured 2.49k Qwen Image Multiple Angles 3D Camera 🎥 2.49k Edit image camera angle with interactive 3D controls
FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization Paper • 2605.15824 • Published 7 days ago • 57
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 14 days ago • 97
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 18 days ago • 333
GUI-G^2: Gaussian Reward Modeling for GUI Grounding Paper • 2507.15846 • Published Jul 21, 2025 • 135