PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published 8 days ago • 41
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 2 days ago • 43
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 3 days ago • 66