Uniform Discrete Diffusion with Metric Path for Video Generation Paper • 2510.24717 • Published Oct 28, 2025 • 42
Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation Paper • 2510.08994 • Published Oct 10, 2025 • 4 • 2
MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation Paper • 2509.26391 • Published Sep 30, 2025 • 22
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis Paper • 2509.10441 • Published Sep 12, 2025 • 31
Transition Models: Rethinking the Generative Learning Objective Paper • 2509.04394 • Published Sep 4, 2025 • 29
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding Paper • 2507.07984 • Published Jul 10, 2025 • 43
StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling Paper • 2507.05240 • Published Jul 7, 2025 • 48
OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion Paper • 2507.06165 • Published Jul 8, 2025 • 60
OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion Paper • 2507.06165 • Published Jul 8, 2025 • 60 • 1
GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning Paper • 2506.16141 • Published Jun 19, 2025 • 27
DreamCube: 3D Panorama Generation via Multi-plane Synchronization Paper • 2506.17206 • Published Jun 20, 2025 • 23
AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation Paper • 2506.03126 • Published Jun 3, 2025 • 22
GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning Paper • 2505.17022 • Published May 22, 2025 • 27