SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations Paper • 2512.05905 • Published 3 days ago • 12
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image Paper • 2512.05044 • Published 4 days ago • 12
SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling Paper • 2512.05343 • Published 4 days ago • 7
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards Paper • 2512.00473 • Published 9 days ago • 10
PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling Paper • 2512.04784 • Published 6 days ago • 19
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper • 2512.05150 • Published 5 days ago • 43
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published 3 days ago • 30
World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty Paper • 2512.05927 • Published 3 days ago • 8
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence Paper • 2512.04563 • Published 4 days ago • 11
AI & Human Co-Improvement for Safer Co-Superintelligence Paper • 2512.05356 • Published 4 days ago • 4
Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning Paper • 2512.05591 • Published 3 days ago • 13
FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring Paper • 2512.04390 • Published 5 days ago • 6
UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers Paper • 2512.04504 • Published 5 days ago • 15
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Paper • 2512.05111 • Published 4 days ago • 44
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published 5 days ago • 137
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding Paper • 2512.04000 • Published 5 days ago • 2
PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation Paper • 2512.04025 • Published 5 days ago • 2
Light-X: Generative 4D Video Rendering with Camera and Illumination Control Paper • 2512.05115 • Published 4 days ago • 10
Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment Paper • 2511.22345 • Published 11 days ago • 12