Collections
Discover the best community collections!
Collections including paper arxiv:2511.10629
-
yandex/stable-diffusion-3.5-medium-alchemist
Text-to-Image • Updated • 18 • 6 -
Ovis-U1 Technical Report
Paper • 2506.23044 • Published • 62 -
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
Paper • 2507.01953 • Published • 19 -
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Paper • 2507.01945 • Published • 78
-
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models
Paper • 2511.10629 • Published • 122 -
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation
Paper • 2511.09057 • Published • 75 -
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
Paper • 2511.04570 • Published • 208
-
Arbitrary-steps Image Super-resolution via Diffusion Inversion
Paper • 2412.09013 • Published • 13 -
Deep Researcher with Test-Time Diffusion
Paper • 2507.16075 • Published • 67 -
nablaNABLA: Neighborhood Adaptive Block-Level Attention
Paper • 2507.13546 • Published • 124 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 87
-
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control
Paper • 2412.20800 • Published • 11 -
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models
Paper • 2501.06751 • Published • 32 -
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps
Paper • 2501.09732 • Published • 71 -
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
Paper • 2501.09755 • Published • 36
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper • 2401.09048 • Published • 10 -
Improving fine-grained understanding in image-text pre-training
Paper • 2401.09865 • Published • 18 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper • 2401.10891 • Published • 62 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper • 2401.13627 • Published • 77
-
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models
Paper • 2511.10629 • Published • 122 -
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation
Paper • 2511.09057 • Published • 75 -
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
Paper • 2511.04570 • Published • 208
-
Arbitrary-steps Image Super-resolution via Diffusion Inversion
Paper • 2412.09013 • Published • 13 -
Deep Researcher with Test-Time Diffusion
Paper • 2507.16075 • Published • 67 -
nablaNABLA: Neighborhood Adaptive Block-Level Attention
Paper • 2507.13546 • Published • 124 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 87
-
yandex/stable-diffusion-3.5-medium-alchemist
Text-to-Image • Updated • 18 • 6 -
Ovis-U1 Technical Report
Paper • 2506.23044 • Published • 62 -
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
Paper • 2507.01953 • Published • 19 -
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Paper • 2507.01945 • Published • 78
-
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control
Paper • 2412.20800 • Published • 11 -
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models
Paper • 2501.06751 • Published • 32 -
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps
Paper • 2501.09732 • Published • 71 -
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
Paper • 2501.09755 • Published • 36
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper • 2401.09048 • Published • 10 -
Improving fine-grained understanding in image-text pre-training
Paper • 2401.09865 • Published • 18 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper • 2401.10891 • Published • 62 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper • 2401.13627 • Published • 77