Collections

Collections including paper arxiv:2403.03206

ComparePerspective

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 71

Models - Image - Rectified Flow Transformers

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 791k • 4.45k
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 71

Diffusion Model

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 71

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Paper • 2405.08748 • Published May 14, 2024 • 24
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection

Paper • 2405.10300 • Published May 16, 2024 • 30
Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 132
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 41

Vision and language

No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

Paper • 2404.04125 • Published Apr 4, 2024 • 29
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Paper • 2404.03653 • Published Apr 4, 2024 • 36
Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

Paper • 2404.02747 • Published Apr 3, 2024 • 13
3D Congealing: 3D-Aware Image Alignment in the Wild

Paper • 2404.02125 • Published Apr 2, 2024 • 10

Papers - Image - Rectified Flow Transformers

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 71

stable diffusion

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 71

computer vision papers 👓

Rich feature hierarchies for accurate object detection and semantic segmentation

Paper • 1311.2524 • Published Nov 11, 2013 • 1
DeepPose: Human Pose Estimation via Deep Neural Networks

Paper • 1312.4659 • Published Dec 17, 2013 • 1
Generative Adversarial Networks

Paper • 1406.2661 • Published Jun 10, 2014 • 5
scikit-image: Image processing in Python

Paper • 1407.6245 • Published Jul 23, 2014 • 1

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 71
stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 10.4k • 4.88k

Scaling Diffusion Models

Resources (papers) mostly related to studying the scaling aspects of diffusion models. The resources appear in non-chronological order.

Bigger is not Always Better: Scaling Properties of Latent Diffusion Models

Paper • 2404.01367 • Published Apr 1, 2024 • 22
On the Scalability of Diffusion-based Text-to-Image Generation

Paper • 2404.02883 • Published Apr 3, 2024 • 19
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 71
Improved Denoising Diffusion Probabilistic Models

Paper • 2102.09672 • Published Feb 18, 2021 • 2
