Scaling Diffusion Transformers Efficiently via μP
Paper
•
2505.15270
•
Published
•
35
We release pretrained models in the paper Scaling Diffusion Transformers Efficiently via μP, which includes DiT-muP and PixArt-muP.
Code: https://github.com/ML-GSAI/Scaling-Diffusion-Transformers-muP