The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation Paper • 2503.04606 • Published Mar 6 • 9
RMT: Retentive Networks Meet Vision Transformers Paper • 2309.11523 • Published Sep 20, 2023 • 33