Unified Speech-Text Pre-training for Speech Translation and Recognition Paper • 2204.05409 • Published Apr 11, 2022
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language Paper • 2202.03555 • Published Feb 7, 2022
StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis Paper • 2110.08985 • Published Oct 18, 2021
Cross-lingual Retrieval for Iterative Self-Supervised Training Paper • 2006.09526 • Published Jun 16, 2020
Multilingual Denoising Pre-training for Neural Machine Translation Paper • 2001.08210 • Published Jan 22, 2020
Multilingual Translation with Extensible Multilingual Pretraining and Finetuning Paper • 2008.00401 • Published Aug 2, 2020
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit Paper • 2109.06912 • Published Sep 14, 2021
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation Paper • 2204.02967 • Published Apr 6, 2022
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Paper • 2405.21048 • Published May 31, 2024 • 16
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Paper • 2410.08159 • Published Oct 10, 2024 • 26