Jiatao Gu

thomagram

AI & ML interests

NLP, Generative Models, Efficient Models, Deep Learning

Recent Activity

updated a model about 2 months ago

upvoted a paper 2 months ago

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

liked a model 2 months ago

Multiplex-Thinking/Multiplex-Thinking-1.5B

authored 20 papers 3 months ago

Unified Speech-Text Pre-training for Speech Translation and Recognition

Paper • 2204.05409 • Published Apr 11, 2022

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

Paper • 2202.03555 • Published Feb 7, 2022

Textless Speech-to-Speech Translation on Real Data

Paper • 2112.08352 • Published Dec 15, 2021

StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis

Paper • 2110.08985 • Published Oct 18, 2021

Direct speech-to-speech translation with discrete units

Paper • 2107.05604 • Published Jul 12, 2021

Diffusion Models Without Attention

Paper • 2311.18257 • Published Nov 30, 2023 • 3

Neural Machine Translation with Byte-Level Subwords

Paper • 1909.03341 • Published Sep 7, 2019 • 1

Non-Autoregressive Neural Machine Translation

Paper • 1711.02281 • Published Nov 7, 2017 • 1

Depth-Adaptive Transformer

Paper • 1910.10073 • Published Oct 22, 2019 • 1

Cross-lingual Retrieval for Iterative Self-Supervised Training

Paper • 2006.09526 • Published Jun 16, 2020

Generative Modeling with Phase Stochastic Bridges

Paper • 2310.07805 • Published Oct 11, 2023

Multilingual Denoising Pre-training for Neural Machine Translation

Paper • 2001.08210 • Published Jan 22, 2020

Matryoshka Diffusion Models

Paper • 2310.15111 • Published Oct 23, 2023 • 45

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning

Paper • 2008.00401 • Published Aug 2, 2020 • 1

Volume Rendering of Neural Implicit Surfaces

Paper • 2106.12052 • Published Jun 22, 2021

fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit

Paper • 2109.06912 • Published Sep 14, 2021

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation

Paper • 2204.02967 • Published Apr 6, 2022

Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling

Paper • 2405.21048 • Published May 31, 2024 • 16

CLEAR: Contrastive Learning for Sentence Representation

Paper • 2012.15466 • Published Dec 31, 2020

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Paper • 2410.08159 • Published Oct 10, 2024 • 26