Li Dong
unilm
52 followers · 21 following
AI & ML interests
Language Model Pre-Training
Recent Activity
liked a model about 14 hours ago: microsoft/VibeVoice-ASR
authored a paper 2 days ago: MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
authored a paper 2 days ago: Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts
Organizations
Articles
1
Article
19
Differential Transformer V2
Papers
81
arxiv:2601.08808
arxiv:2511.10643
arxiv:2510.26658
arxiv:2510.24514
spaces
1
Runtime error
4
Promptist
models
0
None public yet
datasets
0
None public yet