Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
35
73
109
Li Dong
unilm
Follow
CZWin32768's profile picture
Alya-cc's profile picture
sehun's profile picture
52 followers
Β·
22 following
AI & ML interests
Language Model Pre-Training
Recent Activity
new
activity
about 13 hours ago
microsoft/VibeVoice-ASR:
Can this model be run on a Turing GPU (No Flash Attention support)?
liked
a model
1 day ago
microsoft/VibeVoice-ASR
authored
a paper
2 days ago
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
View all activity
Organizations
Articles
1
Article
22
Differential Transformer V2
Papers
81
arxiv:
2601.08808
arxiv:
2511.10643
arxiv:
2510.26658
arxiv:
2510.24514
Expand 81 papers
spaces
1
Runtime error
4
Promptist
π
models
0
None public yet
datasets
0
None public yet