Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
35
73
109
Li Dong
unilm
Follow
SinclairWang's profile picture
Talha1920's profile picture
liyucheng's profile picture
53 followers
ยท
22 following
AI & ML interests
Language Model Pre-Training
Recent Activity
new
activity
about 16 hours ago
microsoft/VibeVoice-ASR:
Can this model be run on a Turing GPU (No Flash Attention support)?
liked
a model
1 day ago
microsoft/VibeVoice-ASR
authored
a paper
3 days ago
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
View all activity
Organizations
Articles
1
Article
22
Differential Transformer V2
Papers
81
arxiv:
2601.08808
arxiv:
2511.10643
arxiv:
2510.26658
arxiv:
2510.24514
Expand 81 papers
spaces
1
Runtime error
4
Promptist
๐
models
0
None public yet
datasets
0
None public yet