Jialiang Cheng
Julius-L
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models
authored a paper 1 day ago
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models
liked a dataset 6 months ago
Salesforce/wikitext