Yingfa Chen

chen-yingfa

https://chen-yingfa.github.io

AI & ML interests

Long-context modeling, continual learning, architectures

Organizations

None yet

Recent Activity

liked a model 24 days ago

chen-yingfa/HypeNet-5B

5B • Updated 24 days ago • 60 • 1

updated a model 24 days ago

chen-yingfa/HypeNet-5B

5B • Updated 24 days ago • 60 • 1

updated a collection 24 days ago

HypeNet

Collection

The models for the paper: Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts • 2 items • Updated 24 days ago

published a model 24 days ago

chen-yingfa/HypeNet-5B

5B • Updated 24 days ago • 60 • 1

updated a collection about 2 months ago

HypeNet

Collection

The models for the paper: Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts • 2 items • Updated 24 days ago

liked a model about 2 months ago

chen-yingfa/HypeNet-2B

2B • Updated Apr 7 • 26 • 2

updated a model about 2 months ago

chen-yingfa/HypeNet-2B

2B • Updated Apr 7 • 26 • 2

published a model about 2 months ago

chen-yingfa/HypeNet-2B

2B • Updated Apr 7 • 26 • 2

upvoted an article about 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 898

liked a model 3 months ago

openbmb/MiniCPM-SALA

Text Generation • 9B • Updated 15 days ago • 13.3k • 679

liked a dataset 3 months ago

openbmb/UltraData-Math

Viewer • Updated Apr 15 • 181M • 63.7k • 307

liked a model 4 months ago

openbmb/MiniCPM-o-4_5

Any-to-Any • 9B • Updated 3 days ago • 160k • 1.38k

authored a paper 4 months ago

Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

Paper • 2601.22156 • Published Jan 29 • 14

upvoted a paper 4 months ago

Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

Paper • 2601.22156 • Published Jan 29 • 14

submitted a paper to Daily Papers 4 months ago

Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

Paper • 2601.22156 • Published Jan 29 • 14

upvoted a paper 6 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 133

liked a dataset 7 months ago

caskcsg/Litelong_Nextlong_512k

Preview • Updated Sep 20, 2025 • 184 • 1

authored a paper 8 months ago

StateX: Enhancing RNN Recall via Post-training State Expansion

Paper • 2509.22630 • Published Sep 26, 2025 • 4

upvoted a paper 8 months ago

StateX: Enhancing RNN Recall via Post-training State Expansion

Paper • 2509.22630 • Published Sep 26, 2025 • 4

commented a paper 8 months ago

StateX: Enhancing RNN Recall via Post-training State Expansion

Paper • 2509.22630 • Published Sep 26, 2025 • 4
