Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers Paper • 2601.04890 • Published 1 day ago • 28
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 39 items • Updated about 14 hours ago • 57
BigCodeArena: Judging code generations end to end with code executions Article • Published Oct 7, 2025 • 19
MobileLLM-R1 Collection MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21, 2025 • 27
NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models Paper • 2506.07731 • Published Jun 9, 2025 • 2
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30, 2025 • 68