4 7

Zhichen Zeng

CharyZeng

Zhichenzzz

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

HierSVA: A Data Synthesis Pipeline, Dataset, and Benchmark for LLM-Driven Hierarchical Hardware Formal Verification

updated a model 3 days ago

CharyZeng/Kimi-K2.5-2layer

updated a model 17 days ago

CharyZeng/Kimi-K2.5-4layer

View all activity

Organizations

authored a paper 3 days ago

HierSVA: A Data Synthesis Pipeline, Dataset, and Benchmark for LLM-Driven Hierarchical Hardware Formal Verification

Paper • 2606.13706 • Published 10 days ago

updated a model 3 days ago

CharyZeng/Kimi-K2.5-2layer

21B • Updated 3 days ago • 18

updated a model 17 days ago

CharyZeng/Kimi-K2.5-4layer

56B • Updated 17 days ago • 13

published a model 17 days ago

CharyZeng/Kimi-K2.5-4layer

56B • Updated 17 days ago • 13

authored 2 papers 18 days ago

Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regression

Paper • 2510.01450 • Published Oct 1, 2025 • 2

Parallax: Parameterized Local Linear Attention for Language Modeling

Paper • 2605.29157 • Published 23 days ago • 11

upvoted a paper 19 days ago

Parallax: Parameterized Local Linear Attention for Language Modeling

Paper • 2605.29157 • Published 23 days ago • 11

updated a model about 2 months ago

CharyZeng/DeepSeek-V4-Flash-4layer

Text Generation • 15B • Updated Apr 25 • 83

published 2 models about 2 months ago

CharyZeng/DeepSeek-V4-Flash-4layer

Text Generation • 15B • Updated Apr 25 • 83

CharyZeng/Kimi-K2.5-2layer

21B • Updated 3 days ago • 18

authored a paper 3 months ago

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Paper • 2603.10160 • Published Mar 10 • 26

liked a model 4 months ago

zai-org/GLM-5

Text Generation • 754B • Updated Apr 5 • 65.6k • • 2.1k

liked a Space 4 months ago

HLE Leaderboard for Agents with Tools

🥇

Humanity's Last Exam Leaderboard for LLM Agents with Tools

liked a dataset 11 months ago

UW-FMRL2/MMMG

Viewer • Updated May 27, 2025 • 937 • 78 • 13

authored a paper over 1 year ago

Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs

Paper • 2503.06342 • Published Mar 8, 2025 • 1

upvoted a paper over 1 year ago

Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs

Paper • 2503.06342 • Published Mar 8, 2025 • 1

liked 2 models over 1 year ago

SeerAttention/SeerAttention-Llama-3.1-8B-AttnGates

Text Generation • Updated Mar 3, 2025 • 559 • 4

SeerAttention/SeerAttention-Llama-3.1-8B

Text Generation • 8B • Updated Feb 16, 2025 • 7 • 4

authored 2 papers over 1 year ago

EN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based Methodology

Paper • 2404.11887 • Published Apr 18, 2024

LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration

Paper • 2408.06003 • Published Aug 12, 2024

Zhichen Zeng

AI & ML interests

Recent Activity

Organizations

CharyZeng's activity

HLE Leaderboard for Agents with Tools