2 17 61

Zilin Zhu

zhuzilin

zhuzilin

AI & ML interests

MLSys

Recent Activity

upvoted a paper 6 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

updated a dataset 11 days ago

zhuzilin/aime-2025

published a dataset 11 days ago

zhuzilin/aime-2025

View all activity

Organizations

None yet

upvoted a paper 6 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 7 days ago • 78

updated a dataset 11 days ago

zhuzilin/aime-2025

Viewer • Updated 11 days ago • 30 • 28

published a dataset 11 days ago

zhuzilin/aime-2025

Viewer • Updated 11 days ago • 30 • 28

upvoted a paper 27 days ago

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Paper • 2511.07327 • Published 28 days ago • 74

liked a model 4 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 4.51M • • 4.22k

updated 2 datasets 5 months ago

zhuzilin/dapo-math-17k

Viewer • Updated Jul 25 • 17.4k • 930 • 3

zhuzilin/gsm8k

Viewer • Updated Jul 25 • 8.79k • 122 • 1

published a dataset 5 months ago

zhuzilin/gsm8k

Viewer • Updated Jul 25 • 8.79k • 122 • 1

upvoted a paper 5 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 240

upvoted a paper 6 months ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23 • 56

updated a dataset 6 months ago

zhuzilin/aime-2024

Viewer • Updated Jun 19 • 30 • 827 • 2

published 2 datasets 6 months ago

zhuzilin/aime-2024

Viewer • Updated Jun 19 • 30 • 827 • 2

zhuzilin/dapo-math-17k

Viewer • Updated Jul 25 • 17.4k • 930 • 3

updated a model 6 months ago

zhuzilin/Moonlight-16B-A3B-Instruct

Updated May 31

published a model 6 months ago

zhuzilin/Moonlight-16B-A3B-Instruct

Updated May 31

upvoted a paper 8 months ago

Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model

Paper • 2504.15843 • Published Apr 22 • 17

upvoted a paper 9 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113

liked a dataset 10 months ago

facebook/natural_reasoning

Viewer • Updated Feb 21 • 1.15M • 2.11k • 543

liked a dataset 11 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 16 • 35

upvoted a paper 11 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99

Zilin Zhu

AI & ML interests

Recent Activity

Organizations

zhuzilin's activity