2 10 15

CL Yu

clyu

AI & ML interests

None yet

Recent Activity

upvoted a collection 12 days ago

NeMo Gym

liked a dataset 22 days ago

nvidia/Nemotron-Pretraining-Specialized-v1.1

liked a dataset 23 days ago

stepfun-ai/Step-3.5-Flash-SFT

View all activity

Organizations

upvoted a collection 12 days ago

NeMo Gym

Collection

Collection of RL verifiable data for NeMo Gym • 22 items • Updated about 13 hours ago • 55

liked a dataset 22 days ago

nvidia/Nemotron-Pretraining-Specialized-v1.1

Viewer • Updated about 1 month ago • 19.8M • 4.37k • 38

liked a dataset 23 days ago

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated 28 days ago • 1.62M • 59.9k • 313

liked a model 2 months ago

Qwen/Qwen3-Coder-Next

Text Generation • 80B • Updated Feb 3 • 678k • • 1.25k

submitted a paper to Daily Papers 2 months ago

Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training

Paper • 2602.05933 • Published Feb 5 • 6

upvoted an article 2 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

Jan 28

•

153

upvoted a paper 2 months ago

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Paper • 2601.21821 • Published Jan 29 • 62

liked a dataset 3 months ago

MiniMaxAI/OctoCodingBench

Viewer • Updated Jan 13 • 72 • 460 • 264

updated a model 4 months ago

clyu/clip0.28_clipl0.2_vanilla_bsz512_mb128

Updated Dec 17, 2025

published a model 4 months ago

clyu/clip0.28_clipl0.2_vanilla_bsz512_mb128

Updated Dec 17, 2025

updated a model 4 months ago

clyu/cliph4_clipl0.5_cumloss_bsz512_mb128

Updated Dec 17, 2025

published a model 4 months ago

clyu/cliph4_clipl0.5_cumloss_bsz512_mb128

Updated Dec 17, 2025

liked a model 5 months ago

Salesforce/xRouter

Text Generation • 8B • Updated Nov 4, 2025 • 124 • 15

updated a model 5 months ago

clyu/qwen3_14b_rstar_sft_step802

15B • Updated Nov 17, 2025 • 3

published a model 5 months ago

clyu/qwen3_14b_rstar_sft_step802

15B • Updated Nov 17, 2025 • 3

liked 2 datasets 6 months ago

microsoft/rStar-Coder

Viewer • Updated Jul 20, 2025 • 1.86M • 6.31k • 236

zhenghaoxu/R2E-Gym-Lite-with-Difficulty

Viewer • Updated Sep 19, 2025 • 6.24k • 55 • 4

upvoted 3 papers 6 months ago

LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published Oct 22, 2025 • 63

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 85

AlphaQuanter: An End-to-End Tool-Orchestrated Agentic Reinforcement Learning Framework for Stock Trading

Paper • 2510.14264 • Published Oct 16, 2025 • 10

CL Yu

AI & ML interests

Recent Activity

Organizations

clyu's activity

We Got Claude to Build CUDA Kernels and teach open models!