9 9

He Yifan

jackt34

AI & ML interests

Alignment-focused model research.

Recent Activity

upvoted a paper 4 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

upvoted a paper 4 days ago

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

upvoted a paper 4 days ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

View all activity

Organizations

None yet

upvoted 4 papers 4 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 6 days ago • 200

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

Paper • 2605.17757 • Published 8 days ago • 62

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published 12 days ago • 143

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 14 days ago • 191

liked a model 5 days ago

laion/clap-htsat-fused

Audio Classification • 0.2B • Updated Jan 12 • 20.9M • 91

upvoted a paper 8 days ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 12 days ago • 110

liked a dataset 12 days ago

cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 552k • 748

upvoted 2 papers 19 days ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published 20 days ago • 100

AcademiClaw: When Students Set Challenges for AI Agents

Paper • 2605.02661 • Published 22 days ago • 16

liked a dataset 25 days ago

IPEC-COMMUNITY/droid_lerobot

Preview • Updated Apr 28, 2025 • 489k • 22

liked 2 models about 1 month ago

brendan-gho/qwen2.5-1.5b-paraphrased-wolf-cot-seed42

Updated 23 days ago • 1

longzhiying/test

Robotics • 8.45M • Updated Apr 12 • 4 • 1

upvoted 2 papers about 2 months ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

liked a dataset about 2 months ago

rytfh/edtyhgv

Updated Apr 15 • 5.27k • 5

liked a model about 2 months ago

Vizuara/dreamzero-so101-lora

Robotics • 0.1B • Updated Apr 8 • 75 • 6

liked 2 datasets about 2 months ago

OpenSQZ/AutoMathText-V2

Viewer • Updated Apr 2 • 15.2B • 197k • 78

daaxila/twitter-TianxinKitten-2025.04.23-1914986604822925424-ZGRgnHpGeV7rYlxs-part1

Viewer • Updated Apr 1 • 1 • 88 • 1

He Yifan

AI & ML interests

Recent Activity

Organizations

jackt34's activity