Yikun B's picture

Yikun B

Yikunb

·

AI & ML interests

Reinforcement Learning

Recent Activity

commented on a paper about 7 hours ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

authored a paper 1 day ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

upvoted a paper 1 day ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

View all activity

Organizations

None yet

authored a paper 1 day ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published 15 days ago • 164

submitted a paper to Daily Papers 1 day ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published 15 days ago • 164

authored a paper 14 days ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published 16 days ago • 267

submitted a paper to Daily Papers 14 days ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published 16 days ago • 267

authored a paper 22 days ago

Real-Time Aligned Reward Model beyond Semantics

Paper • 2601.22664 • Published 25 days ago • 13

submitted a paper to Daily Papers 23 days ago

Real-Time Aligned Reward Model beyond Semantics

Paper • 2601.22664 • Published 25 days ago • 13

submitted a paper to Daily Papers about 1 month ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 154

authored 2 papers about 1 month ago

Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

Paper • 2505.16270 • Published May 22, 2025 • 6

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 154