Yikun Ban
Yikunb
AI & ML interests
Reinforcement Learning
Recent Activity
authored a paper 1 day ago: Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning
authored a paper 1 day ago: Your Group-Relative Advantage Is Biased
commented on a paper 2 days ago: Your Group-Relative Advantage Is Biased
Organizations
None yet