Mingjie Bi
mjb95m
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
Less is More: Early Stopping Rollout for On-Policy Distillation
liked a dataset 5 months ago
bigai/TongSIM-Asset
upvoted a paper 10 months ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification