He Yifan
jackt34
AI & ML interests
Alignment-focused model research.
Recent Activity
upvoted a paper about 22 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable RewardsOrganizations
None yet