Shaobai Jiang
shaobaij
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 hour ago
daVinci-LLM: Towards the Science of Pretraining
upvoted a paper about 12 hours ago
Composer 2 Technical Report
upvoted a paper about 22 hours ago
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training
Organizations
None yet