Hugging Face
wang binghai
refrain-wbh
10 followers · 1 following
AI & ML interests
None yet
Recent Activity
Upvoted a paper about 11 hours ago: Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models
Submitted a paper about 11 hours ago: Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models
Liked a dataset 5 days ago: Qwen/RationaleRM
Organizations
refrain-wbh's models (1)
refrain-wbh/emnlp-hh-rlhf (updated Jun 29, 2024)