wang binghai's picture

1 3 1

wang binghai

refrain-wbh

·

refrain-wbh

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

submitted a paper 12 days ago

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

liked a dataset 16 days ago

Qwen/RationaleRM

View all activity

Organizations

refrain-wbh 's datasets

None public yet