Models used in CHARM: Calibrating Reward Models With Chatbot Arena Scores.
shawnxzhu
shawnxzhu
·
AI & ML interests
None yet
Recent Activity
updated
a model
13 days ago
shawnxzhu/cdgpt-1b
published
a model
14 days ago
shawnxzhu/cdgpt-1b
upvoted
a
paper
about 2 months ago
QueST: Incentivizing LLMs to Generate Difficult Problems
Organizations
None yet