
ilgee hong

ilgee
  • ilgeehong

AI & ML interests

None yet

Recent Activity

updated a dataset 20 days ago
ilgee/RMB-BoN
published a dataset 20 days ago
ilgee/RMB-BoN
updated a dataset 20 days ago
ilgee/RMB-Pairwise

Organizations

None yet

upvoted a paper about 2 months ago

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Paper • 2602.01511 • Published Feb 2 • 15

upvoted 2 papers 6 months ago

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Paper • 2510.03259 • Published Sep 26, 2025 • 57

OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment

Paper • 2510.07743 • Published Oct 9, 2025 • 13

upvoted 2 papers 10 months ago

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Paper • 2505.16421 • Published May 22, 2025 • 19

Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models

Paper • 2505.16265 • Published May 22, 2025 • 8

upvoted a paper about 1 year ago

Discriminative Finetuning of Generative Large Language Models without Reward Models and Preference Data

Paper • 2502.18679 • Published Feb 25, 2025 • 2