Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lalala's picture
5 12

lalala

llllaadafatrea
shtefcs's profile picture SiweiWu's profile picture frascuchon's profile picture
·

AI & ML interests

None yet

Organizations

ezetimibe's profile picture

upvoted a paper 6 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187
upvoted 2 papers 7 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 188
upvoted a paper 8 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44
upvoted a paper about 1 year ago

PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment

Paper • 2410.13785 • Published Oct 17, 2024 • 19
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs