Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
prometheus-eval
university
Activity Feed
Follow
107
AI & ML interests
None defined yet.
Recent Activity
amphora
submitted
a paper
19 days ago
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math
Seongyun
authored
a paper
about 2 months ago
Efficient Long Context Language Model Retrieval with Compression
Seongyun
authored
a paper
about 2 months ago
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors
View all activity
Team members
56
+22
+9
prometheus-eval
's Spaces
2
Sort: Recently updated
Running
15
BiGGen Bench Leaderboard
😻
Display model performance leaderboard
Running
README
🐨