Open CoT Leaderboard
community
AI & ML interests
Chain of Thought, LLM Evaluation
Recent Activity
yakazimir authored a paper about 3 hours ago
AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite
yakazimir authored a paper about 3 hours ago
TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents
yakazimir authored a paper about 4 hours ago
Probabilistic Programs of Thought
Team members: 3
cot-leaderboard's models
None public yet