Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
2
4
quinn
jwhe
Follow
0 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
new
activity
1 day ago
harborframework/parity-experiments:
[Parity] CL-bench: codex/gpt-5.2 vs infer_codex.py (50 tasks, 3 trials, MATCHING)
new
activity
11 days ago
harborframework/parity-experiments:
[Parity] CL-bench: codex/gpt-5.1 vs original pipeline (50 tasks, 3 trials)
authored
a paper
2 months ago
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
View all activity
Organizations
jwhe
's datasets
None public yet