DyCodeEval (ICML 2025) enables dynamic benchmarking for code LLMs. This collection features dynamic HumanEval and MBPP sets generated with Claude 3.5.
Simin Chen
CM
AI & ML interests
None yet
Organizations
datasets 12
CM/humaneval_trans_java_python
Viewer • Updated • 164 • 25
CM/Dynamic_LeetCode
Viewer • Updated • 2.87k • 18
CM/Dynamic_MBPP_sanitized
Viewer • Updated • 15.8k • 21
CM/Dynamic_HumanEvalZero
Viewer • Updated • 15.7k • 21
CM/codexglue_codetrans
Viewer • Updated • 11.8k • 60 • 2
CM/codexglue_code2text_ruby
Viewer • Updated • 27.6k • 367 • 1
CM/codexglue_code2text_python
Viewer • Updated • 281k • 1.08k • 8
CM/codexglue_code2text_php
Viewer • Updated • 268k • 281 • 2
CM/codexglue_code2text_javascript
Viewer • Updated • 65.2k • 583 • 12
CM/codexglue_code2text_java
Viewer • Updated • 181k • 1.15k • 4