The datasets for benchmarking and training of LLM coding agents.
SWE-bench-Live
community
AI & ML interests
None defined yet.
Recent Activity
Organization Card
SWE-bench-Live
Here we host SWE-bench-Live dataset, with continuous monthly updates!
models
0
None public yet