Running • 230 • BigCodeBench Leaderboard • Explore code-generation model leaderboards and task details
Restarting on CPU Upgrade • 18 • BigCodeBench Evaluator • Evaluate code samples using specified parameters
meta-llama/Llama-3.1-70B-Instruct • Text Generation • 71B • Updated Dec 15, 2024 • 1M • 898