- Open VLM Leaderboard: VLMEvalKit Evaluation Results Collection
- Open VLM Video Leaderboard: VLMEvalKit Eval Results in video understanding benchmark
- Open LMM Reasoning Leaderboard: A Leaderboard that demonstrates LMM reasoning capabilities
- MMBench Leaderboard: Explore MMBench Leaderboard data
Papers
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM
Models
- opencompass/CompassJudger-1-32B-Instruct: Text Generation • 33B
- opencompass/CompassJudger-1-14B-Instruct: Text Generation • 15B
- opencompass/CompassJudger-1-7B-Instruct
- opencompass/CompassJudger-1-1.5B-Instruct: 2B