Open VLM Leaderboard
VLMEvalKit Evaluation Results Collection
None defined yet.
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM
VLMEvalKit Evaluation Results Collection
ATLAS for Frontier Scientific Benchmark
A Gallery of Generation Results on RISEBench
A Leaderboard for LMM spatial understanding capabilities
VLMEvalKit Subjectivce Benchmark Results
Compass Academic Leaderboard Full Version
A Leaderboard that demonstrates LMM reasoning capabilities
Compass Academic Leaderboard
Explore MMBench Leaderboard data
VLMEvalKit Eval Results in video understanding benchmark
CompassJudger Subjective Evaluation Learderboard
JudgerBench Leaderboard
Display a web page
Evaluate code snippets across multiple languages
Display CompassArena platform
Display a web page