jiayi lei
jyjyjyjy
AI & ML interests
None yet
Recent Activity
authored a paper about 2 months ago
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
authored a paper about 2 months ago
CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language Models
authored a paper about 2 months ago
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI