arxiv:2604.14683
Qianqian Xie
mistletoe111
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 hour ago
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation upvoted a paper 1 day ago
WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models updated a dataset 2 days ago
NJU-LINK/DR3-Eval