4 18 3

Xuehui Wang

huiserwang

https://huiserwang.site

huiserwang

AI & ML interests

Segmentation

Recent Activity

upvoted a paper 2 months ago

Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports

upvoted a paper 2 months ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

upvoted a paper 5 months ago

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

View all activity

Organizations

upvoted 2 papers 2 months ago

Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports

Paper • 2603.09896 • Published Mar 10 • 28

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published Mar 10 • 48

upvoted a paper 5 months ago

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

Paper • 2512.08478 • Published Dec 9, 2025 • 77

upvoted a paper 6 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 195

liked a model 6 months ago

miromind-ai/MiroThinker-v1.0-72B

Text Generation • 73B • Updated Jan 16 • 36 • 130

updated a dataset 7 months ago

huiserwang/Layout_HW

Viewer • Updated Oct 20, 2025 • 230 • 8

upvoted 2 papers 7 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 110

NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints

Paper • 2510.08565 • Published Oct 9, 2025 • 21

published 2 datasets 8 months ago

huiserwang/Layout_HW

Viewer • Updated Oct 20, 2025 • 230 • 8

huiserwang/temp_files

Updated Sep 19, 2025 • 5

updated a dataset 8 months ago

huiserwang/temp_files

Updated Sep 19, 2025 • 5

updated a dataset 9 months ago

OpenGVLab/MMBench-GUI

Preview • Updated Aug 15, 2025 • 209 • 37

upvoted a paper 10 months ago

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

Paper • 2507.19478 • Published Jul 25, 2025 • 33

commented a paper 10 months ago

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

Paper • 2507.19478 • Published Jul 25, 2025 • 33 •

New activity in OpenGVLab/MMBench-GUI 10 months ago

Enhance dataset card: Add metadata, paper abstract, and detailed information

#1 opened 10 months ago by

nielsr

liked a dataset 11 months ago

OpenGVLab/MMBench-GUI

Preview • Updated Aug 15, 2025 • 209 • 37

published a dataset 11 months ago

OpenGVLab/MMBench-GUI

Preview • Updated Aug 15, 2025 • 209 • 37

upvoted a paper 11 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 207

upvoted 2 articles 11 months ago

Article

A Dive into Vision-Language Models

adirik, sayakpaul

•

Feb 3, 2023

• 84

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 531

Xuehui Wang

AI & ML interests

Recent Activity

Organizations

huiserwang's activity

Enhance dataset card: Add metadata, paper abstract, and detailed information

A Dive into Vision-Language Models

Vision Language Models Explained