8 17 9

Zhongang Cai

caizhongang

http://caizhongang.com/

AI & ML interests

Multimodal, Spatial Intelligence, Embodied AI, Virtual Humans.

Recent Activity

upvoted a paper 15 days ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

authored a paper 18 days ago

Scaling Spatial Intelligence with Multimodal Foundation Models

upvoted a paper 18 days ago

Scaling Spatial Intelligence with Multimodal Foundation Models

View all activity

Organizations

upvoted a paper 15 days ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published 19 days ago • 91

upvoted a paper 18 days ago

Scaling Spatial Intelligence with Multimodal Foundation Models

Paper • 2511.13719 • Published 21 days ago • 44

upvoted a paper 21 days ago

PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image

Paper • 2511.13648 • Published 22 days ago • 52

upvoted a paper 22 days ago

Simulating the Visual World with Artificial Intelligence: A Roadmap

Paper • 2511.08585 • Published 27 days ago • 29

upvoted 3 papers about 1 month ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5 • 124

Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals

Paper • 2510.27684 • Published Oct 31 • 22

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Paper • 2510.26794 • Published Oct 30 • 26

upvoted a paper about 2 months ago

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Paper • 2508.00599 • Published Aug 1 • 7

upvoted 2 papers 2 months ago

VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

Paper • 2510.05094 • Published Oct 6 • 37

Visual Jigsaw Post-Training Improves MLLMs

Paper • 2509.25190 • Published Sep 29 • 36

upvoted 3 papers 4 months ago

upvoted a collection 7 months ago

EgoLife

Collection

CVPR 2025 - EgoLife: Towards Egocentric Life Assistant. Homepage: https://egolife-ai.github.io/ • 10 items • Updated Mar 7 • 20

upvoted a paper 7 months ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 46

upvoted 2 papers about 1 year ago

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Paper • 2412.00174 • Published Nov 29, 2024 • 23

Disco4D: Disentangled 4D Human Generation and Animation from a Single Image

Paper • 2409.17280 • Published Sep 25, 2024 • 11

Zhongang Cai

AI & ML interests

Recent Activity

Organizations

caizhongang's activity