arxiv:2512.16561
Dan Xu
danxuhk
AI & ML interests
Computer Vision, Deep Learning, Multimedia
Recent Activity
upvoted a paper about 1 month ago
Controllable Video Generation: A Survey authored
a paper
2 months ago
N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models upvoted a paper 2 months ago
N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models Organizations
None yet