arxiv:2601.09708
Min-Hung Chen
AI & ML interests
Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning
Recent Activity
new activity about 14 hours ago
nvidia/4D-RGPT-8B: fix links
liked a model about 14 hours ago
nvidia/4D-RGPT-8B
upvoted a paper 4 days ago
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models