Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Pu Fanyi's picture
7 71 145

Pu Fanyi

pufanyi
KairuiHu's profile picture khang119966's profile picture NickyNicky's profile picture
·
https://pufanyi.github.io
  • pufanyi
  • pufanyi

AI & ML interests

CV

Recent Activity

upvoted a paper 8 days ago
From Pixels to Words -- Towards Native One-Vision Models at Scale
liked a model 17 days ago
FastVideo/CausalWan2.2-I2V-A14B-Preview-Diffusers
liked a dataset 17 days ago
FastVideo/Wan-Syn_77x448x832_600k
View all activity

Organizations

Nanyang Technological University's profile picture SenseNova's profile picture LMMs-Lab's profile picture LongVa's profile picture Evolve-lmms-lab's profile picture LMMs-Lab-SI's profile picture LMMs-Lab-Speedrun's profile picture

authored a paper 3 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373
authored a paper 7 months ago

Scaling Spatial Intelligence with Multimodal Foundation Models

Paper • 2511.13719 • Published Nov 17, 2025 • 50
authored a paper over 1 year ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23, 2025 • 24
authored a paper almost 2 years ago

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17, 2024 • 35
authored a paper over 2 years ago

OtterHD: A High-Resolution Multi-modality Model

Paper • 2311.04219 • Published Nov 7, 2023 • 34
authored a paper almost 3 years ago

MIMIC-IT: Multi-Modal In-Context Instruction Tuning

Paper • 2306.05425 • Published Jun 8, 2023 • 12
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs