Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Cooper 's Collections
Code
image-quality
video-inpainting
text2video
image-text
advertisement
Map
table
Agent
Deepfake detection
KIE
Comic
cartoon
reasoning
multi-image
music-theory
grounding
VLA
Med
counting
Spatial
video
OCR
STEM
mix-multimodal-datasets
model
image-point
chart
imageCode
image-qa
image-caption
knowledge
GUI

image-qa

updated 2 days ago
Upvote
-

  • Mayfull/LRV-Instruction

    Viewer • Updated Oct 11, 2025 • 181k • 126 • 1

  • nvidia/Nemotron-VLM-Dataset-v2

    Viewer • Updated Dec 18, 2025 • 4.58M • 4.09k • 87

  • array/SAT

    Preview • Updated Feb 16 • 609 • 13

  • WildVision/wildvision-internal-data

    Viewer • Updated Aug 21, 2024 • 155k • 546 • 5

  • PhoenixZ/OmniAlign-V

    Updated Mar 1, 2025 • 313 • 7

  • PhoenixZ/OmniAlign-V-DPO

    Viewer • Updated Mar 1, 2025 • 133k • 99 • 6

  • allenai/pixmo-cap-qa

    Viewer • Updated Dec 5, 2024 • 272k • 239 • 9

  • moonshotai/WorldVQA

    Viewer • Updated Feb 4 • 3k • 1.4k • 65

  • YangyiYY/LVLM_NLF

    Preview • Updated Nov 17, 2023 • 91 • 12

  • pufanyi/MIMICIT

    Viewer • Updated Mar 28, 2024 • 5.1M • 179 • 48

  • MMInstruction/M3IT

    Updated Nov 24, 2023 • 27.4k • 136

  • openlamm/Ch3Ef

    Updated Sep 28, 2024 • 46 • 3

  • AntGroup-MI/Osprey-724K

    Preview • Updated Feb 5, 2024 • 64 • 15

  • dutta18/Physical-Reasoning-VQA-45K

    Viewer • Updated 3 days ago • 64.9k • 23
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs