Physical AI Bench Leaderboard
Benchmark for Physical AI generation and understanding
Computer Vision, AI, Machine Learning
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation
Visualize image depth, segmentation, and generation
Describe video content with text prompts
Generate text based on images and input text
Generate image segmentation overlays and maps