Generate gaze pattern and reconstruction videos from any video
Generate text answers from live camera images
Discussions about the Inference Providers feature on the Hub
Generate answers by combining text and images
Segment images using text prompts, points, or everything mode