Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
Paper: arXiv 2410.02073
DepthPro is a monocular depth estimation model: it is trained to predict per-pixel depth from a single image.
Model inputs and outputs

DepthProNormalizedInverseDepth models
Inputs
image: 1536x1536 3-channel color image.
Outputs
normalizedInverseDepth: 1536x1536 monochrome image.

DepthPro models
Inputs
image: 1536x1536 3-channel color image.
originalWidth: 1x1x1x1 Tensor containing the original width of the image before resizing.
Outputs
depthMeters: 1x1x1536x1536 Tensor containing depth in meters.

Install huggingface-cli
brew install huggingface-cli
To download one of the .mlpackage folders to the models directory:
huggingface-cli download \
--local-dir models --local-dir-use-symlinks False \
KeighBee/coreml-DepthPro \
--include "DepthProNormalizedInverseDepthPruned10QuantizedLinear.mlpackage/*" "DepthProPruned10QuantizedLinear.mlpackage/*"
To download everything, skip the --include argument.
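The same download can also be scripted. The sketch below is one possible Python equivalent of the command above, assuming the `huggingface_hub` package is installed; the helper names `include_patterns` and `download_packages` are hypothetical and not part of this repository.

```python
from pathlib import Path

REPO_ID = "KeighBee/coreml-DepthPro"


def include_patterns(package_names):
    """Build one glob per .mlpackage folder, mirroring the --include argument."""
    return [f"{name}.mlpackage/*" for name in package_names]


def download_packages(package_names=None, local_dir="models"):
    """Download the selected .mlpackage folders, or the whole repo if none are named."""
    # Imported lazily so include_patterns() works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    # allow_patterns=None places no filter, like skipping --include on the CLI.
    patterns = include_patterns(package_names) if package_names else None
    return Path(
        snapshot_download(
            repo_id=REPO_ID,
            local_dir=local_dir,
            allow_patterns=patterns,
        )
    )
```

Calling `download_packages(["DepthProPruned10QuantizedLinear"])` would fetch only that package into `models`, while `download_packages()` would fetch everything.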
The huggingface/coreml-examples repository contains sample Swift code for DepthProNormalizedInverseDepthPruned10QuantizedLinear.mlpackage and other models. See the instructions there to build the demo app, which shows how to use the model in your own Swift apps.
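For post-processing outside the demo app, it can help to see how a metric depth map such as depthMeters might relate to a normalized inverse depth image. The sketch below is illustrative only: it assumes min-max normalization of 1/depth and strictly positive depths, which this card does not confirm as the model's exact method, and the function names are hypothetical.

```python
def normalized_inverse_depth(depth_m):
    """Map a 2D list of positive depths (meters) to [0, 1]; 1 is the nearest point."""
    inverse = [[1.0 / d for d in row] for row in depth_m]
    lo = min(min(row) for row in inverse)
    hi = max(max(row) for row in inverse)
    span = hi - lo
    if span == 0:
        # A perfectly flat scene has no range to normalize over.
        return [[0.0 for _ in row] for row in inverse]
    # Assumption: min-max scaling; the model's own normalization may differ.
    return [[(v - lo) / span for v in row] for row in inverse]


def to_gray8(normalized):
    """Convert values in [0, 1] to 8-bit gray levels for display."""
    return [[round(v * 255) for v in row] for row in normalized]


depth = [[1.0, 2.0], [4.0, 4.0]]
gray = to_gray8(normalized_inverse_depth(depth))
```

Near surfaces come out bright and far surfaces dark, which is the usual convention for inverse depth visualizations.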
Base model: apple/DepthPro