Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
nielsr 's Collections
Image-to-text models
SigLIP release
DPT 3.0 release
DPT 3.1 release
Depth Anything release

DPT 3.1 release

updated Jan 25, 2024

DPT 3.1 (MiDaS) models, leveraging state-of-the-art vision backbones such as BEiT and Swinv2

Upvote
1

  • Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

    Paper • 1907.01341 • Published Jul 2, 2019

  • Intel/dpt-beit-large-512

    Depth Estimation • Updated Jun 21, 2024 • 2.88k • 8

    Note This model gives the highest quality, but is also the most heavy in terms of computation as mentioned in the paper.


  • Intel/dpt-beit-large-384

    Depth Estimation • Updated Jun 21, 2024 • 67

  • Intel/dpt-beit-base-384

    Depth Estimation • Updated Dec 11, 2023 • 46k • 1

  • Intel/dpt-swinv2-large-384

    Depth Estimation • Updated Jun 21, 2024 • 217

    Note This model has moderately less quality, but has a better speed-performance trade-off


  • Intel/dpt-swinv2-base-384

    Depth Estimation • Updated Dec 11, 2023 • 173

  • Intel/dpt-swinv2-tiny-256

    Depth Estimation • Updated Jun 21, 2024 • 2.22k • 9

    Note This model is recommended for deployment on embedded devices

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs