DPT 3.1 release - a nielsr Collection

updated Jan 25, 2024

DPT 3.1 (MiDaS) models, leveraging state-of-the-art vision backbones such as BEiT and SwinV2.

Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

Paper • 1907.01341 • Published Jul 2, 2019

Intel/dpt-beit-large-512

Depth Estimation • Updated Jun 21, 2024 • 2.88k • 8

Note: This model gives the highest quality but is also the most computationally expensive, as mentioned in the paper.

Intel/dpt-beit-large-384

Depth Estimation • Updated Jun 21, 2024 • 67

Intel/dpt-beit-base-384

Depth Estimation • Updated Dec 11, 2023 • 46k • 1

Intel/dpt-swinv2-large-384

Depth Estimation • Updated Jun 21, 2024 • 217

Note: This model has moderately lower quality but offers a better speed-performance trade-off.

Intel/dpt-swinv2-base-384

Depth Estimation • Updated Dec 11, 2023 • 173

Intel/dpt-swinv2-tiny-256

Depth Estimation • Updated Jun 21, 2024 • 2.22k • 9

Note: This model is recommended for deployment on embedded devices.
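
The notes in this collection suggest a simple way to choose a checkpoint: highest quality, a balanced speed-performance trade-off, or embedded deployment. The sketch below shows one possible way to encode that choice and run a model through the Transformers `depth-estimation` pipeline. The checkpoint IDs come from this collection; the profile names (`quality`, `balanced`, `embedded`) and the helper functions `pick_checkpoint` and `estimate_depth` are hypothetical, introduced only for illustration. It assumes `transformers` and `Pillow` are installed in a version recent enough to load these checkpoints.

```python
# Checkpoint IDs are taken from this collection. The profile labels are
# hypothetical names introduced for this sketch, not part of any library.
CHECKPOINTS = {
    "quality": "Intel/dpt-beit-large-512",     # highest quality, most compute
    "balanced": "Intel/dpt-swinv2-large-384",  # better speed-performance trade-off
    "embedded": "Intel/dpt-swinv2-tiny-256",   # recommended for embedded devices
}


def pick_checkpoint(profile: str) -> str:
    """Return the checkpoint ID for a profile label (hypothetical helper)."""
    try:
        return CHECKPOINTS[profile]
    except KeyError:
        valid = ", ".join(sorted(CHECKPOINTS))
        raise ValueError(f"unknown profile {profile!r}; expected one of: {valid}") from None


def estimate_depth(image_path: str, profile: str = "balanced"):
    """Run monocular depth estimation on one image and return the depth map.

    The heavy dependencies are imported lazily so that pick_checkpoint()
    can be used without them installed.
    """
    from PIL import Image
    from transformers import pipeline

    depth_estimator = pipeline("depth-estimation", model=pick_checkpoint(profile))
    result = depth_estimator(Image.open(image_path))
    # The pipeline result also contains "predicted_depth" (a tensor);
    # "depth" is the same prediction rendered as a PIL image.
    return result["depth"]
```

Calling `estimate_depth("photo.jpg", profile="embedded")` would download the tiny SwinV2 checkpoint on first use; `photo.jpg` is a placeholder path.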