Dinov2 for depth estimation #26057

rfan-debug · 2023-09-08T18:02:02Z

Feature request

Dinov2's original repo has an example using Dinov2 backbone + DPT head for depth estimation notebook link. If we can integrate it into transformers repo by adding a class Dinov2ForImageDepthEstimation and let forward method return DepthEstimatorOutput, we'll have a unified output interface across all depth estimation models. By doing this, we can easily chain this powerful depth estimation method together with other models under transformers's pipelines.

Motivation

This would be a very great feature for many production use cases or research problems. One example is camera angle estimation from a 2D image, in which reliable depth information are critical. In my limited test cases, using dinov2+DPT head to run depth estimation is way better than the existing DPT model itself.

Your contribution

I can submit a PR to add this feature if other professional developers don't have the bandwidth to deal with it. (I am relatively new to transformers's develop workflow though.)

The text was updated successfully, but these errors were encountered:

amyeroberts · 2023-09-08T18:16:36Z

Hi @rfan-debug, this would be a great contribution!

If you'd like to open a PR we'd be happy to review and answer any questions if you need help.

cc @rafaelpadilla

NielsRogge · 2023-09-09T08:13:03Z

Hi,

So I saw they released the DINOv2 checkpoints with a DPT head: https://github.com/facebookresearch/dinov2#pretrained-heads---depth-estimation. I do have a PR which extends DPT to leverage the AutoBackbone API. This means that the DPT head can be used together with any backbone (like ViT, DINOv2, etc.). This way, we could just do the following:

from transformers import Dinov2Config, DPTConfig, DPTForDepthEstimation

backbone_config = Dinov2Config(num_hidden_layers=2, num_attention_heads=4, out_features=["stage1", "stage2", "stage3", "stage4")
config = DPTConfig(backbone_config=backbone_config)
model = DPTForDepthEstimation(config)

=> so would be great to leverage this instead of adding a standalone Dinov2ForDepthEstimation.

rfan-debug · 2023-09-11T16:56:44Z

@NielsRogge Leveraging the AutoBackbone API is a great idea. Thanks for your advice and contributions! I'll follow your code examples.

github-actions · 2023-10-11T08:04:55Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

NielsRogge mentioned this issue Sep 11, 2023

Add DINOv2 depth estimation #26092

Merged

4 tasks

github-actions bot closed this as completed Oct 20, 2023

ducha-aiki mentioned this issue Apr 5, 2024

DPTForDepthEstimation with Dinov2 does not use pretrained weights #30069

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dinov2 for depth estimation #26057

Dinov2 for depth estimation #26057

rfan-debug commented Sep 8, 2023 •

edited

Loading

amyeroberts commented Sep 8, 2023

NielsRogge commented Sep 9, 2023

rfan-debug commented Sep 11, 2023

github-actions bot commented Oct 11, 2023

Dinov2 for depth estimation #26057

Dinov2 for depth estimation #26057

Comments

rfan-debug commented Sep 8, 2023 • edited Loading

Feature request

Motivation

Your contribution

amyeroberts commented Sep 8, 2023

NielsRogge commented Sep 9, 2023

rfan-debug commented Sep 11, 2023

github-actions bot commented Oct 11, 2023

rfan-debug commented Sep 8, 2023 •

edited

Loading