
Multi-resolution Monocular Depth Map Fusion by Self-supervised Gradient-based Composition

This repository contains code and models for our paper:

[1] Yaqiao Dai, Renjiao Yi, Chenyang Zhu, Hongjun He, Kai Xu, Multi-resolution Monocular Depth Map Fusion by Self-supervised Gradient-based Composition, AAAI 2023

Changelog

  • [November 2022] Initial release of code and models

Setup

  1. Download the code.
 git clone https://github.com/YuiNsky/Gradient-based-depth-map-fusion.git
 cd Gradient-based-depth-map-fusion
  2. Set up dependencies.

    2.1 Create the conda virtual environment.

    conda env create -f GBDF.yaml
    conda activate GBDF

    2.2 Install PyTorch in the virtual environment.

    pip install torch==1.9.1+cu111 torchvision==0.10.1+cu111 torchaudio==0.9.1 -f https://download.pytorch.org/whl/torch_stable.html
  3. Download the fusion model model_dict.pt and place it in the folder models.

  4. Download one or more pretrained backbone models (see the directory layout sketch after this list).

    LeRes: res50.pth or res101.pth, place in the folder LeRes.

    DPT: dpt_hybrid-midas-501f0c75.pt, place in the folder dpt/weights.

    SGR: model.pth.tar, place in the folder SGR.

    MiDaS: model.pt, place in the folder MiDaS.

    NeWCRFs: model_nyu.ckpt, place in the folder newcrfs.

  5. The code was tested with Python 3.8, PyTorch 1.9.1, and OpenCV 4.6.0.
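
After the downloads, the repository should look roughly like this (a sketch assuming all backbones are installed, with the folder names listed above):

    Gradient-based-depth-map-fusion/
    ├── models/
    │   └── model_dict.pt
    ├── LeRes/
    │   ├── res50.pth
    │   └── res101.pth
    ├── dpt/
    │   └── weights/
    │       └── dpt_hybrid-midas-501f0c75.pt
    ├── SGR/
    │   └── model.pth.tar
    ├── MiDaS/
    │   └── model.pt
    └── newcrfs/
        └── model_nyu.ckpt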

Usage

  1. Place one or more input images in the folder input.

  2. Run our model with a monocular depth estimation method:

    python run.py -p LeRes50
  3. The results are written to the folder output; each result combines the input image, the backbone prediction, and our prediction.

Use the flag -p to switch between different backbones. Possible options are LeRes50 (default), LeRes101, SGR, MiDaS, DPT, and NeWCRFs.
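
To run the inputs through every backbone in one go, here is a shell sketch built only on the documented run.py flag (it assumes all pretrained models from the setup step are in place):

    for p in LeRes50 LeRes101 SGR MiDaS DPT NeWCRFs; do
        python run.py -p $p
    done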

Evaluation

Our evaluation covers three published high-resolution datasets: Multiscopic, Middlebury 2021, and Hypersim.

To evaluate our model on Multiscopic, you can download the dataset here. Download the test dataset, rename it multiscopic, and place it in the folder datasets.

To evaluate our model on Middlebury 2021, you can download the dataset here. Unzip it, rename it 2021mobile, and place it in the folder datasets.

To evaluate our model on Hypersim, you can download the whole dataset here. We also provide the evaluation subsets hypersim; download the subsets and place them in the folder datasets.

Then you can evaluate our fusion model with a specified monocular depth estimation method and dataset:

python eval.py -p LeRes50 -d middleburry2021

Use the flag -p to switch between different backbones. Possible options are LeRes50 (default), SGR, MiDaS, DPT and NeWCRFs.

Use the flag -d to switch between different datasets. Possible options are middleburry2021 (default), multiscopic and hypersim.
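
To sweep every documented backbone and dataset combination, here is a shell sketch using only the eval.py flags listed above (it assumes all three datasets are prepared in the folder datasets):

    for d in middleburry2021 multiscopic hypersim; do
        for p in LeRes50 SGR MiDaS DPT NeWCRFs; do
            python eval.py -p $p -d $d
        done
    done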

Training

Our model was trained with the LeRes backbone on the HR-WSI dataset. We preprocess the dataset with a guided filter and select high-quality samples as training data based on Canny edge detection (see the sketch below). You need to download the preprocessed dataset HR and place it in the folder datasets.
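
The preprocessing script itself is not included in this repository, but the described pipeline can be sketched as follows. This is an illustration only: it assumes OpenCV's contrib guidedFilter, and the radius, eps, Canny thresholds, and edge-ratio acceptance test are placeholder choices, not the authors' settings.

    import cv2
    import numpy as np

    # Illustrative sketch of the described preprocessing: smooth each HR-WSI
    # depth map with a guided filter (guided by the RGB image), then keep only
    # samples whose filtered depth still shows crisp edges under Canny.
    # All parameter values below are placeholders, not the authors' settings.
    def preprocess(rgb_path, depth_path, edge_ratio_min=0.01):
        rgb = cv2.imread(rgb_path)
        depth = cv2.imread(depth_path, cv2.IMREAD_GRAYSCALE)
        # Edge-preserving smoothing: the RGB image guides the depth map.
        filtered = cv2.ximgproc.guidedFilter(guide=rgb, src=depth, radius=8, eps=1e-2)
        # Quality check: fraction of pixels that Canny marks as depth edges.
        edges = cv2.Canny(filtered, 50, 150)
        ratio = float(np.count_nonzero(edges)) / edges.size
        return filtered if ratio >= edge_ratio_min else None  # drop low-quality samples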

Then you can train the fusion model on a GPU:

python train.py

Citation

Please cite our paper if you use this code.

@article{dai2022multi,
  title={Multi-resolution Monocular Depth Map Fusion by Self-supervised Gradient-based Composition},
  author={Dai, Yaqiao and Yi, Renjiao and Zhu, Chenyang and He, Hongjun and Xu, Kai},
  journal={arXiv preprint arXiv:2212.01538},
  year={2022}
}

License

MIT License


Issues

Link to paper

Looks very good! I would love to read the paper. Do you have a link?

ZoeDepth

It would be cool if you added the latest SOTA, https://github.com/isl-org/ZoeDepth, to this repo.
I've tried it myself but failed to run ZoeDepth with different resolutions (it seems the image is always resized to the same size somewhere inside the model).

About the D3R and ORD metrics

Is there any possibility of releasing the D3R and ORD evaluation code? I tried using the code directly from previous work and found that the results were not consistent.

Work with omnidata normal/depth DPT network weights?

Excuse my ignorance, but I would love to use the weights from omnidata; my understanding is that they use the same hybrid DPT network architecture to generate high-resolution normal and depth maps.

I see that they use PyTorch Lightning, which would require converting the ckpt models to .pt (a conversion sketch follows below), but my bigger question is: would this also work for the normal estimation network? I don't fully understand how this model works, so I wanted to ask. Thanks!
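
For reference, converting a PyTorch Lightning checkpoint to a plain .pt is usually just a matter of extracting and re-keying its state_dict. A minimal sketch, assuming the typical Lightning layout with a 'model.' key prefix (the actual key layout of the omnidata checkpoints, and the file names used here, may differ):

    import torch

    # Load the Lightning checkpoint and pull out the raw weights.
    ckpt = torch.load("omnidata_dpt_depth.ckpt", map_location="cpu")
    state_dict = ckpt.get("state_dict", ckpt)
    # Strip the Lightning module prefix so keys match the bare DPT network.
    cleaned = {k.replace("model.", "", 1): v for k, v in state_dict.items()}
    torch.save(cleaned, "omnidata_dpt_depth.pt")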

Evaluation on KITTI

Hi,

I am trying to reproduce the KITTI experiment from your paper to compare to in my upcoming paper.
A couple of questions:

  1. Did you use the same fusion model (the one trained with LeRes on the HR dataset linked in this GitHub repo) on all datasets in the paper, including KITTI?
  2. Can you share the specific details of the KITTI experiment (the low-res and high-res input dimensions to the fusion net, and the scale-shift alignment method used for evaluation; a sketch of the standard variant follows below)? Also, if you could share the evaluation script for KITTI, it would be much appreciated.

Thanks!
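
For context, the scale-shift alignment commonly used in affine-invariant depth evaluation (e.g. in MiDaS) is a closed-form least-squares fit; whether this repository uses exactly that variant is part of what is being asked. A numpy sketch:

    import numpy as np

    # Least-squares scale-and-shift alignment of a predicted depth map to
    # ground truth over valid pixels: solve min over (s, t) of
    # || s * pred[mask] + t - gt[mask] ||^2, then apply (s, t) everywhere.
    def align_scale_shift(pred, gt, mask):
        p, g = pred[mask], gt[mask]
        A = np.stack([p, np.ones_like(p)], axis=1)  # design matrix [pred, 1]
        (s, t), *rest = np.linalg.lstsq(A, g, rcond=None)
        return s * pred + t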
