https://github.com/aaltoml/gp-mvs

Multi-View Stereo by Temporal Nonparametric Fusion

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (12.5%) to scientific vocabulary

Keywords

depth-estimation gaussian-processes iccv2019

Last synced: 5 months ago · JSON representation

Repository

Multi-View Stereo by Temporal Nonparametric Fusion

Basic Info

Host: GitHub
Owner: AaltoML
License: mit
Language: Python
Default Branch: master
Homepage: https://aaltoml.github.io/GP-MVS/
Size: 3.89 MB

Statistics

Stars: 62
Watchers: 6
Forks: 6
Open Issues: 1
Releases: 0

Topics

depth-estimation gaussian-processes iccv2019

Created almost 7 years ago · Last pushed over 6 years ago

https://github.com/AaltoML/GP-MVS/blob/master/

# Code: Multi-view Stereo by Temporal Nonparametric Fusion
[Yuxin Hou](#)  [Juho Kannala](https://users.aalto.fi/~kannalj1/)  [Arno Solin](http://arno.solin.fi)

Codes for the paper:

* Yuxin Hou, Arno Solin, and Juho Kannala (2019). **Multi-view stereo by temporal nonparametric fusion**. *International Conference on Computer Vision (ICCV)*. Seoul, Korea. [[arXiv](https://arxiv.org/abs/1904.06397)] [[video](https://www.youtube.com/watch?v=iellGrlNW7k)] [[project page](https://aaltoml.github.io/GP-MVS/)]

## Summary

We propose a novel idea for depth estimation from unstructured multi-view image-pose pairs, where the model has capability to leverage information from previous latent-space encodings of the scene. This model uses pairs of images and poses, which are passed through an encoder-decoder model for disparity estimation. The novelty lies in soft-constraining the bottleneck layer by a nonparametric Gaussian process prior.

![](assets/ipad-chairs.gif)

*Example depth estimation result running in real-time on an iPad.*

## Prerequisites

* Python3
* Numpy
* Pytorch 0.4.0
* CUDA 9 (You can also run without CUDA, but then you need to remove all  `.cuda()` in codes)
* opencv
* tensorboardX
* imageio
* path.py
* blessings
* progressbar2

## Training

As we mentioned in our paper, the training use the split pretrained MVDepthNet model as statring point.
Check the [link](https://github.com/HKUST-Aerial-Robotics/MVDepthNet) to get the pretrained model.
```
python train.py train_dataset_path --pretrained-dict pretrained_mvdepthnet --log-output
```

## Testing

For testing run
```
python test.py formatted_seq_path --savepath disparity.npy --encoder encoder_path --gp gp_path --decoder decoder_path
```
Our pretrained model can be downloaded [here](https://drive.google.com/open?id=10n-A2H-Of_1hx5Zdy0V7HXxF24xwaYPN).

## Use your own data for testing

The formatted sequence have the folder structure like this:
* `K.txt`: The txt file stores the camera intrinsic matrix 
* `poses.txt`: The text file stores extrinsic matrixs for all frames in the sequence in order.
* `images`: The folder includes all RGB images(.png), and the images are ordered by name.
* `depth`: The folder includes all ground truth depth map(.npy), and the name is matched with the images'name.

We also provide one example sequence: [redkitchen seq-01-formatted](https://drive.google.com/file/d/1VceP2eYjS25NhtJHAYrFhGRFWhr98k3U/view?usp=sharing).

## Acknowledgements

The encoder/decoder codes build on [MVDepthNet](https://github.com/HKUST-Aerial-Robotics/MVDepthNet). Some useful util functions used during training are from [SfmLearner](https://github.com/ClementPinard/SfmLearner-Pytorch). Most of the training data are collected by [DeMoN](https://github.com/lmb-freiburg/demon). We appreciate their work!

## License

Copyright Yuxin Hou, Juho Kannala, and Arno Solin.

This software is provided under the MIT License. See the accompanying LICENSE file for details.

Owner

Name: AaltoML
Login: AaltoML
Kind: organization
Location: Finland

Website: http://arno.solin.fi
Repositories: 20
Profile: https://github.com/AaltoML

Machine learning group at Aalto University lead by Prof. Solin

GitHub Events

Total

Watch event: 3

Last Year

Watch event: 3

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

https://github.com/aaltoml/gp-mvs

Science Score: 10.0%

Keywords

Repository

Basic Info

Statistics

Topics

https://github.com/AaltoML/GP-MVS/blob/master/

Owner

GitHub Events

Total

Last Year