Skip to content

This repository contains code and results for Course Project by Deep Learning Spring 2020 course offered at Information Technology University, Lahore, Pakistan. This project is only for learning purposes and is not intended to be used for commercial purposes.

Notifications You must be signed in to change notification settings

wassam21/G1N_Project_DLSpring2020

Repository files navigation

G1N_Project_DLSpring2020

"This repository contains code and results for Course Project by Deep Learning Spring 2020 course offered at Information Technology University, Lahore, Pakistan. This project is only for learning purposes and is not intended to be used for commercial purposes." Course Link: http://im.itu.edu.pk/deep-learning-spring-2020/

Note

This project uses MonoDepth as a base project. For original code and paper, please refer to original github website "https://github.com/mrharicot/monodepth#models".

Adding Losses to Monodepth Estimation with Left-Right Consistency

Abstract

The arrival of autonomous robots and self-driving cars have sprawled computer vision demanding novel and more rigid work in this field. Depth Estimation is one such application whereby we estimate the distance between various objects in an image or a video frame. Hence, we extract out spatial information. We can mine out this information by algorithms and related techniques but, we propose a deep network that solves the same problem. We pose the problem as an image reconstruction by taking left and right images from a calibrated binocular camera.The network works by taking both images and returning four more images all of varying scales, along with losses, called per pixel scene depth. The network then selects the finest scale and uses it to output a depth map at the original image’s resolution. During training, we take both left and right images however, only a single image is required at testing time. Hence, the network can be deployed on images taken by ordinary cameras. KITTI and Cityscapes datasets were used on this network.

Dataset

This model requires rectified stereo pairs for training. There are two main datasets available:

KITTI

link

Cityscapes

link

System Diagram / Pipeline

Figure is take from [1]. pipeline

Results

                       Pre-Trained Model                                            Our Model

bresults

References

[1] Godard, Clément, Oisin Mac Aodha, and Gabriel J. Brostow. "Unsupervised monocular depth estimation with left-right consistency." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 270-279. 2017.

About

This repository contains code and results for Course Project by Deep Learning Spring 2020 course offered at Information Technology University, Lahore, Pakistan. This project is only for learning purposes and is not intended to be used for commercial purposes.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages