DepthNet

Monocular depth estimation with ResNet-50

Data Sources

I collect most of the depth data for this project using Depth Dector, an iOS Application i created to obtain depth information via iDevices's TrueDepth camera.

NYU Depth Dataset V2

The NYU-Depth V2 data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinect.

Model

Architecture

Architecuture figure from Iron Laina et al. paper

The images are first feed into a ResNet-50 network and then go through four upsampling blocks. Different from what described in the figure, our model has an input size of 320x240x3 (WxHxC) and output size of 160x128x1 (WxHxC).

Features to implement

Data augmentation
More loss functions
Experiments on Upsample/Upconvolution block

Citation

Deep Optics for Monocular Depth Estimation and 3D Object Detection by Julie Chang and Gordon Wetzstein

Depth Estimation from Single Image Using CNN-Residual Network by Xiaobai Ma, Zhenglin Geng, and Zhi Bie

Deeper Depth Prediction with Fully Convolutional Residual Networks by Iro Laina

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

DepthNet

Data Sources

NYU Depth Dataset V2

Model

Architecture

Features to implement

Citation

Files

README.md

Latest commit

History

README.md

File metadata and controls

DepthNet

Data Sources

NYU Depth Dataset V2

Model

Architecture

Features to implement

Citation