ICTACT Journals - View Articles

Abstract

The paper presents a new Convolutional Neural Network (CNN) architecture, called stacked stereo CNN, for computing disparity maps from stereo images. In stacked stereo CNN, the left and right image patches are stacked back-to-back and fed to a single-tower CNN. This is in contrast to a Siamese network, where two towers are used, one for the left patch and the other for the right patch. The proposed network is trained on a large set of similar and dissimilar image patch pairs, which are generated from stereo images and their ground-truth disparity images in the Middlebury stereo datasets. The network returns a dissimilarity score for a pair of image patches, which is used to compute the cost volume. The cost volume is further refined using post-processing steps before the final disparity map is generated. The proposed network is evaluated on the Middlebury datasets and achieves results comparable to state-of-the-art algorithms.
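
The pipeline described in the abstract (stack a left and a right patch, score the pair, fill a cost volume) can be illustrated with a minimal sketch in plain Python. Everything below is an assumption made for illustration and is not taken from the paper: the function names, the choice to stack the two patches as two channels, the patch size, and in particular `placeholder_score`, which uses a mean absolute difference as a stand-in for the trained network's dissimilarity output. The final `winner_take_all` step simply picks the lowest-cost disparity per pixel and skips the post-processing refinement the abstract mentions.

```python
def extract_patch(image, row, col, half):
    """Return the square patch of side 2*half+1 centred at (row, col),
    or None if the patch would extend outside the image."""
    h, w = len(image), len(image[0])
    if row - half < 0 or row + half >= h or col - half < 0 or col + half >= w:
        return None
    return [image[r][col - half:col + half + 1]
            for r in range(row - half, row + half + 1)]


def stack_patches(left_patch, right_patch):
    """Combine both patches into one two-channel input (assumed layout)."""
    return [left_patch, right_patch]


def placeholder_score(stacked):
    """Hypothetical stand-in for the trained CNN: mean absolute difference
    between the two channels (lower means more similar)."""
    left, right = stacked
    diffs = [abs(a - b)
             for l_row, r_row in zip(left, right)
             for a, b in zip(l_row, r_row)]
    return sum(diffs) / len(diffs)


def cost_volume(left, right, max_disp, half, score=placeholder_score):
    """volume[r][c][d] = dissimilarity between the left patch at (r, c)
    and the right patch at (r, c - d); infinity where a patch is invalid."""
    h, w = len(left), len(left[0])
    inf = float("inf")
    volume = [[[inf] * (max_disp + 1) for _ in range(w)] for _ in range(h)]
    for r in range(h):
        for c in range(w):
            left_patch = extract_patch(left, r, c, half)
            if left_patch is None:
                continue
            for d in range(max_disp + 1):
                right_patch = extract_patch(right, r, c - d, half)
                if right_patch is not None:
                    stacked = stack_patches(left_patch, right_patch)
                    volume[r][c][d] = score(stacked)
    return volume


def winner_take_all(volume):
    """Pick the lowest-cost disparity per pixel (no refinement applied)."""
    return [[min(range(len(costs)), key=costs.__getitem__) for costs in row]
            for row in volume]


# Synthetic example: the left image is the right image shifted by 2 pixels.
def texture(r, c):
    return (r * 31 + c * c * 7) % 101

right_image = [[texture(r, c) for c in range(12)] for r in range(7)]
left_image = [[texture(r, c - 2) for c in range(12)] for r in range(7)]

volume = cost_volume(left_image, right_image, max_disp=3, half=1)
disparity = winner_take_all(volume)
```

In this toy setup the score function is passed in as a parameter, so a trained model's forward pass could be substituted for `placeholder_score` without changing the cost-volume loop.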

Authors

Rachna Verma¹, Arvind Kumar Verma²
Jai Narain Vyas University, India¹, Jai Narain Vyas University, India²

Keywords

Stereo Vision, Patch Matching, Disparity Map, CNN

Published By :
ICTACT

Published In :
ICTACT Journal on Image and Video Processing
(Volume: 11, Issue: 3, Pages: 2366-2371)

Date of Publication :
February 2021

DOI :
10.21917/ijivp.2021.0336

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.