PATCH BASED STEREO MATCHING USING CONVOLUTIONAL NEURAL NETWORK

ICTACT Journal on Image and Video Processing (Volume: 11, Issue: 3)

Abstract

The paper presents a new Convolutional Neural Network (CNN) architecture, called stacked stereo CNN, for computing a disparity map from stereo images. In stacked stereo CNN, the left and right image patches are stacked back-to-back and fed to a single-tower CNN. This is in contrast to a Siamese network, where two towers are used, one for the left patch and the other for the right patch. The proposed network is trained on a large set of similar and dissimilar image patch pairs, which are generated from stereo images and their ground truth disparity images in the Middlebury stereo datasets. The network returns a dissimilarity score for a pair of image patches, which is used to compute the cost volume. The cost volume is further refined using post-processing steps before the final disparity map is generated. The proposed network is evaluated on the Middlebury datasets and achieves results comparable to those of state-of-the-art algorithms.
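
The pipeline described above (stack the two patches, score the pair, fill a cost volume, pick a disparity) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the patch size, and the use of winner-take-all selection are assumptions made for illustration, the post-processing refinement is omitted, and `placeholder_dissimilarity` is a simple mean absolute difference standing in for the trained network, whose layers are not described in this abstract.

```python
import numpy as np


def stack_patches(left_patch, right_patch):
    """Stack two grayscale patches (H, W) into one 2-channel input (2, H, W).

    This mirrors the single-tower idea: one input tensor holding both patches,
    rather than two separate inputs for two towers.
    """
    return np.stack([left_patch, right_patch], axis=0)


def placeholder_dissimilarity(stacked):
    """Hypothetical stand-in for the trained CNN (assumption, not the paper's model).

    Returns the mean absolute difference between the two stacked channels.
    """
    return float(np.mean(np.abs(stacked[0] - stacked[1])))


def cost_volume(left, right, max_disp, half=2, score=placeholder_dissimilarity):
    """Fill a (max_disp + 1, H, W) cost volume from patch dissimilarity scores.

    For each left-image pixel (y, x) and candidate disparity d, the left patch
    centred at x is paired with the right patch centred at x - d.
    """
    h, w = left.shape
    size = 2 * half + 1
    # Edge padding lets patches be extracted at image borders.
    pad_l = np.pad(left, half, mode="edge")
    pad_r = np.pad(right, half, mode="edge")
    # Positions where x - d falls outside the right image keep an infinite cost.
    volume = np.full((max_disp + 1, h, w), np.inf)
    for d in range(max_disp + 1):
        for y in range(h):
            for x in range(d, w):
                left_patch = pad_l[y:y + size, x:x + size]
                right_patch = pad_r[y:y + size, x - d:x - d + size]
                volume[d, y, x] = score(stack_patches(left_patch, right_patch))
    return volume


def winner_take_all(volume):
    """Pick, per pixel, the disparity with the lowest cost (no refinement)."""
    return np.argmin(volume, axis=0)
```

To try it, build a synthetic pair such as `right = np.random.default_rng(0).random((8, 16))` and `left = np.roll(right, 3, axis=1)`, then call `winner_take_all(cost_volume(left, right, max_disp=4))`. Replacing the `score` argument with a function that runs a trained network on the stacked input would give the kind of learned cost volume the abstract describes.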

Authors

Rachna Verma¹, Arvind Kumar Verma²
¹Jai Narain Vyas University, India; ²Jai Narain Vyas University, India

Keywords

Stereo Vision, Patch Matching, Disparity Map, CNN

Published By
ICTACT
Published In
ICTACT Journal on Image and Video Processing
(Volume: 11, Issue: 3)
Date of Publication
February 2021
Pages
2366-2371