The paper presents a new Convolutional Neural Network (CNN) architecture, called stacked stereo CNN, for computing disparity maps from stereo images. In stacked stereo CNN, left and right image patches are stacked back-to-back and fed to a single-tower CNN. This is in contrast to a Siamese network, where two towers are used, one for the left patch and the other for the right patch. The proposed network is trained on a large set of similar and dissimilar image patches, which are generated from stereo images and their ground-truth images from the Middlebury stereo datasets. The network returns a dissimilarity score for a pair of image patches, which is used to compute the cost volume. The cost volume is further refined using post-processing steps before the final disparity map is generated. The proposed network is evaluated on the Middlebury datasets and achieves results comparable to those of state-of-the-art algorithms.
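The pipeline outlined in the abstract (stack a left and a right patch, score the pair, fill a cost volume, pick a disparity) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names are hypothetical, "stacking" is modelled as a two-channel input, the trained CNN is replaced by a plain mean absolute difference, and the post-processing refinement is omitted in favour of a simple winner-take-all selection.

```python
# Illustrative sketch only: names, patch layout, and the scoring function
# are assumptions, not the architecture or code from the paper.

def extract_patch(image, row, col, half):
    """Return a (2*half+1) x (2*half+1) patch centred at (row, col).
    Coordinates outside the image are clamped to the nearest border pixel."""
    h, w = len(image), len(image[0])
    return [
        [image[min(max(r, 0), h - 1)][min(max(c, 0), w - 1)]
         for c in range(col - half, col + half + 1)]
        for r in range(row - half, row + half + 1)
    ]

def stack_patches(left_patch, right_patch):
    """Combine both patches into one two-channel input (assumed layout)."""
    return [left_patch, right_patch]

def dissimilarity(stacked):
    """Stand-in for the trained single-tower network: mean absolute
    difference between the two channels (0.0 means identical patches)."""
    left, right = stacked
    total = sum(abs(a - b)
                for l_row, r_row in zip(left, right)
                for a, b in zip(l_row, r_row))
    return total / (len(left) * len(left[0]))

def cost_volume(left, right, max_disp, half=1):
    """volume[r][c][d] = dissimilarity between the left patch at (r, c)
    and the right patch at (r, c - d)."""
    h, w = len(left), len(left[0])
    volume = [[[0.0] * (max_disp + 1) for _ in range(w)] for _ in range(h)]
    for r in range(h):
        for c in range(w):
            left_patch = extract_patch(left, r, c, half)
            for d in range(max_disp + 1):
                right_patch = extract_patch(right, r, c - d, half)
                volume[r][c][d] = dissimilarity(
                    stack_patches(left_patch, right_patch))
    return volume

def winner_take_all(volume):
    """Pick, for every pixel, the disparity with the lowest cost."""
    return [[min(range(len(costs)), key=costs.__getitem__) for costs in row]
            for row in volume]

# Synthetic pair: the left view is the right view shifted by 2 pixels.
right = [[c * c + r for c in range(10)] for r in range(5)]
left = [[right[r][max(c - 2, 0)] for c in range(10)] for r in range(5)]

volume = cost_volume(left, right, max_disp=3)
disparity = winner_take_all(volume)
```

On the synthetic pair, interior pixels should recover the two-pixel shift. In the setting the abstract describes, `dissimilarity` would be the output of the trained network, and the cost volume would be refined before the disparity map is read off.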
Rachna Verma¹, Arvind Kumar Verma²
¹Jai Narain Vyas University, India; ²Jai Narain Vyas University, India
Stereo Vision, Patch Matching, Disparity Map, CNN
Published By: ICTACT
Published In: ICTACT Journal on Image and Video Processing (Volume: 11, Issue: 3, Pages: 2366-2371)
Date of Publication: February 2021