About the Cost Volume #4
The original paper concatenates the left and the right features across all disparities. In your case, it just concatenates features from D = 0 to D = 400.
Thanks. So for images whose max disparity is less than 400, I would have to construct the same 400-level volume and fill the remainder with zeros, am I right? And for an input of shape batchsize × 2F × D × H × W, doesn't training require a huge amount of GPU memory?
About your first question, I think you're right. As indicated by the paper, the dimension of the cost volume is DxHxWx2F, which means each feature pair is a DxHxW array. For the second question, yes, you'll need lots of memory to run the model.
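To make the DxHxWx2F layout concrete, here is a minimal NumPy sketch of the concatenation-based cost volume (not taken from this repository, which may use a different framework): at each disparity d, the left feature map is paired with the right feature map shifted d pixels to the right, and columns with no valid match are zero-padded.

```python
import numpy as np

def build_cost_volume(left_feat, right_feat, max_disp):
    """Concatenation cost volume, shape (max_disp, H, W, 2F).

    left_feat, right_feat: (F, H, W) unary feature maps.
    At disparity d, left pixel (y, x) is paired with right pixel (y, x - d);
    columns x < d have no valid match and stay zero.
    """
    F, H, W = left_feat.shape
    cost = np.zeros((max_disp, H, W, 2 * F), dtype=left_feat.dtype)
    for d in range(max_disp):
        # left half of the channel axis: left features, unshifted
        cost[d, :, :, :F] = np.moveaxis(left_feat, 0, -1)
        # right half: right features shifted by d (zero padding for x < d)
        if d == 0:
            cost[d, :, :, F:] = np.moveaxis(right_feat, 0, -1)
        else:
            cost[d, :, d:, F:] = np.moveaxis(right_feat[:, :, :-d], 0, -1)
    return cost
```

For an image with a smaller true disparity range, the volume still has `max_disp` levels; the extra levels simply contain feature pairs that never match well.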
Can your implementation reproduce the results of the paper?
Hi, unfortunately I haven't trained the model with Scene_Flow data. It seems that the model will run out of memory if the hyperparameters are set too high. In addition, it took me more than 15 seconds to run an iteration with a batch size of 1.
Well, I have only 16 GB of memory and a single TitanX GPU, and I wonder: if D is set as high as 400, will it run out of memory?
Yes, it will run out of memory.
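A back-of-the-envelope check makes this plausible. Assuming illustrative sizes (64 channels for 2F, 256x512 crops, float32; only D = 400 comes from this thread), a single cost volume alone already exceeds a 12 GB TitanX, before counting any 3D-convolution activations or gradients:

```python
# Hypothetical sizes for illustration: 2F = 64 channels, D = 400 disparities,
# 256 x 512 crop, 4 bytes per float32 element.
volume_bytes = 64 * 400 * 256 * 512 * 4
print(volume_bytes / 2**30, "GiB")  # prints 12.5 GiB
```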
It seems the final layer (soft argmin) limits the output value since it is an affine combination of disparity values. I am pondering if we can use linear combinations instead. In that case, we might be able to reduce disparity levels. |
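For reference, the soft argmin in question takes a softmax over the negated matching costs and uses the result as weights on the disparity values, so the output is a convex (and hence bounded) combination of disparities. A small NumPy sketch, not taken from this repository:

```python
import numpy as np

def soft_argmin(cost):
    """Soft argmin over a (D, H, W) cost volume; lower cost = better match.

    Returns a (H, W) map of expected disparities:
    d_hat = sum_d d * softmax(-cost)_d, which always lies in [0, D-1].
    """
    neg = -cost
    neg -= neg.max(axis=0, keepdims=True)  # numerical stability
    probs = np.exp(neg)
    probs /= probs.sum(axis=0, keepdims=True)
    disparities = np.arange(cost.shape[0]).reshape(-1, 1, 1)
    return (probs * disparities).sum(axis=0)
```

Because the softmax weights are nonnegative and sum to one, the output can never leave the [0, D-1] range; dropping that constraint (general linear combinations) is what the comment above proposes.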
I have updated the repository to make it easier to use. Please check it. |
@LinHungShi I can't understand how they train the left and the right features without image patches. Could you help me? |
Could you explain your problem in more details? From what I understand, they do patch the images during training. |
I thought the cost volume is constructed from the left features and the corresponding right features shifted by disparity d. But for the Middlebury dataset the disparity range differs per image, and some ranges are as large as 400, so I wonder how this can be handled.