Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How the grad flow back while training APN? #5

Open
JohnnieXDU opened this issue Mar 19, 2018 · 4 comments
Open

How the grad flow back while training APN? #5

JohnnieXDU opened this issue Mar 19, 2018 · 4 comments

Comments

@JohnnieXDU
Copy link

Hi, im trying to reimplement RACNN, and your src code helps me a lot.

But here are some questions confuse me.

  1. How the grad flow back to APN while training APN? I mean that APN have 2 inputs: last image and coordinates[tx, ty, tl] from FC layer, and the output is cropped finer image. But while backprop, how to compute grad of [tx, ty, tl]?

  2. Rank loss indeed takes as inputs two probabilities in paper, how to optimize 3 APN networks from 2 loss?

Thanks a lot.

@ouceduxzk
Copy link
Owner

  1. this is nota a fully end-2-end training process, basically , you need to manually save the tx, ty, t from the highest response value of conv5_4 and you use this as the ground truth for training apn, you do not need to take care of gradient update at all
  2. the idea is that net1 and net2 both gives a score s1 and s2, you use the rank that s2 > s1 to guide the net to learn better in the finer image

@zhangjian0826
Copy link

@ouceduxzk Hi, thanks for your code, but how to save the tx, ty, tl from the highest response value of conv5_4 ? Could you show me the code or just the steps?

@ouceduxzk
Copy link
Owner

@zhangjian0826 I did not have the code now, but the idea is that for each image, you do the inference and save the output of the conv5_4 and then you do some calculation on top of this output to find out the x, y coord of the max value

@jackshaw
Copy link

@ouceduxzk ,Hi,how do you think should we caculate the coordinate of the max response of conv5_4 ?I sum the feature maps along 512 channels to a single feature map and then find the location of the max response of this single feature map.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants