Unfortunately that triplet loss is flawed. The most offending negative sample has zero gradient. That power of 2 should be a power of ½. I feel bad so many people still use it. 😕 https://t.co/M3daSGzlMK
Yes, I'm aware, I commented on the thread as well.
The implementation is technically correct, it follows the loss formulation from the papers.
But if we look at gradients it can indeed be problematic and suboptimal.
Even though this formulation often seems to work in practice, users should be aware of the potential issues - I'll add a clarification and alternative loss formulations.
From what I understood from the Twitter discussion, the power of ½ creates a stronger push (i.e., a larger gradient) against negatives when they are close to the anchor. Is that correct?
Moreover, what's the point of the margin when, from what I understand, it is zeroed out in the gradient calculation?
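If it helps the discussion, here is a minimal numpy sketch of the gradient argument as I understand it. The helper `grad_norms` is just for illustration (not from any library): it compares the gradient w.r.t. the negative embedding for the squared distance ||a - n||², which shrinks to zero as the hardest negative approaches the anchor, versus the plain distance ||a - n||, which keeps unit-norm gradient no matter how close the negative is.

```python
import numpy as np

def grad_norms(a, n):
    """Gradient magnitudes w.r.t. n for d^2 = ||a-n||^2 and d = ||a-n||."""
    diff = a - n
    dist = np.linalg.norm(diff)
    grad_sq = -2.0 * diff        # d/dn of ||a-n||^2: vanishes as n -> a
    grad_lin = -diff / dist      # d/dn of ||a-n||: always unit norm
    return np.linalg.norm(grad_sq), np.linalg.norm(grad_lin)

a = np.array([1.0, 0.0])
for eps in (1.0, 0.1, 0.01):
    n = a + np.array([eps, 0.0])  # negative at distance eps from the anchor
    g2, g1 = grad_norms(a, n)
    print(f"dist={eps:5.2f}  |grad d^2|={g2:.4f}  |grad d|={g1:.4f}")
```

So with the squared formulation the hardest (closest) negative gets almost no push, which matches the claim in the tweet; with the plain distance the push stays constant.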
https://twitter.com/alfcnz/status/1133372277876068352
There's some discussion going on in the replies to that tweet as well, but if there is an issue it should be addressed here.