Why don't you use IOU for training?

When people solve semantic segmentation with CNNs, they usually use a softmax cross-entropy loss during training (see e.g. Long et al., "Fully Convolutional Networks for Semantic Segmentation"). But when it comes to comparing the performance of different approaches, measures such as Intersection over Union (IoU) are reported.

My question is: why don't people train directly on the measure they actually want to optimize? It seems strange to me to optimize one measure during training but then evaluate with a different one at test time.

I see that IoU causes problems for training samples where a class is absent (union = 0 and intersection = 0 => zero divided by zero). But if I can ensure that every sample in my ground truth contains all classes, is there another reason not to use this measure?
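To make the edge case concrete, here is a minimal sketch of a per-class IoU on binary masks (my own illustrative code, not from any particular framework). A small `eps` term is one common workaround for the 0/0 case the question describes:

```python
# Hypothetical sketch: IoU for one class on flat binary masks,
# illustrating the 0/0 problem when a class is absent from both
# the prediction and the ground truth.

def iou(pred, target, eps=1e-6):
    """IoU between two binary masks given as flat lists of 0/1.

    With eps, an entirely empty class yields eps/eps = 1.0
    instead of a division-by-zero error.
    """
    intersection = sum(p * t for p, t in zip(pred, target))
    union = sum(max(p, t) for p, t in zip(pred, target))
    return (intersection + eps) / (union + eps)

print(iou([1, 1, 0, 0], [1, 0, 1, 0]))  # 1 overlap / 3 union ≈ 0.333
print(iou([0, 0, 0, 0], [0, 0, 0, 0]))  # empty class: eps/eps = 1.0
```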

+4
3 answers

Check out this article, where they come up with a way to make IoU differentiable. I implemented their solution, with amazing results!

+2

It is a bit like asking: "Why don't we optimize the test error directly?" The standard loss (softmax cross-entropy) is used because it is differentiable and provides a well-behaved, per-pixel gradient. The IoU measure, by contrast, is built from hard decisions (intersections and unions of discrete label masks), so in its plain form it is not differentiable and cannot be minimized by gradient descent.
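For contrast with IoU, here is a minimal per-pixel softmax cross-entropy, the loss mentioned in the question (a plain-Python sketch with made-up numbers, not any particular framework's API):

```python
import math

def softmax(logits):
    # Numerically stable softmax over one pixel's class scores.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def pixel_cross_entropy(logits, label):
    """Negative log-probability of the true class for one pixel.
    Smooth in the logits, unlike the hard argmax that IoU is built on."""
    return -math.log(softmax(logits)[label])

# A confident correct pixel has low loss; a confident wrong one has high loss.
print(pixel_cross_entropy([5.0, 0.0, 0.0], 0))  # small (correct class)
print(pixel_cross_entropy([5.0, 0.0, 0.0], 1))  # large (wrong class)
```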

+1

One point in this direction, which earlier comments also make, is that the loss must be differentiable. I believe there is nothing in the IoU measure itself that the network could use to say: "Hey, it's not quite right here, but maybe I should move my bounding box a little to the left!"

Just a small note, but I hope this helps.

0

Source: https://habr.com/ru/post/1660123/
