Alleged misuse of technical terms. I am working on a semantic segmentation project through CNN; trying to implement an architecture like Encoder-Decoder, so the output is the same size as the input.
How do you create shortcuts? What loss function should be used? Especially in a situation of severe class imbalance (but the ratio between classes varies from image to image).
The problem is related to two classes (objects of interest and background). I am using Keras with a shadoworflow backend.
So far, I plan that the expected outputs will be the same as the inputs, using pixel marking. The final layer of the model has either softmax activation (for 2 classes) or sigmoid activation (to express the probability that the pixels belong to the class of objects). I am having problems creating an appropriate objective function for such a task, such as:
Function(y_pred, y_true),
in agreement with Keras.
Please try to clarify the dimensions of the tensors used (model input / output). Any thoughts and suggestions are greatly appreciated. Thank!
source
share