I implemented a binary logistic regression classifier. As an experiment, I replaced the sigmoid function, 1/(1 + exp(-z)), with tanh. The results were exactly the same, using the same classification threshold of 0.5, even though tanh has range (-1, 1) while the sigmoid has range (0, 1).
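A minimal sketch of the swap described above, assuming NumPy. The names `sigmoid` and `predict` and the sample scores `z` are illustrative only and are not taken from the original implementation.

```python
import numpy as np

def sigmoid(z):
    """Logistic sigmoid; output lies in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def predict(z, activation, threshold=0.5):
    """Assign class 1 when activation(z) exceeds the threshold, else class 0."""
    return (activation(z) > threshold).astype(int)

# Hypothetical linear scores z = w.x + b, chosen for illustration only.
z = np.array([-2.0, -0.3, 0.3, 2.0])

sig_labels = predict(z, sigmoid)    # sigmoid crosses 0.5 at z = 0
tanh_labels = predict(z, np.tanh)   # tanh crosses 0.5 at z = arctanh(0.5)

# tanh is a shifted and rescaled sigmoid: tanh(z) = 2*sigmoid(2z) - 1
assert np.allclose(np.tanh(z), 2.0 * sigmoid(2.0 * z) - 1.0)

# Score at which tanh reaches the 0.5 cutoff (roughly 0.549).
tanh_boundary = np.arctanh(0.5)
```

With the same 0.5 cutoff, the two rules agree for scores that lie far from the decision boundary, which may be why the results looked identical. They can disagree for scores between 0 and `tanh_boundary`.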
Does it really matter that we use a sigmoid function, or can any differentiable nonlinear function, such as tanh, work?
Thanks.