I am new to neural networks. I was looking at the MNIST handwritten digit dataset page, which lists the error rate of each method in a table. Here is the link to the page. In the NN section, the entries include:
3-layer NN, 300 + 100 hidden units
2-layer NN, 300 hidden units, mean square error
What does "3-layer" mean here? Is it
InputLayer + HiddenLayer + OutputLayer
or
InputLayer + 3 HiddenLayers + OutputLayer?
Also, what does "300 + 100 hidden units" mean? Since two numbers are given, I assume one hidden layer contains 300 units and the next hidden layer contains 100 units.
If so, why is it called a 3-layer NN?
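To make my reading concrete, here is a minimal sketch in plain Python. The sizes 784 and 10 are my own assumption (28x28 input pixels and 10 digit classes), and the function names are only illustrative. Under this reading the network has two hidden layers but three sets of weights, so I am not sure which of these counts "3-layer" refers to.

```python
# Assumed layer sizes: 784 inputs (28x28 MNIST pixels), two hidden layers
# of 300 and 100 units, and 10 outputs (one per digit class).
layer_sizes = [784, 300, 100, 10]


def weight_shapes(sizes):
    """Return the (inputs, outputs) shape of each weight matrix
    connecting consecutive layers."""
    return list(zip(sizes[:-1], sizes[1:]))


def count_layers(sizes):
    """Count the layers under three different conventions."""
    return {
        "hidden_layers": len(sizes) - 2,        # excludes input and output
        "weight_layers": len(sizes) - 1,        # one per weight matrix
        "all_layers_including_input": len(sizes),
    }


shapes = weight_shapes(layer_sizes)
counts = count_layers(layer_sizes)
print(shapes)
print(counts)
```

If the same counting is applied to the other entry (784, 300, 10), it gives one hidden layer and two sets of weights.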