Most of these questions are what you need to try different options to see what works best. This is a problem with ANN. There is no βbetterβ way to do almost anything. You need to find out what works for your specific problem. However, I will give advice for your questions.
1) I prefer gradual learning. I think it is important that the network scales are updated after each template.
2) This is a difficult question. It depends on the complexity of your network. How many input nodes, output nodes, and training patterns are. For your problem, I can start at 100 and try the ranges up and down from 100 to see if there are any improvements.
3) ( ) . 5 , , , , . . , .
4) , 26 , . . , , , .