I am trying to classify a large set of images using nVidia DIGITS and Caffe. Everything works well when I use the standard networks and networks that I built.
However, when I run the GoogleNet example, I can see the results of several levels of accuracy. How can there be several levels of accuracy in CNN? Having multiple loss levels is understandable, but what does multiple accuracy mean? I get several accuracy graphs during training. Like this image: 
lossX-top1 and lossX-top5 denote accuracy levels. I understand from Prototex that they rate the top and top 5 accuracy values, but what are the lossX accuracy levels?
Despite the fact that some of these graphs converge to about 98%, when I manually test the learning network for 'validation.txt' , I get a significantly lower value (corresponding to the bottom three accuracy graphs).
Can someone shed some light on this? How can there be several levels of accuracy with different values?
source share