Is the learning rate good for the Adam method?

I am training my network and got the result shown below. Is this a good learning rate? If not, is it too high or too low? This is my result:

[image: training loss plot]

My solver configuration:

lr_policy: "step"
gamma: 0.1
stepsize: 10000
power: 0.75
# lr for unnormalized softmax
base_lr: 0.001
# high momentum
momentum: 0.99
# no gradient accumulation
iter_size: 1
max_iter: 100000
weight_decay: 0.0005
snapshot: 4000
snapshot_prefix: "snapshot/train"
type:"Adam"

This is from a linked article:

With low learning rates the improvements will be linear. With high learning rates they will start to look more exponential. Higher learning rates will decay the loss faster, but they get stuck at worse values of loss.

[image: loss curves for different learning rates]
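
To judge your own curve against that description, it helps to smooth the raw minibatch loss first, since it is noisy. Below is a minimal, self-contained sketch; the synthetic raw series is only for illustration, so feed in whatever losses you actually log:

    import random

    def smooth(losses, beta=0.98):
        """Bias-corrected exponential moving average of a loss series."""
        avg, out = 0.0, []
        for t, loss in enumerate(losses, start=1):
            avg = beta * avg + (1 - beta) * loss
            out.append(avg / (1 - beta ** t))
        return out

    # Synthetic example: a fast early drop that plateaus at a high value,
    # the shape the quote associates with a learning rate that is too high.
    raw = [0.5 + 2.0 / (1 + 0.05 * i) + random.gauss(0, 0.1) for i in range(1000)]
    trend = smooth(raw)
    print(f"start ~ {trend[0]:.2f}, end ~ {trend[-1]:.2f}")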

2 answers

[The text of this answer was lost in translation; only the values 0.0005 and 0.0001 survive in the source.]


[The text of this answer was also lost in translation; only the values 0.1 and 100 survive in the source.]


Source: https://habr.com/ru/post/1015954/

