PyTorch Burst Matching

The batchnorm pulse protocol convention is (default = 0.1), as in other libraries, for example. Typically, the tensor size is 0.9 or 0.99 by default? Or maybe we just use a different agreement?

+4
source share
1 answer

It seems that the parameterization convention is different in pytorch than in tensor stream, so 0.1 in pytorch is equivalent to 0.9 in tensor stream.

More precisely:

In Tensorflow:

running_mean = decay*running_mean + (1-decay)*new_value

In PyTorch:

running_mean = (1-decay)*running_mean + decay*new_value

This means that the value decayin PyTorch is equivalent to the value (1-decay)in Tensorflow.

+1
source

Source: https://habr.com/ru/post/1692427/


All Articles