Suppose you have a four-element matrix M
ab cd
and maxpool (M) returns d . Then the maxpool function really depends only on d . So, the derivative of maxpool with respect to d is 1, and its derivative with respect to a, b, c is zero. So you backpropagate 1 to the module corresponding to d , and you skip back zero for other units.
source share