You are looking at the right paper, but I think you expect the algorithm to do what it is not going to do. It produces a low-level approximation to your input as the product of two matrices, but nothing about the multiplying matrices pinches the output values.
You can clamp or round values. You may not want this because you get additional information about how strong the 5 predicted score is. I believe that it is also theoretically impossible for the algorithm to assume that the maximum possible value is the maximum observed value at the input.
source share