The first thing to do is create a 256x256 output matrix, assuming you want to keep the original 256x256 matrix in its purest form. Then move the 256x256 input matrix values ββto the output matrix. The next step is to add 16x16 elements to the output matrix.
But before anyone can answer this, you need to explain how the 16x16 matrix relates to the 256x256 matrix. In other words, will the first element (0,0) of the 16x16 matrix be added to the first element (0,0) of the 256x256 matrix? What about the secound (0, 1) element - do you know where this will be added? What about the element (1, 0) of the 16x16 matrix - which element of the 256x256 matrix is ββadded? Once you find out, you can simply encode some loops to add the corresponding 256x256 matrix output element to each element of the 16x16 matrix.
source share