The main purpose of applying data dimensionality reduction is to maximize capture the initial data distribution even after dimensionality reduction . Therefore, we want to make sure that we record the variance of the data as much as we can .
Say you have an N * N matrix, and we perform SVD ( Decomposition of a singular value ) on X. Then we observe singular values, diagonal entries in the resulting S-matrix. ( X = USV )
And you want to disable them at some index K based on the desired percent variance:
Σ i = 1 K sigma (i) / Σ i = 1 N sigma (i)
If you select the first K columns of U , then you reduce the original N-measure to K-dimension.
Aaron source share