(from: wikipedia)
The whitening transformation is a method which transforms a set of random variables having the Σ into a set of new random variables whose covariance is aI, where a is a constant and I is the . The new random variables are uncorrelated and all have 1. The method is called "whitening" because it transforms the input matrix to the form of , which by definition is uncorrelated and has uniform variance. It differs from decorrelation in that the variances are made to be equal, rather than merely making the covariances zero. That is, where decorrelation results in a diagonal covariance matrix, whitening produces a scalar multiple of the identity matrix.
Definition
Define X to be a with covariance matrix Σ and mean 0. The matrix Σ can be written as the of X and XT:
(注,我初一看以为写错了,按我的想法好像写成 E[XiXj] 更靠谱,我又想了想,后来明白了,E[XiXj]这样写的话,括号里面成了两个变量,而实际应该是一个变量(总体),前后可以是两个变量样本值。所以 E[XXT]这样写是有道理的,只是注意不要误解吧。还可以这样写 E[XYT],X和Y是同分布,但是这是表示两个一般变量协方差的常用方法,显然和这里X是向量的环境不符合。)
Define Σ1/2 as
Define the new random vector Y = Σ-1/2X. The covariance of Y is
Thus, Y is a white random vector.
Based on the fact that the covariance matrix is always , Σ1/2 can be derived using :Where the matrix Λ1/2 is a diagonal matrix with each element being square root of the corresponding element in Λ. To show that this equation follows from the prior one, multiply by the transpose: