Mean
\(E[X] \dfn (E[X_{i}])_{i}\): the expectation of a random vector is taken component-wise.
Linearity
If \(X\) is a random matrix and \(A, B, C\) are constant matrices: \(E[AXB + C] = AE[X]B + C\). Proof: use \((AXB)_{i, j} = A_{i,:} X B_{:, j}\), which is a linear combination of the \(X_{k,l}\).
Also, if \(X\) is a random vector, \(E[a^{T}X] = a^{T}E[X]\).
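A minimal numerical sketch of the matrix identity above (NumPy; the shapes and sample size are illustrative assumptions, not from the notes):

```python
import numpy as np

rng = np.random.default_rng(0)
A, B, C = rng.normal(size=(2, 3)), rng.normal(size=(4, 5)), rng.normal(size=(2, 5))

mean_X = rng.normal(size=(3, 4))                 # E[X]
X = mean_X + rng.normal(size=(100_000, 3, 4))    # X = E[X] + zero-mean noise

lhs = (A @ X @ B + C).mean(axis=0)               # empirical E[AXB + C]
rhs = A @ mean_X @ B + C                         # A E[X] B + C
print(np.max(np.abs(lhs - rhs)))                 # small, shrinking as the sample grows
```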
Covariance
Definition
How correlated are the deviations of \(X\) and \(Y\) from their means?\ \(cov(X, Y) = E_{X, Y}[(X - E[X])(Y - E[Y])]\).
Extension to vectors
\(cov(X, Y) = E_{X, Y}[(X - E[X])(Y - E[Y])^{T}]\):\ the \((i, j)\) entry is \(cov(X_{i}, Y_{j})\).
Correlation
(Pearson) correlation coefficient: \(corr(X, Y) = \frac{cov(X, Y)}{\stddev_{X}\stddev_{Y}}\): normalized covariance.
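A small sketch of the scalar definitions above on synthetic data (the construction \(y = 2x + \text{noise}\) is just an illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.normal(size=50_000)
y = 2.0 * x + rng.normal(size=50_000)      # correlated with x by construction

cov_xy = np.mean((x - x.mean()) * (y - y.mean()))
corr_xy = cov_xy / (x.std() * y.std())
print(cov_xy, corr_xy)                     # cov ≈ 2, corr ≈ 2/sqrt(5) ≈ 0.894
print(np.corrcoef(x, y)[0, 1])             # NumPy's estimate, for comparison
```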
Correlation vs Independence
If \(X_{i}\perp X_{j}\), then \(Cov[X_{i},X_{j}] = 0\), even if the variables are only pairwise independent. The converse fails: for \(X\) symmetric about 0 (e.g. \(X \sim N(0, 1)\)), \(cov(X, X^{2}) = 0\) even though \(X\) and \(X^{2}\) are not \(\perp\).
\(X_{i}\) and \(X_{j}\) are called uncorrelated if \(Cov[X_{i},X_{j}] = 0\). Independence implies uncorrelatedness, but not vice versa.
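A quick check of the \(cov(X, X^{2})\) example for \(X \sim N(0, 1)\) (sample size is an arbitrary choice):

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(size=200_000)
y = x ** 2                                 # a deterministic function of x, so dependent

cov_xy = np.mean((x - x.mean()) * (y - y.mean()))
print(cov_xy)                              # ≈ 0: uncorrelated despite dependence
```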
Covariance matrix
\(\covmatrix = var[X] = cov(X, X) = E[(X-\mean)(X-\mean)^{T}] = E[XX^{T}] - \mean \mean^{T}\).
The diagonal contains the variances \(var[X_{i}]\). The matrix is diagonal if the \(X_{i}\) are pairwise uncorrelated (in particular, if they are independent).
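A sketch comparing the two equivalent expressions for \(\covmatrix\) on synthetic data (the chosen mean and covariance are arbitrary assumptions):

```python
import numpy as np

rng = np.random.default_rng(3)
X = rng.multivariate_normal(mean=[1.0, -2.0, 0.5],
                            cov=[[2.0, 0.3, 0.0],
                                 [0.3, 1.0, 0.5],
                                 [0.0, 0.5, 1.5]],
                            size=100_000)          # rows are samples of a 3-vector

mu = X.mean(axis=0)
centered = X - mu
cov1 = centered.T @ centered / len(X)              # E[(X - μ)(X - μ)^T]
cov2 = (X.T @ X) / len(X) - np.outer(mu, mu)       # E[X X^T] - μ μ^T
print(np.max(np.abs(cov1 - cov2)))                 # agree up to floating point
print(np.diag(cov1))                               # ≈ the variances 2.0, 1.0, 1.5
```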
Effect of linear transformation
\(Var[BX + a] = E[(BX - BE[X])(BX - BE[X])^{T}] = BVar[X]B^{T}\). As in the scalar case, constant shifts have no effect.
Special case: \(var[a^{T}X] = a^{T}var[X]a\).
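A sketch of the transformation rule with an arbitrary \(B\) and shift \(a\) (all values are illustrative):

```python
import numpy as np

rng = np.random.default_rng(4)
X = rng.normal(size=(100_000, 3)) @ np.array([[1.0, 0.2, 0.0],
                                              [0.0, 1.0, 0.5],
                                              [0.0, 0.0, 1.0]])   # correlated 3-vectors
B = np.array([[1.0, -1.0, 2.0],
              [0.5,  0.0, 1.0]])
a = np.array([10.0, -3.0])

Y = X @ B.T + a                                  # samples of BX + a (rows are samples)
var_X = np.cov(X, rowvar=False)
var_Y = np.cov(Y, rowvar=False)
print(np.max(np.abs(var_Y - B @ var_X @ B.T)))   # ≈ 0: the shift a drops out
```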
Nonnegative definiteness
\(\covmatrix \succeq 0\), since \(a^{T}\covmatrix a = a^{T}E[(X-\mean)(X-\mean)^{T}]a = E[(a^{T}(X-\mean))^{2}] \geq 0\) for every \(a\).
If \(a^{T}\covmatrix a = 0\) for some \(a \neq 0\), then \(a^{T}X = a^{T}\mean\) with probability 1: the components \(X_{i}\) satisfy a linear relation, and \(X\) lies on the hyperplane through \(\mean\) with normal \(a\).
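A sketch of the degenerate case, constructing an exact relation \(X_{3} = X_{1} + X_{2}\) so that \(a = (-1, -1, 1)^{T}\) is a null direction of \(\covmatrix\) (data is synthetic):

```python
import numpy as np

rng = np.random.default_rng(5)
x1 = rng.normal(size=50_000)
x2 = rng.normal(size=50_000)
x3 = x1 + x2                               # exact linear dependence: -x1 - x2 + x3 = 0
X = np.stack([x1, x2, x3], axis=1)

S = np.cov(X, rowvar=False)
print(np.linalg.eigvalsh(S))               # all ≥ 0; the smallest is ≈ 0
a = np.array([-1.0, -1.0, 1.0])            # normal of the hyperplane containing X
print(S @ a)                               # ≈ 0, so a^T Σ a = 0
```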
Precision matrix
\(V = \covmatrix^{-1}\) is the precision matrix. Its entries encode partial correlations; in the multivariate normal case, \(V_{ij} = 0\) iff \(X_{i}\) and \(X_{j}\) are conditionally independent given the remaining components.
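A sketch of the normal-distribution remark, using an assumed Gaussian chain \(X_{1} \to X_{2} \to X_{3}\) (so \(X_{1} \perp X_{3} \mid X_{2}\)) where the corresponding precision entry vanishes even though the marginal covariance does not:

```python
import numpy as np

rng = np.random.default_rng(6)
x1 = rng.normal(size=500_000)
x2 = 0.8 * x1 + rng.normal(size=500_000)
x3 = 0.5 * x2 + rng.normal(size=500_000)   # x1 -> x2 -> x3: x1 ⟂ x3 given x2
X = np.stack([x1, x2, x3], axis=1)

V = np.linalg.inv(np.cov(X, rowvar=False))
print(np.round(V, 2))                      # the (1, 3) entry is ≈ 0 (exactly 0 in population)
print(np.cov(X, rowvar=False)[0, 2])       # marginal covariance of x1, x3 is clearly nonzero
```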
Moment generating function
\(M_{X}(t) = E[e^{t^{T}X}]\) is the moment generating function; its partial derivatives at \(t = 0\) give the moments of \(X\), e.g. \(\nabla M_{X}(0) = E[X]\).
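A symbolic sketch (SymPy) for the standard normal, whose MGF \(e^{t^{2}/2}\) is known, recovering moments from derivatives at \(0\):

```python
import sympy as sp

t = sp.symbols('t')
M = sp.exp(t**2 / 2)                        # MGF of X ~ N(0, 1)
moments = [sp.diff(M, t, k).subs(t, 0) for k in range(5)]
print(moments)                              # [1, 0, 1, 0, 3] = E[X^0], ..., E[X^4]
```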