02 Multiple random variables

Random vector

A random vector is an n-dim vector X=(Ai), which are a bunch of jointly distributed random variables. Similarly, X can be a m×n random matrix.

Below, we consider X=(X1,X2), where X1:(S1;F1,v1)R1 and X2:(S2;F2,v2)R2.

A random vector is itself a random variable X:(S1×S2;F1×F2,v)(R1×R2).

Marginalization

The marginalization properties of the joint/ product probability space leads to: Pr(X1E1,X2S2)=Pr(X1E1), so E1×S2fX(x)dv=E1x2S2fX(x1,x2)dv2dv1=E1fX1(x1)dv1.

Hence, x2S2fX(x1,x2)dv2=fX1(x1).

Conditional pdf

Definition (described elsewhere) \of conditional probabilities of the form Pr(A|B) breaks down if Pr(B), the probability measure of the event B is 0.

One can craft a similar definition to cover events X2=b with v2(X2=b)=0. Then, \ Pr(X1E1|X2=b)=Pr(X1E1X2=b)fX2(b)= E1fX(x1,b)fX2(b)1dv1.

fX(x1,b)fX2(b)1=fX1|X2=b(x1) is aka conditional pdf.

Inversion

Similar to the Bayes’s rule, using the definition, one can invert the conditional pdf.

fX2|X1=x1(x2)=fX1|X2=x2(x1)fX2(x2)fX1(x1)=fX1|X2=x2(x1)fX2(x2)S2fX1|X2=x2(x1)fX2(x2)dv2.

Improper densities

Note that the construction of fX2|X1=x1(x2) works even if the prior pdf fX1(x1) is an improper density which does not sum to 1! This sometimes makes the task of modeling random processes easier.

Independence

One can extend the notion of independence of events to random variables, which represent a pair of algebras of events.

Suppose that fX(x)=fX1(x1)fX2(x2). Then, E1,E2:Pr(X1E1,X2E2)=Pr(XE1)Pr(X2E2). In such a case, X1 and X2 are independent. This is denoted by I(X1,X2).

Also independence of events corresponds to independence of corresponding Indicator random variables: AB if IAIB.

Conditional Independence

Conditional: XY|Z fXY|Z(x,y|z)=fX|Z(x|z)fY|Z(y|z)fX|Y,Z(x|y,z)=fX|Z(x|z) fX,Y,Z(x,y,z)=fX,Z(x,z)fY,Z(y,z)fZ(z).\ Marginal: XY when Z=ϕ.

Amongst sets of vars: {Xi}{Yi}|{Zi} iff \ f(Xi|Yj,{Zk})(xi|yj,{zk})=f(Xi|{Zk})(xi|{zk})i,j.

Marginal independence without conditional independent: \ XY|X+Y. Conditional independent sans marginal independent: consider suitable Bayesian network.

Graphical models can be used to specify this.