05 Other density families

Sampling distributions

Sampling distributions are distributions of the functions of samples drawn from other distributions.

Standard normal square sum

Aka Chi square distribution with k degrees of freedom.

If $X_{i} \sim N (0, 1)$ , $\sum_{i = 1}^{k} X_{i}^{2} \sim χ_{k}^{2}$ . This is same as the distribution of $\sum (\frac{Y_{i} - μ}{μ})^{2}$ .

Used in goodness of fit tests. \chk

Student’s t distribution with k degrees of freedom

$\frac{Z}{\sqrt{W / n}}$ , with $Z \sim N (0, 1), W \sim χ_{n}^{2}, Z ⊥ W$ .

F distribution

$\frac{w_{1}/n_{1}}{w_{2}/ n_{2}} \distr F_{n_{1}, n_{2}} : w_{i} \distr \chi^{2}{n{i}}$.

Heavy tailed distributions

$l t_{x \to \infty} \frac{P r (X > x)}{e^{- ϵ x}} = \infty$ . Eg: Power law distribution, cauchy distribution.

Power law distributions

$p (x) \propto x^{- g}; p (x) = x^{- g} Z^{- 1}$ for normalizing constant Z. $l t_{x \to 0} p (x) = \infty$ : so must have lower bound $x_{m i n}$ . log p vs log x graph looks like a straight line.

Aka scale free distribution. The only \why distribution with the property: $\exists g (b) : p (b x) = g (b) p (x)$ .

A subset of heavy-tailed distribution family.

Includes Zipf’s law distribution.

With exponential cutoff

$p (x) \propto x^{- α} e^{- x β}$ . log p vs log x graph looks like a straight line which suddenly bends: exponential term starts kicking in. Akin to gamma distribution.

Zipf’s law for resource usage

Frequency/ probability of usage of resources often follows Zipf’s law: Pr([res used]) $\propto f (r e s o u r c e)^{- k}$ . Eg: words used in document.

Mixture distribution

Often, one models the pdf of $X$ as being a convex combination of multiple pdf’s.

Other pdf’s

Uniform and triangular distributions

Uniform distribution; used when not information is available except min, max. Triangular distribution is used when mode is also known.

Log normal distribution

Take $X \sim N (μ, σ^{2})$ . Then $Y = e^{X}$ has log normal distribution. Wide variety of shapes, heavy tailed.

Gumbel distribution

Used in worst case analysis. CDF: $G (x | μ, b) = e^{- e^{- \frac{x - μ}{b}}}$ , PDF: $g (x | μ, b) = \frac{e^{- \frac{x - μ}{b}}}{b} e^{- e^{- \frac{x - μ}{b}}}$ .

Probability simplex coordinate powering

Aka Dirichlet distribution. This is the conjugate prior for multinomial distribution.

Support is ${x \in R^{k} : \sum_{i} x_{i} = 1, x_{i} > 0}$ : or actually ${x \in R^{k - 1} : \sum_{i} x_{i} < 1, x_{i} > 0}$ . pdf is $p (x; a) \propto \prod_{i = 1 : k} x_{i}^{a_{i} - 1}$ for parameters $a \geq 0$ .

2-dim case

Aka beta(a,b) distribution. This takes up a wide variety of shapes: convex, concave, neither etc..

This is the conjugate prior for bernoulli/ binomial distribution - and a special case of Dirichlet distribution.

Pdf: $f (x) \propto x^{a - 1} (1 - x)^{b - 1}$ for $x \in [0, 1]$ .

Wigner semicircle distribution

Supported on [-R, R], like a semicircle.