05 Other density families

Sampling distributions

Sampling distributions are distributions of the functions of samples drawn from other distributions.

Standard normal square sum

Aka Chi square distribution with k degrees of freedom.

If XiN(0,1), i=1kXi2χk2. This is same as the distribution of (Yiμμ)2.

Used in goodness of fit tests. \chk

Student’s t distribution with k degrees of freedom

ZW/n, with ZN(0,1),Wχn2,ZW.

F distribution

\(\frac{w_{1}/n_{1}}{w_{2}/ n_{2}} \distr F_{n_{1}, n_{2}} : w_{i} \distr \chi^{2}{n{i}}\).

Heavy tailed distributions

ltxPr(X>x)eϵx=. Eg: Power law distribution, cauchy distribution.

Power law distributions

p(x)xg;p(x)=xgZ1 for normalizing constant Z. ltx0p(x)=: so must have lower bound xmin. log p vs log x graph looks like a straight line.

Aka scale free distribution. The only \why distribution with the property: g(b):p(bx)=g(b)p(x).

A subset of heavy-tailed distribution family.

Includes Zipf’s law distribution.

With exponential cutoff

p(x)xαexβ. log p vs log x graph looks like a straight line which suddenly bends: exponential term starts kicking in. Akin to gamma distribution.

Zipf’s law for resource usage

Frequency/ probability of usage of resources often follows Zipf’s law: Pr([res used]) f(resource)k. Eg: words used in document.

Mixture distribution

Often, one models the pdf of X as being a convex combination of multiple pdf’s.

Other pdf’s

Uniform and triangular distributions

Uniform distribution; used when not information is available except min, max. Triangular distribution is used when mode is also known.

Log normal distribution

Take XN(μ,σ2). Then Y=eX has log normal distribution. Wide variety of shapes, heavy tailed.

Gumbel distribution

Used in worst case analysis. CDF: G(x|μ,b)=eexμb, PDF: g(x|μ,b)=exμbbeexμb.

Probability simplex coordinate powering

Aka Dirichlet distribution. This is the conjugate prior for multinomial distribution.

Support is {xRk:ixi=1,xi>0}: or actually {xRk1:ixi<1,xi>0}. pdf is p(x;a)i=1:kxiai1 for parameters a0.

2-dim case

Aka beta(a,b) distribution. This takes up a wide variety of shapes: convex, concave, neither etc..

This is the conjugate prior for bernoulli/ binomial distribution - and a special case of Dirichlet distribution.

Pdf: f(x)xa1(1x)b1 for x[0,1].

Wigner semicircle distribution

Supported on [-R, R], like a semicircle.