2 Mean and variance

Mean: estimation

Consistency

Aka Law of large numbers

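Let $\{X_{i}\}$ be iid with mean $μ$ and variance $σ^{2}$, and let $\bar{X}_{n} = n^{-1}\sum_{i=1}^{n} X_{i}$. As $var[\bar{X}_{n}] = σ^{2} / n \to 0$ as $n \to \infty$, Chebyshev's inequality gives $Pr(|\bar{X}_{n} - μ| \geq ε) \leq \frac{σ^{2}}{n ε^{2}} \to 0$ for every $ε > 0$. Weak law: $\bar{X}_{n}$ is a consistent estimator of $μ$.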
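The shrinking variance of the sample mean can be checked by simulation. A minimal sketch, assuming Uniform(0, 1) draws (so $μ = 1/2$, $σ^{2} = 1/12$); the distribution, seed and function names are illustrative choices, not part of the notes.

```python
import random
import statistics


def sample_mean(n, rng):
    """Mean of n iid Uniform(0, 1) draws (true mean 0.5, variance 1/12)."""
    return statistics.fmean(rng.random() for _ in range(n))


def variance_of_sample_mean(n, trials, rng):
    """Empirical variance of the sample mean over repeated experiments."""
    return statistics.pvariance([sample_mean(n, rng) for _ in range(trials)])


rng = random.Random(0)  # fixed seed so runs are repeatable
for n in (10, 100, 1000):
    observed = variance_of_sample_mean(n, trials=500, rng=rng)
    # Observed variance next to the theoretical sigma^2 / n.
    print(n, observed, (1 / 12) / n)
```

The two printed columns should roughly agree and both fall by a factor of about 10 per row.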

Normalness of estimator distribution

Aka Central limit theorem (CLT)

Take the standardized statistic $U_{n} = \frac{\bar{X} - μ}{σ / \sqrt{n}}$. $\lim_{n \to \infty} Pr(U_{n} \leq u) = \int_{-\infty}^{u} \frac{1}{\sqrt{2 π}} e^{-t^{2} / 2} dt$: so the CDF of $U_{n}$ approaches the CDF of N(0, 1). See convergence of moment generating function (MGF) below. Also, as n increases, $var[\bar{X}]$ becomes smaller: visualize pdfs of $X, \bar{X}_{30}, \bar{X}_{50}$; see how the curve becomes more normal and gets thinner and taller. As a rule of thumb, can use CLT when $n > 30$.
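The convergence of $Pr(U_{n} \leq u)$ to the N(0, 1) CDF can be observed numerically. A rough sketch, assuming Exponential(1) draws (a skewed distribution with $μ = σ = 1$); the choice of distribution, the seed and the helper names are assumptions made for illustration.

```python
import math
import random


def standardized_mean(n, rng):
    """U_n for n iid Exponential(1) draws, for which mu = sigma = 1."""
    xbar = sum(rng.expovariate(1.0) for _ in range(n)) / n
    return (xbar - 1.0) / (1.0 / math.sqrt(n))


def empirical_cdf_at(u, n, trials, rng):
    """Fraction of simulated U_n values that are <= u."""
    return sum(standardized_mean(n, rng) <= u for _ in range(trials)) / trials


def standard_normal_cdf(u):
    """CDF of N(0, 1), written with the error function."""
    return 0.5 * (1.0 + math.erf(u / math.sqrt(2.0)))


rng = random.Random(0)  # fixed seed so runs are repeatable
for n in (1, 5, 30):
    # Empirical Pr(U_n <= 1) next to the N(0, 1) value.
    print(n, empirical_cdf_at(1.0, n, 2000, rng), standard_normal_cdf(1.0))
```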

Proof showing convergence to Normal MGF

Theorem: MGF $m_{U_{n}}(t) \to e^{t^{2} / 2}$, the MGF of N(0, 1), as $n \to \infty$.

Proof: iid $\{X_{i}\}$. $m_{U_{n}}(t) = E[e^{\frac{t (\sum X_{i} - n μ)}{\sqrt{n} σ}}] = \prod_{i} E[e^{\frac{t}{\sqrt{n}} \frac{X_{i} - μ}{σ}}] = (m_{Z}(t / \sqrt{n}))^{n}$: implicitly defining $Z = \frac{X_{i} - μ}{σ}$ with $E[Z] = 0, var[Z] = E[Z^{2}] = 1$.

But, by Taylor, $m_{Z}(t / \sqrt{n}) = m_{Z}(0) + m_{Z}'(0)(t / \sqrt{n}) + m_{Z}''(h)(t / \sqrt{n})^{2}(1 / 2!) = 1 + E[Z](t / \sqrt{n}) + m_{Z}''(h)\frac{t^{2}}{2 n}$ for some $h$ between 0 and $t / \sqrt{n}$; as $E[Z] = 0$, $m_{Z}(t / \sqrt{n}) = 1 + m_{Z}''(h)\frac{t^{2}}{2 n}$. As $n \to \infty$, $h \to 0$, so $m_{Z}''(h) \to m_{Z}''(0) = E[Z^{2}] = 1$. So, $m_{U_{n}}(t) = (1 + m_{Z}''(h)\frac{t^{2}}{2 n})^{n} \to e^{t^{2} / 2}$, MGF of N(0, 1), using $(1 + a_{n} / n)^{n} \to e^{a}$ when $a_{n} \to a$.
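The limit can be checked numerically for one concrete $Z$. A small sketch, assuming $Z = \pm 1$ with probability 1/2 each (so $E[Z] = 0$, $var[Z] = 1$ and $m_{Z}(s) = \cosh(s)$); this choice of $Z$ and the function names are illustrative assumptions.

```python
import math


def mgf_standardized_sum(t, n):
    """m_{U_n}(t) = (m_Z(t / sqrt(n)))**n for Z = +/-1 with probability 1/2 each.

    For this Z, E[Z] = 0, var[Z] = 1 and m_Z(s) = cosh(s).
    """
    return math.cosh(t / math.sqrt(n)) ** n


def mgf_standard_normal(t):
    """MGF of N(0, 1)."""
    return math.exp(t * t / 2.0)


t = 1.0
for n in (1, 10, 100, 10000):
    # Gap between m_{U_n}(t) and the N(0, 1) MGF; it shrinks as n grows.
    print(n, abs(mgf_standardized_sum(t, n) - mgf_standard_normal(t)))
```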

Normal distr: Pivotal quantity to estimate mean

Student’s t distribution is used to estimate $μ$ when the distribution is assumed to be Normal, n is small and $σ$ is unknown. Tables only go up to n = 30 or 40. If $σ$ were known, one would use the normal distribution; if $n > 30$, one would estimate $σ$ by $S$ and use normal distribution tables.

As $(n - 1) \frac{S^{2}}{σ^{2}} \sim χ_{n - 1}^{2}$ independently of $\bar{X}$, and $\sqrt{n} \frac{\bar{X} - μ}{σ} \sim N(0, 1)$,

$\sqrt{n} \frac{\bar{X} - μ}{S} \sim t_{n - 1}$ .
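Inverting the pivot gives the interval $\bar{X} \pm t^{*} S / \sqrt{n}$. A minimal sketch for $n = 10$, assuming the critical value $t^{*} = 2.262$ (two-sided 95%, 9 degrees of freedom) is read from a t table; the data are made up for illustration and the function names are hypothetical.

```python
import math
import statistics

# Two-sided 95% critical value of t with 9 degrees of freedom, copied from a
# standard t table (an input to this sketch, not computed here).
T_CRIT_95_DF9 = 2.262


def t_statistic(sample, mu0):
    """sqrt(n) * (xbar - mu0) / S, with S the (n - 1)-denominator std deviation."""
    n = len(sample)
    return math.sqrt(n) * (statistics.fmean(sample) - mu0) / statistics.stdev(sample)


def t_confidence_interval(sample, t_crit):
    """Interval xbar +/- t_crit * S / sqrt(n) obtained by inverting the pivot."""
    n = len(sample)
    xbar = statistics.fmean(sample)
    half_width = t_crit * statistics.stdev(sample) / math.sqrt(n)
    return xbar - half_width, xbar + half_width


# Made-up measurements (n = 10), assumed to come from a Normal distribution.
data = [4.9, 5.1, 5.0, 4.8, 5.2, 5.0, 5.3, 4.7, 5.1, 4.9]
print(t_confidence_interval(data, T_CRIT_95_DF9))
```

The critical value must match the sample size: a different $n$ needs a different table entry.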

Goodness of empirical estimate

Can apply Chernoff bounds, the Azuma-Hoeffding inequality, etc. to judge goodness of empirical estimate.
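As one concrete instance, Hoeffding's inequality for independent draws bounded in $[a, b]$ states $Pr(|\bar{X} - μ| \geq ε) \leq 2 e^{-2 n ε^{2} / (b - a)^{2}}$. The sketch below evaluates that bound and the sample size it implies; the function names and default bounds are illustrative assumptions.

```python
import math


def hoeffding_bound(n, eps, low=0.0, high=1.0):
    """Upper bound on Pr(|xbar - mu| >= eps) for n independent draws in [low, high]."""
    return min(1.0, 2.0 * math.exp(-2.0 * n * eps**2 / (high - low) ** 2))


def hoeffding_sample_size(eps, delta, low=0.0, high=1.0):
    """Smallest n for which the bound above is at most delta."""
    return math.ceil((high - low) ** 2 * math.log(2.0 / delta) / (2.0 * eps**2))


# Draws in [0, 1]: sample size for error at most 0.05 with probability >= 0.95.
print(hoeffding_sample_size(eps=0.05, delta=0.05))  # 738
```

The bound is distribution-free, so it is usually more conservative than a CLT-based interval.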

For binary-valued random variables: see A/B testing confidence interval and precision calculators.

Variance estimation

The biased and unbiased estimators

$S^{2} = n^{-1} \sum (X_{i} - \bar{X})^{2}$ is biased: $B[S^{2}] = n^{-1} E[\sum X_{i}^{2} - 2 \bar{X} \sum X_{i} + n \bar{X}^{2}] - σ^{2} = n^{-1}(n E[X^{2}] - 2 E[n \bar{X}^{2}] + n E[\bar{X}^{2}]) - σ^{2} = n^{-1}(n σ^{2} + n μ^{2} - n \, var[\bar{X}] - n μ^{2}) - σ^{2} = n^{-1}(n - 1) σ^{2} - σ^{2} = -σ^{2} / n$, using $E[\bar{X}^{2}] = var[\bar{X}] + μ^{2}$ and $var[\bar{X}] = σ^{2} / n$. So, $S^{2}$ is instead defined as $S^{2} = (n - 1)^{-1} \sum (X_{i} - \bar{X})^{2}$ to get an unbiased estimator. The difference becomes small as $n \to \infty$.
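The bias can be verified exactly on a small case by enumerating every possible sample. A minimal sketch, assuming a fair six-sided die ($σ^{2} = 35/12$) and $n = 3$; the example and the function name are illustrative, not from the notes.

```python
from fractions import Fraction
from itertools import product


def expected_variance_estimate(values, n, denominator):
    """Exact E[sum (X_i - xbar)^2 / denominator] for n iid uniform draws from values."""
    total = Fraction(0)
    count = 0
    for sample in product(values, repeat=n):  # every equally likely sample
        xbar = Fraction(sum(sample), n)
        total += sum((x - xbar) ** 2 for x in sample) / denominator
        count += 1
    return total / count


faces = range(1, 7)
n = 3
print(expected_variance_estimate(faces, n, n))      # 35/18, i.e. (n - 1) / n * sigma^2
print(expected_variance_estimate(faces, n, n - 1))  # 35/12, i.e. sigma^2
```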

Normal distr: Pivotal quantity to estimate variance

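$N(μ, σ^{2})$ assumed. If $S^{2} = \frac{\sum (X_{i} - \bar{X})^{2}}{n - 1}$, then $(n - 1) \frac{S^{2}}{σ^{2}} \sim χ_{n - 1}^{2}$: by an orthogonal change of variables, $\sum (X_{i} - \bar{X})^{2} / σ^{2}$ can be rewritten as a sum of squares of $n - 1$ independent N(0, 1) variables (Cochran's theorem).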

So, can use this as pivotal quantity.
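Inverting $Pr(q_{lo} \leq (n - 1) S^{2} / σ^{2} \leq q_{hi}) = 0.95$ gives the interval $[(n - 1) S^{2} / q_{hi}, (n - 1) S^{2} / q_{lo}]$ for $σ^{2}$. A minimal sketch for $n = 10$, assuming the quantiles 2.700 and 19.023 are read from a chi-square table for 9 degrees of freedom; the data are made up and the function name is hypothetical.

```python
import statistics

# 2.5% and 97.5% quantiles of chi-square with 9 degrees of freedom, copied from
# a standard table (inputs to this sketch, not computed here).
CHI2_LOWER_DF9 = 2.700
CHI2_UPPER_DF9 = 19.023


def variance_confidence_interval(sample, chi2_lower, chi2_upper):
    """Interval for sigma^2 from the pivot (n - 1) S^2 / sigma^2."""
    n = len(sample)
    scaled = (n - 1) * statistics.variance(sample)  # equals sum (x_i - xbar)^2
    return scaled / chi2_upper, scaled / chi2_lower


# Made-up measurements (n = 10), assumed to come from a Normal distribution.
data = [4.9, 5.1, 5.0, 4.8, 5.2, 5.0, 5.3, 4.7, 5.1, 4.9]
print(variance_confidence_interval(data, CHI2_LOWER_DF9, CHI2_UPPER_DF9))
```

The interval is not symmetric around $S^{2}$ because the chi-square distribution is skewed.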

Sequential data: sample statistics

k-step Moving averages

Suppose that $ran(X_{i}) \subseteq \mathbb{R}$, and that the sample size is $n$.

Simple moving average

This is simply the mean of the last $k$ observations: $k^{-1} \sum_{i = n - k + 1}^{n} X_{i}$.
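A streaming version keeps a running sum over a window of size $k$, so each update costs constant time. A minimal sketch; the function name and the choice to emit nothing until $k$ values have arrived are assumptions.

```python
from collections import deque


def simple_moving_average(xs, k):
    """Yield the mean of the last k values once k values have been seen."""
    window = deque(maxlen=k)
    running_sum = 0.0
    for x in xs:
        if len(window) == k:
            running_sum -= window[0]  # value about to be evicted by append
        window.append(x)
        running_sum += x
        if len(window) == k:
            yield running_sum / k


print(list(simple_moving_average([1, 2, 3, 4, 5], 3)))  # [2.0, 3.0, 4.0]
```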

Exponentially weighted

Here, one takes a weighted average of the last $k$ $X_{i}$, with the weight decreasing exponentially as $i$ decreases (older observations count less).
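One way to realize this weighting is to give the $j$-th most recent value the weight $λ^{j}$ and normalize by the sum of weights. A minimal sketch, assuming a decay factor $λ = 0.5$; the parameter name `decay`, its default and the normalization are illustrative choices, not fixed by the notes.

```python
def exponentially_weighted_average(xs, k, decay=0.5):
    """Weighted mean of the last k values; the j-th most recent gets weight decay**j."""
    if k < 1:
        raise ValueError("k must be at least 1")
    recent = list(xs)[-k:]
    weights = [decay**j for j in range(len(recent))]  # j = 0 is the newest value
    weighted_sum = sum(w * x for w, x in zip(weights, reversed(recent)))
    return weighted_sum / sum(weights)


# Last three values 2, 3, 4 get weights 0.25, 0.5, 1 before normalization.
print(exponentially_weighted_average([1, 2, 3, 4], k=3))
```

A smaller `decay` tracks recent values more closely; `decay=1.0` reduces to the simple moving average.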

Applications

This is useful while predicting stock prices, for example.