Try   HackMD

Mathematical Details of the Law of Large Numbers

Setup

Let

X1,X2,X3, be a sequence of independent and identically distributed random variables.
Let
μ=E[X]
be the expected value of each
X
.
Let
σ2=Var(X)
be the variance of each
X
.

Define the sample mean:

X¯n=1ni=1nXi

Weak Law of Large Numbers (WLLN)

For any

ε>0:

limnP(|X¯nμ|<ε)=1

This means

X¯n converges in probability to
μ
, written as:
X¯npμ

Strong Law of Large Numbers (SLLN)

P(limnX¯n=μ)=1

This means

X¯n converges almost surely to
μ
, written as:
X¯na.s.μ

Implications

  1. The sample mean is a consistent estimator of the population mean.
  2. As sample size increases, the probability of large deviations from the true mean decreases.
  3. In the limit, the sample mean will converge to the true population mean.
  4. This applies to any distribution with a finite mean, not just normal distributions.

Note on Convergence

  • WLLN: Convergence in probability
  • SLLN: Almost sure convergence (stronger than convergence in probability)

The LLN doesn't specify the rate of convergence. This is addressed by other theorems, such as the Central Limit Theorem.


Central Limit Theorem (CLT)

Setup

Let

X1,X2,X3,,Xn be a sequence of independent and identically distributed random variables.
Let
μ=E[X]
be the expected value of each
X
.
Let
σ2=Var(X)
be the variance of each
X
.

Define the sample mean:

X¯n=1ni=1nXi

Statement of the Central Limit Theorem

As

n approaches infinity, the distribution of the standardized sample mean:

X¯nμσ/n

converges in distribution to a standard normal random variable

ZN(0,1).

In other words:

n(X¯nμ)/σdN(0,1) as n

Implications

  1. For large

    n,
    X¯n
    is approximately normally distributed with:
    E[X¯n]=μ

    Var(X¯n)=σ2/n

  2. The distribution of

    X¯n can be approximated by:
    X¯nN(μ,σ2/n)

  3. The standard error of the mean is given by:

    SE(X¯n)=σ/n

Key Points

  • Applies regardless of the shape of the original distribution (as long as
    μ
    and
    σ2
    are finite).
  • "Sufficiently large"
    n
    depends on the original distribution, but
    n30
    is often used as a rule of thumb.
  • Enables the use of z-scores and t-scores in hypothesis testing and confidence intervals.

Comparison with Law of Large Numbers (LLN)

  • LLN:
    X¯npμ
    (WLLN) or
    X¯na.s.μ
    (SLLN)
  • CLT:
    n(X¯nμ)/σdN(0,1)

CLT provides information about the distribution and spread of

X¯n, while LLN only addresses its convergence to
μ
.

Applications

  1. Hypothesis testing
  2. Construction of confidence intervals
  3. Approximation of binomial distribution by normal distribution
  4. Quality control in manufacturing
  5. Sampling theory in surveys and polls

The CLT is a cornerstone of statistical inference, allowing us to make probabilistic statements about sample means even when we don't know the underlying population distribution.