
6.4 Limiting Distributions

Sections 6.1 through 6.3 developed methods for approximating smooth functions (exponentials, logarithms) and factorials. In this section we will use those tools to derive the distributions for two essential probability models. Both can be derived as limits of the familiar Binomial distribution from Section 2.2.

Recall that $X \sim \text{Binomial}(n,p)$ if $X$ is the total number of successes in $n$ independent, identical binary trials with success probability $p$. Then:

$$X \in \{0, 1, 2, ..., n\} \text{ and } \text{PMF}(x) = \binom{n}{x} p^x (1 - p)^{n - x}.$$

Open the distribution plotter linked below to experiment with the Binomial PMF.

```python
from utils_dist import run_distribution_explorer

run_distribution_explorer("Binomial");
```

Poisson Distributions

We express the statement, $X$ is drawn from a Poisson distribution with parameter $\lambda$, as:

$$X \sim \text{Poisson}(\lambda).$$

Here, $X \in \{0, 1, 2, ...\}$ and $\text{PMF}(x) = e^{-\lambda} \frac{\lambda^x}{x!}$.

The Poisson distribution is normalized since, by the Taylor series expansion of the exponential:

$$\sum_{x=0}^{\infty} e^{-\lambda} \frac{\lambda^x}{x!} = e^{-\lambda} \sum_{x=0}^{\infty} \frac{\lambda^x}{x!} = e^{-\lambda} e^{\lambda} = 1$$
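This normalization can also be checked numerically by truncating the infinite sum. A minimal sketch using only the standard library (the helper name `poisson_pmf` is ours, not part of `utils_dist`):

```python
import math

def poisson_pmf(x, lam):
    """PMF of Poisson(lam): e^{-lam} * lam^x / x!."""
    return math.exp(-lam) * lam**x / math.factorial(x)

# Truncating at 100 terms already captures essentially all the mass for lam = 5.
total = sum(poisson_pmf(x, 5.0) for x in range(100))
print(total)  # very close to 1
```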

Experiment with the Poisson distribution below. Try setting $\lambda = 5$.

```python
from utils_dist import run_distribution_explorer

run_distribution_explorer("Poisson");
```

You should notice that the result looks roughly Binomial with a mode near $\lambda = 5$. We will recover the Poisson as a limit of the Binomial distribution.

To find the appropriate limit, open the Binomial explorer again.

```python
from utils_dist import run_distribution_explorer

run_distribution_explorer("Binomial");
```

Now, attempt the following.

  1. Gradually increase $n$. Start with $n = 10$, then work upwards.

  2. As you increase $n$, decrease $p$. Keep $p \approx 5/n$. This will keep the peak of the Binomial PMF near $x = 5$ even as $n$ increases, matching the peak of the Poisson PMF with $\lambda = 5$.

You should see that, as $n$ increases, your PMF looks closer and closer to the PMF for $\text{Poisson}(5)$.

Repeat this experiment for $\lambda = 2$ and $\lambda = 10$. Each time, track the Binomial PMF as you increase $n$ while decreasing $p$. You should see that, if you keep $p = \lambda/n$, then, in each case, the sequence of Binomial PMFs will approach the Poisson PMF.
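The same convergence can be checked numerically rather than visually. A sketch, using only the standard library, that measures the largest pointwise gap between the $\text{Binomial}(n, \lambda/n)$ and $\text{Poisson}(\lambda)$ PMFs as $n$ grows (the helper names are illustrative):

```python
import math

def binom_pmf(x, n, p):
    return math.comb(n, x) * p**x * (1 - p)**(n - x)

def poisson_pmf(x, lam):
    return math.exp(-lam) * lam**x / math.factorial(x)

lam = 5.0
gaps = {}
for n in (10, 100, 1000):
    # largest pointwise gap between the two PMFs over x = 0, ..., 20
    gaps[n] = max(abs(binom_pmf(x, n, lam / n) - poisson_pmf(x, lam))
                  for x in range(21))
print(gaps)  # the gap shrinks as n grows
```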

... as a Binomial Limit

This experiment illustrates a limiting relationship between the Poisson and Binomial distributions. This relationship is sometimes called the Law of Small Numbers: if $X_n \sim \text{Binomial}(n, p(n))$ with $n \times p(n) = \lambda$ held fixed, then the PMF of $X_n$ converges to the PMF of $\text{Poisson}(\lambda)$ as $n$ diverges.

Before proving this law, it’s worth examining the limit statement.

  1. Holding $n \times p(n) = \lambda$ is equivalent to keeping $\mathbb{E}[X] = n \times p(n)$ fixed at $\lambda$ as $n$ increases. So, this is a limit where, as the number of trials increases, the chance of success per trial decreases, so that the expected total number of successes remains constant.

  2. Like any limiting statement, the Law of Small Numbers is most useful as an approximation. It guarantees that, when $n$ is large and $p$ is small, $X \sim \text{Binomial}(n,p)$ is approximately Poisson distributed. This result is useful since the Poisson distribution is easier to work with than the Binomial.

The Law of Small Numbers is often invoked when we ask about the total number of successes in a large number of trials that each rarely succeed. The “small” in small numbers references the idea that, if $p$ is small, then $\mathbb{E}[X] = np$ is much smaller than $n$. We could just as well have named this limiting relationship the “Law of Rare Counts.”

When do limits of this kind occur in practice?

... from Exponential Waiting Times

We’ve actually already seen a situation where the limit involved in the Law of Small Numbers is sensible.

Recall the random incidents model from Section 6.2: incidents occur at random times, and the waiting times between successive incidents are independent $\text{Exponential}(\lambda)$ random variables.

Suppose that, instead of asking for the time between successive incidents, we ask for the total number of incidents that occur between time 0 and time $t$. Let $X$ denote the total number of incidents that occur. Then:

$$X \sim \text{Poisson}(\lambda t).$$

In other words: if we partition $[0, t]$ into $n$ equal intervals, then, for large $n$, each interval contains an incident with small probability, approximately $\lambda t / n$, so the count of occupied intervals, $X(n)$, is approximately $\text{Binomial}(n, \lambda t/n)$.

In the setting described above, the limit that appears in the Law of Small Numbers is sensible. The expected value of $X(n)$ should converge to something sensible as $n$ diverges since the number of intervals in the partition, $n$, was an arbitrary number introduced to help analyze $X = \lim_{n \rightarrow \infty} X(n)$.
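This account can be tested by simulation. The sketch below assumes the Section 6.2 model of independent $\text{Exponential}(\lambda)$ waiting times, counts incidents in $[0, t]$, and checks the counts against $\text{Poisson}(\lambda t)$:

```python
import random

random.seed(0)
lam, t, trials = 2.0, 3.0, 100_000

def count_incidents(lam, t):
    """Count incidents in [0, t] when waiting times are Exponential(lam)."""
    clock, count = random.expovariate(lam), 0
    while clock <= t:
        count += 1
        clock += random.expovariate(lam)
    return count

counts = [count_incidents(lam, t) for _ in range(trials)]
mean = sum(counts) / trials          # should be close to lam * t = 6
freq6 = counts.count(6) / trials     # compare to Poisson(6) PMF at 6, about 0.161
print(mean, freq6)
```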

Proof of the Law of Small Numbers

To establish the Law of Small Numbers, we need to show that the Binomial PMF converges to the Poisson PMF in the limit as $n$ diverges, provided $p(n)$ behaves like $\lambda/n$. The support of the Binomial converges to the support of the Poisson since $\{0, 1, 2, ..., n\}$ approaches $\{0, 1, 2, ...\}$ as $n$ diverges.

Substituting $p(n) = \lambda/n$ into the Binomial PMF gives:

$$\binom{n}{x}\left(\frac{\lambda}{n}\right)^x \left(\frac{n - \lambda}{n}\right)^{n-x} = \binom{n}{x}\left(\frac{\lambda}{n - \lambda}\right)^x \left(1 - \frac{\lambda}{n}\right)^{n}.$$

One term is ready for a limit. By the limiting expression for the exponential:

$$\lim_{n \rightarrow \infty}\left(1 - \frac{\lambda}{n}\right)^{n} = e^{-\lambda}.$$

This is the normalizing constant of the Poisson distribution.

Next, we need to work out the limit of:

$$\binom{n}{x}\left(\frac{\lambda}{n - \lambda}\right)^x = \frac{n!}{x!(n - x)!}\left(\frac{\lambda}{n - \lambda}\right)^x.$$

To find this limit, we will expand it as a product. Let:

$$r(x+1) = \frac{\text{Pr}(X = x+1)}{\text{Pr}(X = x)}$$

denote the ratio of successive values of the PMF. Then, the PMF at $x$ can be expanded:

$$\begin{aligned} \text{Pr}(X = x) &= \text{Pr}(X = 0) \times \frac{\text{Pr}(X = 1)}{\text{Pr}(X = 0)} \times \frac{\text{Pr}(X = 2)}{\text{Pr}(X = 1)} \times ... \times \frac{\text{Pr}(X = x)}{\text{Pr}(X = x - 1)} \\ &= \text{Pr}(X = 0) \prod_{y=1}^x r(y) \end{aligned}$$

This form is convenient since:

$$\text{Pr}(X = 0) = \binom{n}{0} p^0 (1 - p)^n = 1 \times 1 \times \left(1 - \frac{\lambda}{n}\right)^{n}$$

which converges to the normalizing constant $e^{-\lambda}$ we derived before.

Moreover, each ratio $r(y)$ is simple:

$$\begin{aligned} r(y) &= \frac{n!}{n!} \frac{(y-1)!}{y!} \frac{(n - (y-1))!}{(n - y)!} \frac{p^y}{p^{y-1}} \frac{(1-p)^{n-y}}{(1-p)^{n-(y-1)}} \\ &= \frac{n - y + 1}{y} \frac{p}{1 - p} \end{aligned}$$

So, if $p = \lambda/n$, then:

$$r(y) = \frac{n - y + 1}{y} \frac{\lambda/n}{1 - \lambda/n} = \frac{n - y + 1}{y} \frac{\lambda}{n - \lambda}$$
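You can watch $r(y)$ stabilize by evaluating the formula above for growing $n$; here $\lambda = 5$ and $y = 3$, so the values should approach $\lambda/y = 5/3$:

```python
lam, y = 5.0, 3

# r(y) for a Binomial(n, lam/n), evaluated at several n
rs = {n: (n - y + 1) / y * lam / (n - lam) for n in (10, 100, 10_000)}
print(rs)  # values approach lam / y = 1.666...
```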

Now, if $n$ diverges, $n$ will dominate $-y + 1$, so the first factor simplifies to $n/y$, and $n$ will dominate $-\lambda$, so the second factor simplifies to $\lambda/n$. Therefore:

$$\lim_{n \rightarrow \infty} \frac{n - y + 1}{y} \frac{\lambda}{n - \lambda} = \lim_{n \rightarrow \infty} \frac{n}{y} \frac{\lambda}{n} = \frac{\lambda}{y}.$$

So, the PMF at $x$, in the limit of infinite $n$, is:

$$\text{PMF}(x) = \text{Pr}(X = 0) \prod_{y=1}^x r(y) = e^{-\lambda} \prod_{y=1}^x \frac{\lambda}{y} = e^{-\lambda} \frac{\lambda^x}{x \times (x-1) \times ... \times 2 \times 1} = e^{-\lambda} \frac{\lambda^x}{x!}.$$

The right hand side is the Poisson PMF! $\square$

Normal Distributions

We express the statement, $X$ is drawn from a standard normal distribution, as:

$$X \sim \text{Normal}(0,1).$$

You can experiment with the normal distribution using the code cell below. Notice that changing the mean parameter translates the density, while changing the standard deviation dilates it.

```python
from utils_dist import run_distribution_explorer

run_distribution_explorer("Normal");
```

The normal distribution is an important model since it arises as the limit of many other distributions. In particular, if we draw a set of $n$ independent, identical samples from a distribution with finite variance, and compute their sample average, then the sample average will be approximately normally distributed for large $n$.

In this section, we’ll recover the formula for the standard normal density:

$$\text{PDF}(x) = \frac{1}{\sqrt{2\pi}} e^{-\frac{1}{2} x^2}$$

from a limit of Binomial random variables.

... as a Binomial Limit

First, run the code cell below to visualize a Binomial PMF.

```python
from utils_dist import run_distribution_explorer

run_distribution_explorer("Binomial");
```

This time, keep the success probability, $p$, fixed, and increase $n$. Start with $p = 0.5$ and $n = 4$, then gradually increase $n$. You should see that, even for relatively small $n$, the Binomial PMF approaches a bell curve shape. If you repeat this experiment for $p \neq 0.5$ you’ll see the same result, though the bell curve will start out skewed.

So, consider the Binomial PMF with $p = 0.5$:

$$\begin{aligned} \text{PMF}(x) &= \binom{n}{x}\left(\frac{1}{2}\right)^x \left(1 - \frac{1}{2}\right)^{n-x} \\ &= \binom{n}{x}\left(\frac{1}{2}\right)^x \left(\frac{1}{2}\right)^{n-x} \\ &= \binom{n}{x}\left(\frac{1}{2}\right)^n. \end{aligned}$$

So, when $p = 1/2$, the Binomial PMF is proportional to the choose coefficient as a function of $x$:

$$\text{PMF}(x) \propto \binom{n}{x}.$$

This means that our analysis of the Binomial PMF will serve two ends at once. First, we will see that, as $n$ diverges, the Binomial PMF produces a normal curve. Second, by showing that the Binomial PMF approaches a normal curve, we will also develop a normal approximation for binomial coefficients.

Before starting, we will have to fix a basic discrepancy between our two models.

If $Z \sim \text{Normal}(0,1)$ then $\mathbb{E}[Z] = 0$ and $\text{Var}[Z] = 1$.

In contrast, if $X_n \sim \text{Binomial}(n,p)$ then $\mathbb{E}[X_n] = np$ and $\text{Var}[X_n] = np(1-p)$. So, $X_n$ cannot converge to $Z$ as $n$ diverges, since the expected value of a Binomial distribution is proportional to $n$ and is nonzero when $p \neq 0$. Worse, its standard deviation grows at rate $\mathcal{O}(\sqrt{n})$.

To fix this issue, we will show that a standardized version of $X_n$ approaches $Z$. To standardize a random variable, subtract off its mean and divide by its standard deviation (see Section 4.3).

So, let:

$$Z_n = \frac{X_n - np}{\sqrt{np(1-p)}}$$

We’ve added the subscript “$n$” to $Z$ to indicate that $Z_n$ is a standardized Binomial random variable on $n$ trials.

When $p = 0.5$:

$$Z_n = \frac{X_n - n \times 0.5}{\sqrt{n \times 0.5^2}} = \frac{1}{\sqrt{n}}(2X_n - n).$$
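As a sanity check, the standardization really does produce mean 0 and variance 1 at every finite $n$. A sketch that computes both exactly from the $\text{Binomial}(n, 1/2)$ PMF:

```python
import math

n = 100
mu, sigma = n * 0.5, math.sqrt(n * 0.25)        # mean and sd of Binomial(n, 1/2)

pmf = [math.comb(n, x) * 0.5**n for x in range(n + 1)]
zs = [(x - mu) / sigma for x in range(n + 1)]   # possible values of Z_n

mean = sum(w * z for w, z in zip(pmf, zs))
var = sum(w * z * z for w, z in zip(pmf, zs)) - mean**2
print(mean, var)  # 0 and 1, up to floating point error
```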

In terms of $Z_n$:

$$X_n = \frac{1}{2}(n + \sqrt{n} Z_n) = \frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} Z_n\right).$$

Now, the PMF of $Z_n$ is:

$$\text{Pr}(Z_n = z) = \text{Pr}\left(\frac{1}{\sqrt{n}}(2X_n - n) = z\right) = \text{Pr}\left(X_n = \frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)\right).$$

Then, using the formula for the Binomial PMF:

$$\text{Pr}(Z_n = z) = \left(\frac{1}{2}\right)^n \binom{n}{\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)}.$$

To simplify, first expand the Binomial coefficient as a ratio of factorials:

$$\binom{n}{\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)} = \frac{n!}{\left(\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)\right)! \times \left(\frac{n}{2}\left(1 - \frac{1}{\sqrt{n}} z\right)\right)!}$$

Next, apply Stirling’s approximation to approximate each term:

$$\begin{aligned} &n! \simeq \sqrt{2\pi e}\left(\frac{n}{e}\right)^{n + \frac{1}{2}}, \\ &\left(\frac{n}{2}\left(1 \pm \frac{1}{\sqrt{n}} z\right)\right)! \simeq \sqrt{2\pi e}\left(\frac{n}{2e}\left(1 \pm \frac{1}{\sqrt{n}} z\right)\right)^{\frac{n}{2}\left(1 \pm \frac{1}{\sqrt{n}} z\right) + \frac{1}{2}} \end{aligned}$$

Substituting each term for its approximation, then cancelling like terms, gives:

$$\binom{n}{\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)} \sim \frac{2^{n+1}}{\sqrt{2\pi(n - z^2)}}\left(1 + \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)}\left(1 - \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 - \frac{1}{\sqrt{n}} z\right)}$$

Therefore:

$$\begin{aligned} \text{Pr}(Z_n = z) &= \left(\frac{1}{2}\right)^n \binom{n}{\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)} \\ &\simeq \frac{2}{\sqrt{2\pi(n - z^2)}}\left(1 + \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)}\left(1 - \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 - \frac{1}{\sqrt{n}} z\right)}. \end{aligned}$$
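The Stirling-based approximation above is already quite accurate at moderate $n$. A sketch comparing it to the exact coefficient, with $n = 400$ and $z = 1$ chosen so that $\frac{n}{2}(1 + z/\sqrt{n}) = 210$ is an integer:

```python
import math

def approx_coeff(n, z):
    """Stirling-based approximation to C(n, n/2 * (1 + z/sqrt(n)))."""
    a = 1 + z / math.sqrt(n)
    b = 1 - z / math.sqrt(n)
    return (2**(n + 1) / math.sqrt(2 * math.pi * (n - z * z))
            * a**(-n / 2 * a) * b**(-n / 2 * b))

n, z = 400, 1.0
k = round(n / 2 * (1 + z / math.sqrt(n)))   # = 210
rel_err = abs(approx_coeff(n, z) - math.comb(n, k)) / math.comb(n, k)
print(rel_err)  # small relative error
```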

When $n$ is large, $n - z^2$ will be dominated by $n$. Therefore, we can make the approximation:

$$\text{Pr}(Z_n = z) \simeq \frac{2}{\sqrt{n}} \frac{1}{\sqrt{2\pi}}\left(1 + \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)}\left(1 - \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 - \frac{1}{\sqrt{n}} z\right)}.$$

The standard normal random variable, $Z$, is continuously distributed, so it is parameterized by a density. Each $Z_n$ is a discrete random variable. To recover a density from a probability, we need to divide by the length of an interval.

In this case we can construct a density from $Z_n$ by replacing $Z_n$ with a random variable $W_n$, where $W_n | Z_n = z \sim \text{Uniform}(z - \Delta z_n/2, z + \Delta z_n/2)$ and $\Delta z_n$ is the gap between successive possible values of $Z_n$. Since $Z_n = \frac{1}{\sqrt{n}}(2X_n - n)$, and $X_n$ is integer valued, $\Delta z_n = \frac{2}{\sqrt{n}}$.

Then, work with the density function of $W_n$:

$$\frac{1}{\Delta z_n}\text{Pr}(Z_n = z) = \frac{1}{\sqrt{2\pi}}\left(1 + \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)}\left(1 - \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 - \frac{1}{\sqrt{n}} z\right)}.$$

This procedure is the same as:

  1. Representing the PMF of $Z_n$ with a bar plot. The width of each bar is $\Delta z_n$.

  2. Scaling the height of the bars by their widths so that their area returns the PMF value. This returns the density function for $W_n$.

Integrating over the density function of $W_n$, with bounds equal to the endpoints of the bars, will sum over the PMF of $Z_n$. So, all probability questions we could ask about $Z_n$ can be answered by integrating over the density function of $W_n$.
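Putting the pieces together, the rescaled PMF should already track the standard normal density at moderate $n$. A sketch using exact Binomial probabilities rather than the Stirling approximation:

```python
import math

def normal_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2 * math.pi)

n = 400
dz = 2 / math.sqrt(n)                  # gap between successive Z_n values
errs = {}
for x in (180, 200, 220):              # points near the center of the PMF
    z = (2 * x - n) / math.sqrt(n)     # z = -2, 0, 2
    density = math.comb(n, x) * 0.5**n / dz
    errs[z] = abs(density - normal_pdf(z))
print(errs)  # small gaps at each point
```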

Now that we’ve handled the normalizing constants, focus on the functional form:

$$\begin{aligned} \left(1 + \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 + \frac{1}{\sqrt{n}} z\right)}\left(1 - \frac{1}{\sqrt{n}} z\right)^{-\frac{n}{2}\left(1 - \frac{1}{\sqrt{n}} z\right)} &= \left[\left(1 + \frac{1}{\sqrt{n}} z\right) \times \left(1 - \frac{1}{\sqrt{n}} z\right)\right]^{-\frac{n}{2}}\left(\frac{1 - \frac{1}{\sqrt{n}} z}{1 + \frac{1}{\sqrt{n}} z}\right)^{\frac{z}{2}\sqrt{n}} \\ &= \left(1 - \frac{1}{n} z^2\right)^{-\frac{n}{2}}\left(\frac{1 - \frac{1}{\sqrt{n}} z}{1 + \frac{1}{\sqrt{n}} z}\right)^{\frac{z}{2}\sqrt{n}} \end{aligned}$$

To take the limit as $n$ goes to infinity, express each term in the form used for the limiting definition of the exponential:

$$\begin{aligned} &\left(1 - \frac{1}{n} z^2\right)^{-\frac{n}{2}} = \left[\left(1 - \frac{1}{n} z^2\right)^{n}\right]^{-\frac{1}{2}} \simeq \left[e^{-z^2}\right]^{-\frac{1}{2}} = e^{\frac{1}{2} z^2} \\ &\left(1 - \frac{1}{\sqrt{n}} z\right)^{\frac{z}{2}\sqrt{n}} = \left[\left(1 - \frac{1}{\sqrt{n}} z\right)^{\sqrt{n}}\right]^{\frac{z}{2}} \simeq \left[e^{-z}\right]^{\frac{z}{2}} = e^{-\frac{1}{2} z^2} \\ &\left(1 + \frac{1}{\sqrt{n}} z\right)^{-\frac{z}{2}\sqrt{n}} = \left[\left(1 + \frac{1}{\sqrt{n}} z\right)^{\sqrt{n}}\right]^{-\frac{z}{2}} \simeq \left[e^{z}\right]^{-\frac{z}{2}} = e^{-\frac{1}{2} z^2} \end{aligned}$$

Therefore:

$$\lim_{n \rightarrow \infty}\left(1 - \frac{1}{n} z^2\right)^{-\frac{n}{2}}\left(\frac{1 - \frac{1}{\sqrt{n}} z}{1 + \frac{1}{\sqrt{n}} z}\right)^{\frac{z}{2}\sqrt{n}} = e^{\frac{1}{2} z^2} \times e^{-\frac{1}{2} z^2} \times e^{-\frac{1}{2} z^2} = e^{-\frac{1}{2} z^2}$$
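These limits can also be checked numerically: for large but finite $n$, the three factors already multiply out to nearly $e^{-z^2/2}$. A quick sketch:

```python
import math

z, n = 1.5, 1_000_000
s = math.sqrt(n)
t1 = (1 - z * z / n)**(-n / 2)     # approaches e^{ z^2/2}
t2 = (1 - z / s)**(z / 2 * s)      # approaches e^{-z^2/2}
t3 = (1 + z / s)**(-z / 2 * s)     # approaches e^{-z^2/2}
product = t1 * t2 * t3
print(product, math.exp(-0.5 * z * z))
```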

So:

$$\lim_{n \rightarrow \infty} \frac{1}{\Delta z_n}\text{Pr}(Z_n = z) = \frac{1}{\sqrt{2\pi}} e^{-\frac{1}{2} z^2}.$$

The expression on the left converges to a PDF since $\Delta z_n$ converges to zero as $n$ diverges. The expression on the right is the standard normal density function. Therefore, the standardized Binomial random variables $Z_n$ converge in distribution to $Z \sim \text{Normal}(0,1)$. $\square$

It follows that, for large $n$, a $\text{Binomial}(n, 1/2)$ random variable is approximately normal, with mean $n/2$ and standard deviation $\sqrt{n}/2$.