Phew. That was a lot to think through.
Here’s a summary of the important definitions and results from Chapter 1. Reading the summary is not a substitute for reading the chapter. It will provide most of the information to complete your studysheets, but copying results from the summary to the studysheet is not a substitute for completing the sheet yourself. Make sure you can find where each result listed here was explained in the main chapter text.
If there is one table to summarize the chapter, it is this:
Outcomes, Events, and Sets
These results are all explained in Section 1.1.
A random process is some process that produces unpredictable outcomes
An outcome is a specific, distinct result of the process
The outcome space, $\Omega$, is the set of all possible outcomes
An event is any collection of outcomes. Events are subsets of $\Omega$.
Sets may be defined:
explicitly by listing their entries. For example, $A = \{1, 2, 3\}$.
implicitly by defining rules that the entries all must satisfy, and that, if satisfied, ensure membership in the set. For example, $A = \{x : x \text{ is an even integer}\}$.
The size of a set, $A$, is denoted $|A|$. If a set is finite, then its size is the number of entries in the set.
Logic and set operations
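Sets can be defined by combining a collection of rules into logical sentences. For instance, $\{x : x \text{ is an integer and } x > 0\}$.
Appending not before a set’s implicit definition produces the set complement, $A^c$. For example, if $A = \{x : x > 0\}$ then $A^c = \{x : \text{not } x > 0\}$.
Concatenating sets with an or produces their union, $A \cup B$. For example, if $A = \{1, 2, 3\}$ and $B = \{3, 4\}$ then $A \cup B = \{1, 2, 3, 4\}$.
Concatenating sets with an and produces their intersection, $A \cap B$. For example, if $A = \{1, 2, 3\}$ and $B = \{3, 4\}$ then $A \cap B = \{3\}$.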
Modifying a probability statement with an if adds conditions that restrict the space of possible outcomes $\Omega$. We denote if with a vertical bar, $\mid$.
In summary:
| Logical | Set Operation | Notation |
|---|---|---|
| not | complement | $A^c$ |
| or | union | $A \cup B$ |
| if | restrict | $A \mid B$ |
| and | intersect | $A \cap B$ |
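
The correspondence between logical words and set operations can be tried out directly with Python's built-in `set` type. This is a minimal sketch, assuming a hypothetical example (one roll of a six-sided die); the names `omega`, `A`, and `B` are illustrative and do not come from the chapter.

```python
# Hypothetical example: one roll of a six-sided die.
omega = {1, 2, 3, 4, 5, 6}                 # outcome space

# Implicit definition: a rule that every entry must satisfy.
A = {x for x in omega if x % 2 == 0}       # "the roll is even"
# Explicit definition: list the entries.
B = {4, 5, 6}                              # "the roll is at least 4"

not_A = omega - A      # not -> complement (relative to the outcome space)
A_or_B = A | B         # or  -> union
A_and_B = A & B        # and -> intersection

# if -> restrict: B becomes the new outcome space,
# and only the outcomes of A that lie inside B remain possible.
restricted_space = B
A_if_B = A & restricted_space

print(len(A))          # size of the set A
print(not_A, A_or_B, A_and_B, A_if_B)
```

Here the complement is taken relative to `omega`, which is why the outcome space has to be stated before "not" has a meaning.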
Probability as Proportion
These results are all explained in Section 1.2.
A probability measure is a function that accepts events and returns their chance. We denote the measure $\Pr$, so $\Pr(A)$ is the chance the event $A$ occurs.
Probability as Frequency: The chance of an event equals the long run frequency with which it would occur in an arbitrarily long sequence of trials
It follows that:
All chances are between 0 and 1
The chance that something happens, $\Pr(\Omega)$, equals 1
Chances for disjoint events add: $\Pr(A \cup B) = \Pr(A) + \Pr(B)$ if $A$ and $B$ are disjoint.
Expanding an event to include more outcomes never makes it less likely. Contracting an event so it includes fewer outcomes never makes it more likely.
We say that all outcomes are equally likely if:
They would occur with the same long run frequency
We have no better model and want to start simple
The features that distinguish outcomes cannot possibly influence their frequency, or the process that selects outcomes
If all outcomes are equally likely then probability is equivalent to proportion:
The probability of every outcome is $1/|\Omega|$ where $|\Omega|$ is the number of possible outcomes
The probability of every event $A$ is: $\Pr(A) = |A| / |\Omega|$
So, if all outcomes are equally likely, we can compute probabilities by (a) enumerating the outcome space, (b) counting the number of possible outcomes, (c) enumerating the event, (d) counting the number of ways the event can happen, and (e) evaluating their ratio.
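
The counting recipe can be sketched in a few lines of Python. This is an illustration under stated assumptions, not an example from the chapter: it assumes two fair six-sided dice with all ordered pairs equally likely, and the event "the two rolls sum to 7" is a hypothetical choice.

```python
from fractions import Fraction
from itertools import product

# (a) Enumerate the outcome space: ordered pairs of rolls from two dice.
omega = list(product(range(1, 7), repeat=2))

# (b) Count the number of possible outcomes.
n_outcomes = len(omega)

# (c) Enumerate the event "the two rolls sum to 7".
event = [(first, second) for (first, second) in omega if first + second == 7]

# (d) Count the number of ways the event can happen.
n_event = len(event)

# (e) Evaluate the ratio. Fraction keeps the answer exact.
prob = Fraction(n_event, n_outcomes)
print(n_event, n_outcomes, prob)  # 6 36 1/6
```

The ratio is only a valid probability because the 36 ordered pairs are assumed equally likely; counting unordered sums (2 through 12) as the outcomes would give the wrong answer.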
The Rules of Chance
All of these results are explained in Section 1.3.
A probability model is a choice of outcome space, all relevant events, and probability measure, such that:
Nonnegativity: $\Pr(A) \geq 0$ for all events $A$.
Normalization: $\Pr(\Omega) = 1$.
Additivity: $\Pr(A \cup B) = \Pr(A) + \Pr(B)$ if $A$ and $B$ are disjoint.
Ensuing probability rules:
Complements: $\Pr(A^c) = 1 - \Pr(A)$
Sub-additivity: $\Pr(A \cup B) \leq \Pr(A) + \Pr(B)$
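
As a quick numerical check of the rules in this section, the sketch below builds a small probability model and asserts each rule on it. It assumes a hypothetical fair six-sided die with equally likely outcomes; the helper `pr` and the events `A`, `B`, and `C` are illustrative names, not part of the chapter.

```python
from fractions import Fraction

# Assumed model: one fair six-sided die, all outcomes equally likely.
omega = frozenset(range(1, 7))

def pr(event):
    """Probability as proportion: |event| / |omega|."""
    return Fraction(len(event & omega), len(omega))

A = {2, 4, 6}   # the roll is even
B = {4, 5, 6}   # the roll is at least 4
C = {1}         # the roll is 1 (disjoint from A)

assert pr(A) >= 0                          # nonnegativity
assert pr(omega) == 1                      # normalization
assert A.isdisjoint(C)
assert pr(A | C) == pr(A) + pr(C)          # additivity for disjoint events
assert pr(omega - A) == 1 - pr(A)          # complement rule
assert pr(A | B) <= pr(A) + pr(B)          # sub-additivity (A and B overlap)

print(pr(A | B), pr(A) + pr(B))
```

A check on one model is not a proof, but it is a cheap way to catch a misremembered rule.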
Joint and Marginal Probability
All of these results are explained in Section 1.4.
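A joint probability is the probability that two events both happen: $\Pr(A \cap B)$
Since $A \cap B$ is contained in both $A$ and $B$, $\Pr(A \cap B) \leq \Pr(A)$ and $\Pr(A \cap B) \leq \Pr(B)$.
Given a collection of joint probabilities, the marginal probabilities are the chances of the individual events, $\Pr(A)$ and $\Pr(B)$.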
The act of breaking an event into all the ways it can occur is called partitioning (breaking into disjoint parts)
The act of summing the chances of disjoint parts is called marginalization
Joint and marginal probabilities may be arranged into a joint probability table where:
The joint probabilities in any row or column must sum to the corresponding marginal
The sum of all joint probabilities must equal 1
The marginals of any complementary pair of events must sum to 1, for example $\Pr(A) + \Pr(A^c) = 1$
For example:
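| Event | $B$ | not $B$ | Marginals |
|---|---|---|---|
| $A$ | $\Pr(A \cap B)$ | $\Pr(A \cap B^c)$ | $\Pr(A)$ |
| not $A$ | $\Pr(A^c \cap B)$ | $\Pr(A^c \cap B^c)$ | $\Pr(A^c)$ |
| Marginals | $\Pr(B)$ | $\Pr(B^c)$ | 1 |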
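
Marginalization can be sketched by storing a joint table and summing across rows and columns. The joint probabilities below are made-up numbers chosen only so that they sum to 1; the labels and variable names are likewise assumptions for illustration.

```python
from fractions import Fraction

rows = ["A", "not A"]
cols = ["B", "not B"]

# Hypothetical joint probabilities, keyed by (row event, column event).
joint = {
    ("A", "B"): Fraction(1, 10),
    ("A", "not B"): Fraction(3, 10),
    ("not A", "B"): Fraction(2, 10),
    ("not A", "not B"): Fraction(4, 10),
}

# Marginalize: sum the disjoint parts in each row and each column.
row_marginals = {r: sum(joint[(r, c)] for c in cols) for r in rows}
col_marginals = {c: sum(joint[(r, c)] for r in rows) for c in cols}

# Consistency checks for a joint probability table.
assert sum(joint.values()) == 1
assert row_marginals["A"] + row_marginals["not A"] == 1
assert col_marginals["B"] + col_marginals["not B"] == 1

print(row_marginals)
print(col_marginals)
```

Each marginal is a sum over a partition: the event "A" is split into the disjoint parts "A and B" and "A and not B".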
Conditional Probability
All of these results are explained in Section 1.5.
A conditional probability is the probability of one event given that another occurs: $\Pr(A \mid B)$
Conditioning on an event, $B$, restricts the set of possible outcomes to $B$
Conditioning on $B$ does not change the relative likelihood (i.e. the odds) of any outcomes in $B$
Normalization is the action of scaling a list of nonnegative numbers by their sum
To find conditional probabilities from a joint probability table:
Excerpt the appropriate rows or columns of the joint table
Scale all entries by their sum, which equals the marginal assigned to the row/column (i.e. normalize)
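
The two steps above (excerpt, then normalize) can be sketched as follows. The joint probabilities are hypothetical numbers, and `normalize` is an illustrative helper name rather than anything defined in the chapter.

```python
from fractions import Fraction

# Hypothetical joint probabilities, keyed by (row event, column event).
joint = {
    ("A", "B"): Fraction(1, 10),
    ("A", "not B"): Fraction(3, 10),
    ("not A", "B"): Fraction(2, 10),
    ("not A", "not B"): Fraction(4, 10),
}

def normalize(values):
    """Scale a list of nonnegative numbers by their sum."""
    total = sum(values)
    return [v / total for v in values]

# Step 1: condition on B by excerpting the column for B.
column_B = [joint[("A", "B")], joint[("not A", "B")]]

# Step 2: scale the entries by their sum, which is the marginal for B.
pr_A_given_B, pr_notA_given_B = normalize(column_B)

print(pr_A_given_B, pr_notA_given_B)  # 1/3 2/3
```

The odds of "A" against "not A" inside the column are 1 to 2 both before and after scaling, which is the sense in which conditioning leaves relative likelihoods unchanged.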
The conditional probability of $A$ given $B$ is always the ratio of a joint to a marginal: $\Pr(A \mid B) = \Pr(A \cap B) / \Pr(B)$
The multiplication rule expresses any joint as a product of a marginal and a conditional: $\Pr(A \cap B) = \Pr(B) \Pr(A \mid B) = \Pr(A) \Pr(B \mid A)$
An outcome tree is a diagram with one node for every possible event in a sequence of events and arrows for the possible transitions between nodes. Each arrow is labelled with the marginal or conditional probability of its transition.
We can use the multiplication rule to compute chances by evaluating products along paths in outcome trees
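Bayes' Rule recovers $\Pr(B \mid A)$ from marginals for $B$ and conditionals for $A$ given $B$: $\Pr(B \mid A) = \dfrac{\Pr(A \mid B) \Pr(B)}{\Pr(A \mid B) \Pr(B) + \Pr(A \mid B^c) \Pr(B^c)}$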
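
A small worked sketch shows how the multiplication rule, marginalization, and Bayes' Rule fit together along the paths of a two-level outcome tree. All numbers are hypothetical and the variable names are illustrative; the first branch is whether B occurs and the second is whether A occurs.

```python
from fractions import Fraction

# Assumed inputs: a marginal for B and conditionals for A given B / not B.
pr_B = Fraction(1, 100)
pr_A_given_B = Fraction(9, 10)
pr_A_given_notB = Fraction(1, 10)

# Multiplication rule: multiply probabilities along each path of the tree.
pr_A_and_B = pr_B * pr_A_given_B                 # path: B, then A
pr_A_and_notB = (1 - pr_B) * pr_A_given_notB     # path: not B, then A

# Marginalization: A happens via exactly one of the two disjoint paths.
pr_A = pr_A_and_B + pr_A_and_notB

# Bayes' Rule: the conditional is a joint divided by a marginal.
pr_B_given_A = pr_A_and_B / pr_A

print(pr_A, pr_B_given_A)  # 27/250 1/12
```

With these assumed numbers, observing A raises the chance of B from 1/100 to 1/12, even though A is nine times more likely under B than under not B, because B starts out rare.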
Independent Events
All of these results are explained in Section 1.6.
Events $A$ and $B$ are independent if and only if any of the following are true:
Knowing the outcome of one tells us nothing about the other.
$\Pr(A \mid B) = \Pr(A)$ and $\Pr(B \mid A) = \Pr(B)$; that is, the conditionals equal the marginals because we learn nothing by conditioning
$\Pr(A \mid B) = \Pr(A \mid B^c)$; that is, the conditionals don’t depend on the conditioning statement, since the events tell us nothing about each other
$\Pr(A \cap B) = \Pr(A) \Pr(B)$; that is, the joint is the product of the marginals
This is a special case of the general multiplication rule. Only use it for independent events.
This is useful for computing joint probabilities and checking independence.
Do not take this as the definition of independence. It’s really a consequence.
If two events are not independent, then they are dependent.
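
Independence can be checked numerically by comparing a conditional to a marginal, or a joint to a product of marginals. The sketch below assumes two fair six-sided dice with equally likely ordered pairs; the events and the helper names `pr` and `independent` are hypothetical choices for illustration.

```python
from fractions import Fraction
from itertools import product

# Assumed model: two fair dice, all 36 ordered pairs equally likely.
omega = set(product(range(1, 7), repeat=2))

def pr(event):
    """Probability as proportion of the outcome space."""
    return Fraction(len(event), len(omega))

def independent(E, F):
    """Check the product rule: joint equals product of marginals."""
    return pr(E & F) == pr(E) * pr(F)

A = {w for w in omega if w[0] % 2 == 0}   # first roll is even
B = {w for w in omega if sum(w) == 7}     # rolls sum to 7
C = {w for w in omega if sum(w) == 8}     # rolls sum to 8

# The conditional equals the marginal exactly when the events are independent.
pr_A_given_B = pr(A & B) / pr(B)
pr_A_given_C = pr(A & C) / pr(C)

print(independent(A, B), pr_A_given_B, pr(A))  # True 1/2 1/2
print(independent(A, C), pr_A_given_C, pr(A))  # False 3/5 1/2
```

Learning that the rolls sum to 7 says nothing about whether the first roll is even, while learning that they sum to 8 makes an even first roll more likely, so A and C are dependent.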