Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

1.1 Outcomes, Events and Sets

Probability is used (and misused) in amazingly diverse ways. We might ask for the probability that:

  • it rains tomorrow, or an earthquake strikes,

  • a lottery ticket hits big, or a poker hand wins,

  • an atom decays, or an electron tunnels and flips a computer bit,

  • the stock market crashes, or a stock doubles in value,

  • a political candidate wins an election, or a poll returns a spurious result,

  • a medical test returns a true detection, or a recommended vaccine is safe,

  • an image contains a pedestrian and the pedestrian is waiting to cross the road, or that the last three letters in this list are etc.

To study probability we will need an equally flexible language that can sensibly discuss electrons, policitians, stock prices, and word prediction.

Outcomes and Outcome Spaces

Any random process or experiment produces outcomes that cannot be predicted from information that is available before the outcome occurs. We will use the following language to describe the possible outcomes of a random process:

If we want to define a set by listing its entries, we use curly braces, {...}\{...\}, and use commas to separate distinct elements of the set. For instance, if Ω=5|\Omega| = 5, then Ω={ω1,ω2,ω3,ω4,ω5}\Omega = \{\omega_1,\omega_2,\omega_3,\omega_4,\omega_5\} where ωj\omega_j is the jthj^{th} possible outcome. The order of the elements in the list does not matter.

Sets and Subsets

Sets can contain smaller sets. For instance, the set {HH,HT,TH,HH}\{HH,HT,TH,HH\} contains the smaller set {HH,HT,TH}\{HH,HT,TH\}. It is common to denote sets contained in the outcome space with capital letters, e.g. A={HH,HT,TH}A = \{HH,HT,TH\}. We use the \in symbol to show that an element is contained in a set. For instance, HTAHT \in A.

Now that we have multiple sets, we can start comparing their entries. It is standard practice to start from a Venn diagram:

Venn Diagram.

Every set of outcomes is a subset of the outcome space Ω\Omega. Every set of outcomes, AA, divides Ω\Omega into two disjoint parts since every element of Ω\Omega is either in AA, or not in AA. We let AcA^c denote the complement of AA. This is collection of all outcomes in Ω\Omega that are not in AA. Any collection of disjoint sets that break a larger set into disjoint pieces is a partition of the larger set. For instance, the pair {A,Ac}\{A,A^c\} is a partition of Ω\Omega.

If A=ΩA = \Omega, then AcA^c is empty, since every outcome is in AA. The empty set is the opposite of Ω\Omega. It contains no outcomes. The empty set is given the special symbol \emptyset.

Events

Sets can be defined explicitly by listing their entries. Sets can also be defined implicitly by introducing a list of conditions that, if satisfied, gaurantee that an outcome is contained in the set.

The relationship between a condition (or list of conditions) on outcomes, and a collection of outcomes that satisfy the condition, is the first key idea in probability. It relates natural language descriptions of events to precise collections of outcomes.

Any time we go to ask a question of the form, “What is the probability that ...” or “What is the chance that ...” we will need to replace the ellipsis (...) with an event. For instance, “What is the probability that the sum of two die add to an even value?” or “What is the probability that a randomly sampled student is a Data Science major and is pursuing a double major in Economics or Engineering?”

If we randomly poll a student and the student:

  1. is a Data Science major, and

  2. is pursuing a double major, and

  3. their double major is either in Economics or Engineering

then the event whose chance we were interested in has occured.

Every event is a list of conditions on outcomes. So, every time we ask “What is the probability that some event occurs?” We are really asking, “What is the probability that the outcome of this random process satisfies the conditions that define the event?” Since every list of conditions defines a set, EE, “What is the probability that the outcome of this random process is contained in the set EE?”

In notation, we could express this question “What is Pr(ωE)\text{Pr}(\omega \in E)?” or, in standard shorthand, “What is Pr(E)\text{Pr}(E)?”

These observations suggest the following definition:

Many probability questions can be expressed simply, “Find Pr(E)\text{Pr}(E).” We will focus about a third of our effort in this class to this problem, i.e. to computing chances.

Example: Permutations

The following example is adapted from the Data 140 textbook.

Suppose you are shuffling three cards labeled aa, bb, and cc. Then the space of all possible outcomes is

Ω = {abc, acb, bac, bca, cab, cba}\Omega ~=~ \{ abc, ~acb, ~bac, ~bca, ~cab, ~cba \}

The event {abc, acb, bca, cba}\{abc, ~ acb, ~ bca, ~cba \} can be described as “aa appears first or last”. 🛠️ Try filling in the events in the table below:

EventVerbal DescriptionSubset
AAaa appears first{abc,acb}\{abc, acb\}
BBaa and bb are next to each other
CCthe letters are in reverse alphabetical order
DDaa does not appear
EEbb is either first, second, or third
FFthe letters form a word that means ‘taxi’

Notice that we can describe events that contain every possible outcome. Then the event equals Ω\Omega.

We can also describe events that are impossible. If an event is impossible then no outcome satisfies the event, so the corresponding set is empty. Hence, D=D = \emptyset in the table above.

Combining Events

We will often define events as combinations of other events. For instance, we might be interested in the event that a randomly sampled student is either a sophomore or junior. This event concatenates the events, the student is a sophomore, and the event, the student is a junior. Formally, we combine events by listing the constraints that define each component of the event, then rules for combining those constraints. These rules can be boiled down to four logical operations:

  1. not: e.g. event AA does not happen. Applying “not” replaces AA with its complement, AcA^c.

    • So, every time you see (or think) the word not you should consider the event’s complement.

    A complement.
  2. or: e.g. either event AA happens or event BB happens. Applying “or” combines all the outcomes contained in the collections EE and FF.

    • For instance, the set of die rolls that are either even or less than 4 is the set {2,4,6,1,3}\{2,4,6,1,3\}. Notice that we never list an identical outcome twice. This set is produced by listing all the distinct entries that appear in {2,4,6}\{2,4,6\} or {1,2,3}\{1,2,3\}.

    • The set operation that corresponds to or is a union, denoted \cup. For instance, if A={2,4,6}A = \{2,4,6\} and B={1,2,3}B = \{1,2,3\} then AB={1,2,3,4,6}A \cup B = \{1,2,3,4,6\}. You should read ABA \cup B as “AA union BB”. Like in a good marriage, the union of two sets is the collection of all entries owned by both sets.

    • Notice: combining sets with the contraction “or” never makes the sets smaller. If they are distinct, it makes the sets larger. So, adding an “or” statement makes an event more general. The more ways an event can occur, the more likely it is. So, expanding an event with an “or” statements never decreases its chance.

    A union B.
  3. and: e.g. event AA happens and event BB happens. Applying “and” restricts our attention to only the outcomes that are contained in both sets. These are the outcomes that satisfy all of the conditions in AA and BB. For instance, the set of all die rolls that are even and are less than 4 is the set {2}\{2\}.

    • The set operation that corresponds to and is a intersection, denoted \cap. For instance, if A={2,4,6}A = \{2,4,6\} and B={1,2,3}B = \{1,2,3\} then AB={2}A \cap B = \{2\}. You should read ABA \cap B as “AA intersect BB”.

    • Notice: combining sets with the contraction “and” never makes the sets larger. It usually makes the sets smaller. So, adding an “and” statement makes an event more specific. Contracting an event with an “and” statements never increases its chance.

    A intersect B.
  4. if: e.g. what is the probability that event AA happens if event BB happens? Applying if restricts the outcome space. If BB happens, then any outcome ωF\omega \notin F cannot happen. So, since we don’t need to consider impossible events when computing chances, we can replace Ω\Omega with BB.

    • Restricting the outcome space to all outcomes that satisfy an if statement is called conditioning.

    • For instance, the probability that a die roll is less than 4 if the roll is even should be computed using the outcome space {2,4,6}\{2,4,6\}.

    • It is common to use “given that” instead of “if”. For instance, the statements, “Find the probability that a die roll is less than 4 if it is even” is the same as the statement “Find the probability that a die roll is less than 4 given that the roll is even.”

    • The standard notation for “if” is a vertical bar. E.g. Pr(AB)\text{Pr}(A|B) means, “the probability that EE happens if BB happens”.

In summary:

P.S. A Comment on Notation

We introduced a lot of notation in this section. Notation is crucial in mathematics since abstract notation can express a general idea that is realized in many different specific ways. For instance, Pr(E)\text{Pr}(E) means the probability of the event EE, which can serve as shorthand for any of the probabilities listed at the start of the chapter. We could let EE stand for “it rains tomorrow” or “this stock doubles in value in the next quarter” without changing the notation used for the probability of an event. Compressing many different statements with an abstract shorthand is powerful since it will allow us to prove common facts about apparently unrelated systems. It will also allow us to say complicated things quickly.

Notation is also intimidating and exclusionary. Like any language, it is foreign on first exposure, and can seem more like a special code, or pedant’s jargon. We will do our best to avoid unneeded notation and to explain every notation introduced. In exchange, we promise that the notation we do use is worth learning, and expect you to learn it.

If you struggle with new notation, or found the notation above opaque, be proactive, and treat it like you would a foreign language. Make a table, with symbols on one side, name each symbol, then add a plain language description of each symbol or notation. It can help to write out a long form explanation, then a concise, pithy, “key” that you use to express the essence of the idea. Try to make the “key” memorable.

Many ideas in set theory are best represented visually with a Venn diagram. So, try drawing a Venn diagram for the cases A=BA = B, ABA \subset B, BAB \subset A, and AA and BB disjoint. Make space in your table for these visuals. You’ll rehearse these skills in your first discussion section.