Section 3.1 Random Variables and Discrete Probability Distributions
Assigning Values to Outcomes.
There are times when we are interested not in the sample space of an experiment, but rather in values we assign to the sample space. Consider the following examples.
Example 3.1.1. Tossing a Coin and Counting Heads.
You toss a fair coin three times and count the number of heads. List both the outcomes for the experiment and the possible values that result from the experiment.
The outcomes in the experiment make up the sample space shown below.
\begin{equation*} \lbrace HHH, HHT, HTH, HTT, THH, THT, TTH, TTT \rbrace \end{equation*}
However, the values that we are interested in are the possible numbers of heads that come up. These are:
\begin{equation*} 0, 1, 2, \text{ or } 3\text{.} \end{equation*}
Note that the values which we are interested in are the numbers 0 (no heads), 1 (one head), 2 (two heads) or 3 (three heads). Each value is associated with a certain subset of the sample space. For example, the value 2 is generated by the event \(\lbrace HHT, HTH, THH \rbrace\text{.}\) Not all of our examples are this straightforward. Consider the following.
Example 3.1.2. Selecting a Person and Measuring Height.
You select a random US Citizen and measure their height in feet and inches. List both the outcomes for the experiment and describe the possible values that result from the experiment.
The outcomes in the experiment are all of the citizens of the US that could be chosen.
The values are all of the possible heights of US citizens.
In this example, we don't know what outcomes make up the event giving a height of 6.0 ft. However, we are fairly certain that there are many different outcomes (i.e. people) who have a height of 6.0 feet.
In this section, we will learn how to assign values to events in a sample space in a systematic way. These values are said to be the values of a random variable. As we shall see, random variables, just like “ordinary” variables, come in two different varieties: discrete and continuous. We will focus for the first few sections of this chapter on discrete random variables and how to study their entire range of values and probabilities, called their probability distribution.
Objectives
After finishing this section you should be able to
describe the following terms:
expected value
probability distribution
probability histogram
random variable
standard deviation of a discrete random variable
variance of a discrete random variable
accomplish the following tasks:
Recognize and assign values to random variables
Create a probability distribution for discrete random variables
Construct and work with probability histograms
Find the expected value of a discrete random variable.
Find the standard deviation of a discrete random variable.
Subsection 3.1.1 Random Variables
In Chapter 1 we studied data collection and the variables that resulted. In Chapter 2, we looked at probability. We can now combine those two topics—the study of variables and probability—into a single topic, the study of variables that result from random processes. Such variables are, not surprisingly, called random variables.
Definition 3.1.3.
A random variable is a variable that takes on values corresponding to the outcome of some random process.
Think back to Section 2.1 on randomness and simulation. In that section we introduced the notion of a random process and a deterministic process. A simple way to tell if a variable is random or “ordinary” is to consider the source of its values. If the values come from a deterministic process (that is, they will be the same each time the measurement is taken), then the variable is not random. If the underlying process is a random process, then so is the variable. Consider the following example.
Example 3.1.4. Identifying Random Variables.
Determine which, if any, of the following variables are random variables.
The height of your grandmother
The height of a randomly selected grandmother from your city
The number of heads that appear when a fair coin is tossed 10 times
The number of heads that appear when a single, double-headed coin is flipped once.
In each case, we ask ourselves if the value we measure is a result of a random or deterministic process.
Since your grandmother can be assumed to have the same height each time you measure her (within a set time-frame of course), this is a deterministic process. Thus, her height is not a random variable.
When you randomly choose a grandmother to measure, you will get different grandmothers resulting in different heights. The underlying process is random and therefore, this is a random variable.
Flipping a coin is by nature a random event. The number of heads could be anything from 0 to 10 in this process. Therefore, this variable is a random variable.
Since the coin has a heads on both sides, we know for certain that one head will appear. The process is deterministic, and therefore the variable is not random.
Just like “ordinary” variables, random variables can be either discrete or continuous. Recall that a discrete variable is one that takes on a finite number of values (or an infinite number with “spaces” between the values), whereas a continuous variable can take on any value in a given range. Looking back at Example 3.1.4 you should see that both discrete and continuous random variables are represented.
Example 3.1.5. Classifying Random Variables as Discrete or Continuous.
Which of the random variables from Example 3.1.4 is continuous, and which is discrete?
The number of heads in ten flips of a coin must take on one of the values
\begin{equation*} 0, 1, 2, \ldots, 10\text{.} \end{equation*}
This is therefore a discrete random variable.
The height of a grandmother can take on any value within a “reasonable” range (we don't expect 10ft tall grandmothers, nor do we expect 1ft tall grandmothers). It is therefore a continuous random variable.
In this text, we will follow standard notation and refer to random variables using capital letters from the end of the alphabet, such as \(X\text{,}\) or \(Y\text{.}\) The notation \(X=x\) stands for the set of outcomes in an experiment which give the random variable \(X\) the value \(x\text{.}\) Consider the following revisitation of Example 3.1.1.
Example 3.1.6. Identifying Events Based on Random Variable Values.
A random variable \(X\) is defined to be the number of heads in three flips of a fair coin. Find each of the following.
\(X=1\)
\(X=3\)
\(P(X=2)\)
Remembering that “\(X=x\)” stands for a set of outcomes, we find that:
“\(X=1\)” is the set \(\lbrace HTT, THT, TTH \rbrace\) — in other words, the set of outcomes with \(X=1\) head in the 3 flips.
“\(X=3\)” is the set \(\lbrace HHH \rbrace\) — this is the only outcome in which \(X\) (the number of heads) is 3.
\(P(X=2)\) is the probability of the event \(\lbrace HHT, HTH, THH \rbrace\text{.}\) Since the coin is fair and each flip is independent of the previous flip, each of these outcomes has probability \(\frac{1}{2}\times \frac{1}{2} \times \frac{1}{2} = \frac{1}{8}\text{.}\) Therefore,
\begin{equation*} P(X=2) = \frac{1}{8} + \frac{1}{8} + \frac{1}{8} = \frac{3}{8}\text{.} \end{equation*}
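As a quick check, the four possible values of \(X\) together account for the entire sample space of eight equally likely outcomes:
\begin{equation*} P(X=0) + P(X=1) + P(X=2) + P(X=3) = \frac{1}{8} + \frac{3}{8} + \frac{3}{8} + \frac{1}{8} = 1\text{.} \end{equation*}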
Checkpoint 3.1.9.
Below are the descriptions of several variables.
The number of students in a randomly selected college course at your favorite university
The amount of money that a randomly selected lottery ticket is worth in a certain scratch-ticket lottery game
The length of a randomly selected piece of music from your collection
The number of cats that you own
The length of your favorite piece of music in your collection
The weight of a randomly selected nickel currently in circulation.
Question: identify each of the variables above as a Discrete Random Variable, Continuous Random Variable, Discrete Deterministic Variable, or Continuous Deterministic Variable.
Discrete Random Variable
Discrete Random Variable
Continuous Random Variable
Discrete Deterministic Variable
Continuous Deterministic Variable
Continuous Random Variable
Checkpoint 3.1.10.
A game is played by drawing two marbles, without replacement, from an urn containing 10 black marbles, 2 white marbles, and 1 green marble. For every black marble you draw, you win nothing. For every white marble you draw, you win $5. Finally, drawing the green marble earns you $20. A random variable \(X\) is defined to be the amount of money that you win from your two draws.
Question: what are all of the possible values of \(X\text{?}\)
\(\lbrace 0, 5, 10, 20, 25 \rbrace\)
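To see where these values come from, list the possible pairs of marbles and the corresponding winnings:
\begin{align*} \text{two black:} \amp\ 0+0=0\\ \text{one black, one white:} \amp\ 0+5=5\\ \text{two white:} \amp\ 5+5=10\\ \text{one black, the green:} \amp\ 0+20=20\\ \text{one white, the green:} \amp\ 5+20=25\text{.} \end{align*}
Drawing the green marble twice is impossible, since the urn contains only one green marble.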
Checkpoint 3.1.11.
Two standard dice are rolled and a random variable \(Y\) is defined to be the difference between the largest value shown and the smallest value shown. So, for example,
if you roll a 5 and a 2, then \(Y = 5 - 2 = 3\text{,}\)
if you roll a 3 and a 6, then \(Y = 6 - 3 = 3\text{,}\) and
if you roll a 2 and a 2, then \(Y = 2 - 2 = 0\text{.}\)
Question: how many distinct values of \(Y\) are possible?
6 (the possible values are 0, 1, 2, 3, 4, and 5)
Subsection 3.1.2 Probability Distributions for Discrete Random Variables
Referring back to Chapter 1 once again, recall that we looked at the distribution of a set of data values by creating frequency tables for qualitative variables and frequency distributions for quantitative variables. Once we have defined a random variable for a random process, we can also create a probability distribution for that variable. The difference is that instead of listing a frequency for each value of the variable, we list the probability of that value. Formally, we state this as follows.
Definition 3.1.12.
The probability distribution for a discrete random variable is a formula, table, or graph that assigns to each possible value of \(X\) the probability that that value will occur.
As long as our random variable has a relatively small number of possible values, we use a table to organize our probability distribution, as seen in the next example.
Example 3.1.13. Creating a Probability Distribution Table.
An urn contains seven colored marbles: 4 red, 2 white, and 1 blue. Two marbles are drawn without replacement, and the colors are noted. A random variable \(X\) is defined to be the number of white marbles drawn. Construct the probability distribution for \(X\text{.}\)
The first step in constructing our probability distribution is to list the possible values of \(X\text{.}\) Since we only draw two marbles, the possible number of white marbles drawn is 0, 1, or 2.
The next step is to figure out the probability that \(X\) takes on each of those values.
\(x\) | \(P(X=x)\) |
\(0\) | \(\frac{10}{21}\) |
\(1\) | \(\frac{10}{21}\) |
\(2\) | \(\frac{1}{21}\) |
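These probabilities can be found by counting with combinations, just as we will do again in Example 3.1.45. There are \(C(7,2) = 21\) equally likely pairs of marbles, and 5 of the marbles (the 4 red and the 1 blue) are not white, so
\begin{align*} P(X=0) \amp= \frac{C(5,2)}{C(7,2)} = \frac{10}{21}\text{,}\\ P(X=1) \amp= \frac{C(2,1)\times C(5,1)}{C(7,2)} = \frac{10}{21}\text{,}\\ P(X=2) \amp= \frac{C(2,2)}{C(7,2)} = \frac{1}{21}\text{.} \end{align*}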
Note that in the preceding example, the sum of the probabilities is one. This should always be the case since the values of \(X\) cover all possible outcomes of the experiment. No matter how we present a probability distribution, it must always be the case that \(\sum_x P (X=x) = 1\text{,}\) or in other words, if we add together the probability that \(X=x\) for all values of \(x\text{,}\) we get one.
Example 3.1.15. Finding a Probability Distribution Formula.
A researcher wishes to find a dog owner in a town in which one-third of individuals own a pet dog. A random variable \(Y\) is defined to be the number of people the researcher must ask before finding one who owns a dog. Give a probability distribution for \(Y\text{.}\)
Our first thought might be to construct a table for this probability distribution similar to what we did in the last example. The problem we run into is that there are infinitely many possible values (or at least a very large number, since we don't know how big the town is). So instead, we will construct a formula.
\(Y=y\) means that it took \(y\) attempts to find the first dog owner. So the first \(y-1\) people that we asked were not dog owners. The probability that a given person is not a dog owner is \(\frac{2}{3}\text{.}\) The last person we ask does have a dog, and that probability is \(\frac{1}{3}\text{.}\) This yields the formula:
\begin{equation*} P(Y=y) = \left(\frac{2}{3}\right)^{y-1}\left(\frac{1}{3}\right)\text{.} \end{equation*}
We could construct a partial table for the example above. The first few rows of this table are shown below.
\(y\) | \(P(Y=y)\) |
\(1\) | \(\frac{1}{3}\) |
\(2\) | \(\frac{2}{9}\) |
\(3\) | \(\frac{4}{27}\) |
\(\vdots\) | \(\vdots\) |
The sum of the probabilities in this table is not one because we have not listed all of the possible values of \(Y\text{.}\) That is the reason we prefer using a formula for this particular probability distribution.
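If we did add up the probabilities for every possible value of \(Y\text{,}\) the sum would indeed be one, since these probabilities form a geometric series:
\begin{equation*} \sum_{y=1}^{\infty} \left(\frac{2}{3}\right)^{y-1}\left(\frac{1}{3}\right) = \frac{1}{3}\cdot\frac{1}{1-\frac{2}{3}} = 1\text{.} \end{equation*}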
Checkpoint 3.1.19.
A game starts by rolling a standard die. If an even number is rolled, you get nothing. If an odd number is rolled, you then flip a coin. If the coin comes up heads, you win the amount shown on the die. If the coin comes up tails, you win nothing. A random variable \(W\) is defined to be the amount of money that you win.
Question: find the probability distribution for \(W\text{.}\)
\(w\) | \(P(W=w)\) |
\(0\) | \(\frac{3}{4}\) |
\(1\) | \(\frac{1}{12}\) |
\(3\) | \(\frac{1}{12}\) |
\(5\) | \(\frac{1}{12}\) |
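To check these values: you win nothing whenever the die shows an even number or the coin lands tails, and you win a particular odd amount only when that number is rolled and the coin lands heads. So
\begin{align*} P(W=0) \amp= \frac{1}{2} + \frac{1}{2}\times\frac{1}{2} = \frac{3}{4}\text{,}\\ P(W=1) = P(W=3) = P(W=5) \amp= \frac{1}{6}\times\frac{1}{2} = \frac{1}{12}\text{.} \end{align*}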
Checkpoint 3.1.21.
A random variable \(X\) has the following probability distribution.
\(x\) | \(P(X=x)\) |
\(2\) | \(0.237\) |
\(5\) | \(a\) |
\(8\) | \(0.422\) |
\(12\) | \(0.011\) |
\(70\) | \(0.216\) |
Question: what is the value of “\(a\)” in the distribution above?
\(0.114\)
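This follows from the fact that the probabilities in a distribution must sum to one:
\begin{equation*} a = 1 - (0.237 + 0.422 + 0.011 + 0.216) = 1 - 0.886 = 0.114\text{.} \end{equation*}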
Checkpoint 3.1.23.
Below are four potential probability distributions.
\(w\) | \(P(W=w)\) |
\(-1\) | \(0.50\) |
\(0\) | \(0.50\) |
\(1\) | \(0.50\) |
\(x\) | \(P(X=x)\) |
\(1\) | \(-0.25\) |
\(3\) | \(0.75\) |
\(5\) | \(0.50\) |
\(y\) | \(P(Y=y)\) |
\(-2\) | \(0.33\) |
\(5\) | \(0.33\) |
\(1\) | \(0.34\) |
\(z\) | \(P(Z=z)\) |
\(0\) | \(0.01\) |
\(0.5\) | \(0.90\) |
\(1\) | \(0.09\) |
Question: which of these is a valid probability distribution?
The distributions for \(Y\) and \(Z\)
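To see why, recall that each probability must be a number between 0 and 1 and that the probabilities must sum to one:
\begin{align*} \text{for } W\text{:} \amp\ 0.50 + 0.50 + 0.50 = 1.50 \neq 1\text{,}\\ \text{for } X\text{:} \amp\ P(X=1) = -0.25 \lt 0\text{,}\\ \text{for } Y\text{:} \amp\ 0.33 + 0.33 + 0.34 = 1\text{,}\\ \text{for } Z\text{:} \amp\ 0.01 + 0.90 + 0.09 = 1\text{.} \end{align*}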
Subsection 3.1.3 Probability Histograms
After constructing frequency distributions, the first thing we did was graph those results using a histogram. We can also construct histograms for probability distributions. These histograms are an important step in understanding not only the shape of the distribution, but also an important application of the area principle that we saw in Section 1.2.
Definition 3.1.28.
A probability histogram is a histogram in which each value of the random variable is represented by a bar having a height equal to the probability of that value.
Let's revisit the two examples we saw earlier in this section. We can construct a probability histogram from the table in Example 3.1.13 as follows.
Example 3.1.29. Constructing a Probability Histogram from a Distribution Table.
Recall that in Example 3.1.13 we drew two marbles from an urn containing 4 red, 2 white, and 1 blue marbles. A random variable \(X\) was defined to be the number of white marbles drawn. Use the probability distribution given in the solution to construct a probability histogram.
Based on the distribution we found, which is repeated below, we construct the histogram shown.
\(x\) | \(P(X=x)\) |
\(0\) | \(\frac{10}{21} \approx 0.4762\) |
\(1\) | \(\frac{10}{21} \approx 0.4762\) |
\(2\) | \(\frac{1}{21} \approx 0.0476\) |
The area principle plays an important role in this histogram. Recall that the area principle states that the area of a bar in a histogram must accurately reflect the proportion of the class represented by that bar in the frequency distribution. In a probability histogram, this relationship is even stronger. The area of each bar must be equal to the probability of the value of \(X\) that it represents. Since each bar has a height equal to the probability and a width equal to one, this is indeed the case.
In the second example, Example 3.1.15, we did not have a table, but rather a formula for finding the probability of a given value of the random variable \(Y\text{.}\) We can use this formula to construct a partial probability histogram for this random variable, and we do so below.
Example 3.1.32. Constructing a Probability Histogram from a Distribution Formula.
Recall that in Example 3.1.15 the probability that a researcher would have to ask \(y\) individuals before finding one who owned a dog was given by the formula:
\begin{equation*} P(Y=y) = \left(\frac{2}{3}\right)^{y-1}\left(\frac{1}{3}\right)\text{.} \end{equation*}
Construct a probability histogram for this probability distribution using the first ten values of the random variable.
Limiting ourselves to just the first ten values of the variable, we construct the histogram shown to the right. A few things to note about this histogram:
It still obeys the area principle. That is, each bar has area equal to the probability of that random variable value.
If we were to add together the areas of all the bars (an infinite number of them), the sum would be 1 because the probability of the sample space is 1.
In this instance, the bars become so short that they cannot be seen after about \(Y=10\text{,}\) as the computation below shows.
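For example, the bar at \(Y=10\) has height
\begin{equation*} P(Y=10) = \left(\frac{2}{3}\right)^{9}\left(\frac{1}{3}\right) = \frac{512}{59049} \approx 0.0087\text{,} \end{equation*}
which is barely visible on the same scale as \(P(Y=1) = \frac{1}{3}\text{.}\)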
We will see more graphs for probability distributions in later sections. For now, the most important thing to remember is that the area in a bar in such a histogram is equal to the probability of the value of the random variable represented by that bar.
Checkpoint 3.1.36.
Consider the following probability histogram for the random variable \(X\text{.}\)
Question: what is the probability that \(X\) is between 2 and 5 inclusively?
\(0.6\)
Checkpoint 3.1.38.
Consider the following probability histogram for the random variable \(X\text{.}\)
Question: which of the following are not possible values of the random variable X?
2
3
4
6
8
12
3, 6, and 12
Checkpoint 3.1.40.
Consider the following probability histogram for the distribution of the random variable \(X\text{.}\)
Question: this histogram is incorrect because the bar for \(X=3\) was left off. What should the height of that bar be?
0.4
Subsection 3.1.4 Expected Value
Probability distributions are useful for more than just summarizing information about a random variable. They can also be used to compute useful information about the overall experiment. Before we introduce these computations, however, let's look at an example of information about an entire distribution that might be useful.
Example 3.1.42. Finding the Average Value of a Random Variable.
A charity sells raffle tickets for $1.00 each. Each raffle ticket allows you to draw one colored ball from an urn containing 100 balls, 95 of which are red, 4 of which are white, and one of which is gold. If you draw a red ball, you get nothing. If you draw a white ball, you win $5. Finally, if you draw the gold ball, you win $50. Construct a probability distribution for \(X\text{,}\) the amount of money you win in this game. Then determine if the charity will make money on this game.
To construct a probability distribution, we first list the possible values of \(X\text{.}\)
If we draw a red ball, we win nothing, so \(X = 0\text{.}\)
If we draw a white ball, we win $5 so \(X = 5\text{.}\)
Finally, if we are lucky enough to draw the gold ball, we win $50, so \(X=50\text{.}\)
This yields the following probability distribution.
\(x\) | \(P(X=x)\) |
\(0\) | \(\frac{95}{100}\) |
\(5\) | \(\frac{4}{100}\) |
\(50\) | \(\frac{1}{100}\) |
In order to determine if the charity will make money, we need to factor in both the payouts as well as the probability. We will do this by computing the average value of one of the 100 balls in the urn. We know that there is one ball worth $50 and there are four worth $5. All of the others are worth $0. So, the average is:
\begin{equation*} \frac{1(\$50) + 4(\$5) + 95(\$0)}{100} = \frac{\$70}{100} = \$0.70\text{.} \end{equation*}
So if we were to play this game over and over again, our average winnings would be $0.70. But we paid a dollar to play, so our average net result would be \(0.70-1 = -0.30\text{:}\) on average, we lose thirty cents each time we play. The charity does in fact make money on the game.
The average value mentioned above can be called the mean of the probability distribution, or the expected value of the random variable. It may seem that our computation above had nothing to do with the probability distribution we found, but closer examination shows that they are very closely linked. If we regroup our average computation, we get:
\begin{equation*} 0\left(\frac{95}{100}\right) + 5\left(\frac{4}{100}\right) + 50\left(\frac{1}{100}\right) = \$0.70\text{.} \end{equation*}
Do you see how the numbers involved in this grouped form can be read right off of the probability distribution?
Definition 3.1.44.
The expected value of a discrete random variable is given by:
\begin{equation*} E(X) = \sum_x x\times P(X=x)\text{,} \end{equation*}
where \(x\) is summed over all values of the random variable.
As shown above, the expected value of \(X\) is really a weighted average. Each value of the random variable is weighted (i.e. multiplied) by its probability. So values that are very unlikely count far less than values that are likely. Let's see how this interaction plays out by finding the expected value of a slight variation on the game from Example 3.1.42.
Example 3.1.45. Finding the Expected Value of a Raffle Ticket.
A charity sells raffle tickets for $2.00 each. Each raffle ticket allows you to draw two colored balls from an urn containing 100 balls, 95 of which are red, 4 of which are white, and one of which is gold. For every white ball you draw, you win $5. If you draw the gold ball, you win $50. If \(Y\) is the amount of money you make on the game (after paying your $2.00), find the probability distribution for \(Y\) and use it to compute the expected value of \(Y\text{.}\)
Your first step is to construct a probability distribution by listing all of the values of \(Y\) and determining their probabilities.
If we draw two red balls, then \(Y = 0 + 0 - 2 = -2\text{.}\)
If we draw one red and one white, then \(Y = 0 + 5 - 2 = 3\text{.}\)
If we draw two white, then \(Y = 5 + 5 - 2 = 8\text{.}\)
If we draw a red and a gold, then \(Y = 0 + 50 - 2 = 48\text{.}\)
And finally, a white and a gold gives \(Y = 5 + 50 - 2 = 53\text{.}\)
We can then use combinations to compute the probability of each of these values. For example, the probability of winning $3 by drawing a red and a white ball is
\begin{equation*} \frac{C(95,1)\times C(4,1)}{C(100,2)} = \frac{380}{4950} \approx 0.0768 \end{equation*}
because we draw 1 of the 95 red balls and 1 of the 4 white balls in an experiment which involves drawing 2 out of 100 possible balls. The entire probability distribution is shown below.
\(y\) | \(P(Y=y)\) |
\(-2\) | \(\frac{C(95,2)}{C(100,2)} = \frac{4465}{4950} \approx 0.9020\) |
\(3\) | \(\frac{C(95,1)\times C(4,1)}{C(100,2)} = \frac{95\times 4}{4950} \approx 0.0768\) |
\(8\) | \(\frac{C(4,2)}{C(100,2)} = \frac{6}{4950} \approx 0.0012\) |
\(48\) | \(\frac{C(95,1)\times C(1,1)}{C(100,2)} = \frac{95\times 1}{4950} \approx 0.0192\) |
\(53\) | \(\frac{C(4,1)\times C(1,1)}{C(100,2)} = \frac{4 \times 1}{4950} \approx 0.0008\) |
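Notice that these five probabilities account for every possible pair of balls, since
\begin{equation*} \frac{4465 + 380 + 6 + 95 + 4}{4950} = \frac{4950}{4950} = 1\text{,} \end{equation*}
in agreement with the requirement that the probabilities in a distribution sum to one.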
Now to compute the expected value, we must multiply each value of \(Y\) by its probability and then add those together. One easy way to keep track of this is to add a third column to our probability distribution table, into which we place these products. The sum of that third column will be our expected value.
\(y\) | \(P(Y=y)\) | \(y \times P(Y=y)\) |
\(-2\) | \(0.9020\) | \(-2(0.9020) = -1.8040\) |
\(3\) | \(0.0768\) | \(3(0.0768) = 0.2303\) |
\(8\) | \(0.0012\) | \(8(0.0012) = 0.0097\) |
\(48\) | \(0.0192\) | \(48(0.0192) = 0.9212\) |
\(53\) | \(0.0008\) | \(53(0.0008) = 0.0428\) |
\(E(Y) = \) | -$0.60 |
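(The products in the third column were computed from the exact fractions before rounding, which is why, for example, \(3(0.0768)\) appears as \(0.2303\) rather than \(0.2304\text{.}\)) Working with the exact fractions confirms the total:
\begin{equation*} E(Y) = \frac{-2(4465) + 3(380) + 8(6) + 48(95) + 53(4)}{4950} = \frac{-2970}{4950} = -\$0.60\text{.} \end{equation*}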
Since the expected value of \(Y\) is negative, the average outcome will result in the player losing money on the game. That means that, in the long run, the charity will still make money.
Before we move on to the next computation, let's observe a few important things about the expected value.
Observation 3.1.48.
First, the expected value does not need to be one of the possible values of \(Y\text{.}\) In the example above, there is no outcome in this random process that leads to us losing exactly sixty cents. However, the expected value of -$0.60 tells us that on average, we will lose sixty cents on each game we play. This is similar to saying that the average American family has 2.2 children. No family actually has 2.2 children, but if you average it out, this is the number you come up with.
Next, note that the expected value is not a guarantee. The charity could sell 100 tickets, making $200, and then have four lucky people pull a red and the gold ball and win $50 each. This wipes out their profits altogether. That is not a likely scenario, but it could happen. All the expected value tells us is that, on average, the charity makes $0.60 per ticket (because the player loses that much), so they can expect about $60.00 in profit on 100 tickets sold.
Finally, notice how the probability and the value of \(Y\) both play an important role in computing its expected value. If a value of \(Y\) is very large, but has a very low probability, it will contribute little to the expected value. A smaller value with a large probability may contribute a lot more. Both the value and its probability need to be considered when computing expected value.
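To see this weighting at work, here is a quick comparison using made-up numbers: a large but unlikely value can contribute less to the expected value than a small but likely one.
\begin{equation*} \$1000 \times 0.001 = \$1.00 \qquad \text{versus} \qquad \$2 \times 0.90 = \$1.80\text{.} \end{equation*}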
Checkpoint 3.1.51.
Use the probability distribution of the random variable \(Z\) to answer the following question.
\(z\) | \(P(Z=z)\) |
\(2\) | \(0.500\) |
\(4\) | \(0.250\) |
\(6\) | \(0.125\) |
\(8\) | \(0.125\) |
Question: what is the expected value of \(Z\text{?}\)
3.75
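This comes from applying the expected value formula directly:
\begin{equation*} E(Z) = 2(0.500) + 4(0.250) + 6(0.125) + 8(0.125) = 1 + 1 + 0.75 + 1 = 3.75\text{.} \end{equation*}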
Checkpoint 3.1.53.
A game consists of rolling a single fair die and noting the number that comes up. If the number is even, you must pay $2.00. If the number is a 1 or a 3, you win $1.00. Finally, if the number is a 5 you win $5.00. Let the random variable \(X\) represent the net value of the game (it could be negative if you lose).
Question: what is the expected value of \(X\text{?}\) Round your answer to two decimal places.
0.17
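Since \(P(X=-2)=\frac{1}{2}\text{,}\) \(P(X=1)=\frac{1}{3}\text{,}\) and \(P(X=5)=\frac{1}{6}\text{,}\) the expected value is
\begin{equation*} E(X) = -2\left(\frac{1}{2}\right) + 1\left(\frac{1}{3}\right) + 5\left(\frac{1}{6}\right) = \frac{1}{6} \approx 0.17\text{.} \end{equation*}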
Checkpoint 3.1.54.
A coin jar contains 6 quarters, 8 dimes, and 10 pennies. You reach into the jar and pull out two coins without replacement. Let the random variable \(Y\) be the amount of money you get.
Question: what is the expected value of \(Y\text{?}\) Round your answer to two decimal places.
0.20
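One way to verify this is to list the six possible pairs of coins, with their values in cents, and count pairs with combinations out of \(C(24,2) = 276\) total pairs: two quarters (50 cents), a quarter and a dime (35), a quarter and a penny (26), two dimes (20), a dime and a penny (11), and two pennies (2). Then
\begin{align*} E(Y) \amp= \frac{50\,C(6,2) + 35(6)(8) + 26(6)(10) + 20\,C(8,2) + 11(8)(10) + 2\,C(10,2)}{C(24,2)}\\ \amp= \frac{750 + 1680 + 1560 + 560 + 880 + 90}{276} = \frac{5520}{276} = 20 \text{ cents} = \$0.20\text{.} \end{align*}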
Subsection 3.1.5 Standard Deviation
As was the case with the mean of an ordinary variable, the expected value of a random variable gives us only part of the picture. In addition to knowing what the typical value of the variable will be, we often want to know how spread out the values of the variable are. That is, we want to know the standard deviation of the random variable. To get to the standard deviation, we must first compute the variance.
Definition 3.1.55.
The variance of a discrete random variable \(X\) is:
\begin{equation*} \sigma^2 = \sum_x (x-\mu)^2\times P(X=x)\text{,} \end{equation*}
where \(\mu = E(X)\) is the mean, or expected value, of the random variable.
Note that in this formula we are adding together the squares of the differences between each value of \(X\) and the mean of \(X\text{.}\) This definition is thus similar to the definition of the variance of a sample or population seen in Subsection 1.3.4. One big difference is that instead of dividing by \(n\) or \(n-1\text{,}\) we multiply each term by the probability that \(X\) will have that value. So in a sense, this is the weighted variance of \(X\) where each value is weighted by its probability. From the variance, it is relatively easy to compute the standard deviation.
Definition 3.1.56.
The standard deviation of a discrete random variable \(X\) is \(\sigma = \sqrt{\sigma^2}\text{.}\)
To see this in action, let's consider the following example.
Example 3.1.57. Finding the Standard Deviation of a Random Variable.
Two fair dice are rolled and the numbers noted. A random variable \(X\) is defined to be the sum of those two numbers.
Find the probability distribution and probability histogram for \(X\text{.}\) What do you observe about the shape of the distribution?
Use the empirical rule to determine the range into which you would expect the value of \(X\) to fall 95% of the time.
Our first step is to create a probability distribution for \(X\text{,}\) which is done below. Note that the probabilities are found by counting the number of outcomes that give each sum. For example, \(X=4\) is the set \(\lbrace (1,3), (2,2), (3,1) \rbrace\text{,}\) so \(P(X=4) = \frac{3}{36}\text{.}\)
\(x\) | \(P(X=x)\) |
\(2\) | \(\frac{1}{36}\) |
\(3\) | \(\frac{2}{36}\) |
\(4\) | \(\frac{3}{36}\) |
\(5\) | \(\frac{4}{36}\) |
\(6\) | \(\frac{5}{36}\) |
\(7\) | \(\frac{6}{36}\) |
\(8\) | \(\frac{5}{36}\) |
\(9\) | \(\frac{4}{36}\) |
\(10\) | \(\frac{3}{36}\) |
\(11\) | \(\frac{2}{36}\) |
\(12\) | \(\frac{1}{36}\) |
Table 3.1.58. Distribution for \(X\)
Figure 3.1.59. Probability Histogram
We note that the histogram is symmetric about a single mode. In other words, it is mound shaped.
Next we need to compute the mean (expected value) and standard deviation. We can then use the empirical rule on this mound shaped distribution to determine the range of 95% of the values of \(X\text{.}\)
\(x\) | \(P(X=x)\) | \(x\times P(X=x)\) | \((x-\mu)^2\times P(X=x)\) |
\(2\) | \(\frac{1}{36} \approx 0.0278\) | \(2(0.0278) = 0.0556\) | \((2-7)^2(0.0278) = 0.6950\) |
\(3\) | \(\frac{2}{36} \approx 0.0556\) | \(3(0.0556) = 0.1668\) | \((3-7)^2(0.0556) = 0.8896\) |
\(4\) | \(\frac{3}{36} \approx 0.0833\) | \(4(0.0833) = 0.3332\) | \((4-7)^2(0.0833) = 0.7497\) |
\(5\) | \(\frac{4}{36} \approx 0.1111\) | \(5(0.1111) = 0.5555\) | \((5-7)^2(0.1111) = 0.4444\) |
\(6\) | \(\frac{5}{36} \approx 0.1389\) | \(6(0.1389) = 0.8334\) | \((6-7)^2(0.1389) = 0.1389\) |
\(7\) | \(\frac{6}{36} \approx 0.1667\) | \(7(0.1667) = 1.1669\) | \((7-7)^2(0.1667) = 0.0000\) |
\(8\) | \(\frac{5}{36} \approx 0.1389\) | \(8(0.1389) = 1.1112\) | \((8-7)^2(0.1389) = 0.1389\) |
\(9\) | \(\frac{4}{36} \approx 0.1111\) | \(9(0.1111) = 0.9999\) | \((9-7)^2(0.1111) = 0.4444\) |
\(10\) | \(\frac{3}{36} \approx 0.0833\) | \(10(0.0833) = 0.8330\) | \((10-7)^2(0.0833) = 0.7497\) |
\(11\) | \(\frac{2}{36} \approx 0.0556\) | \(11(0.0556) = 0.6116\) | \((11-7)^2(0.0556) = 0.8896\) |
\(12\) | \(\frac{1}{36} \approx 0.0278\) | \(12(0.0278) = 0.3336\) | \((12-7)^2(0.0278) = 0.6950\) |
 | | \(\mu = E(X) = 7\) | \(\sigma^2 = 5.8352\) |
Table 3.1.60. Expected Value and Variance of \(X\)
So the mean is \(\mu = 7\) and the standard deviation is \(\sigma = \sqrt{5.8352} \approx 2.4156\text{.}\) According to the empirical rule, 95% of the data should lie within two standard deviations of the mean in a mound shaped distribution. That is:
\begin{align*} 7 - 2(2.4156) \amp\lt \text{ 95% of data } \lt 7 + 2(2.4156)\\ 2.1688 \amp\lt \text{ 95% of data } \lt 11.8312\text{.} \end{align*}
Since the variable is discrete, we round to the integer values 3 through 11. We expect the sum to be between 3 and 11 about 95% of the time.
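We can check this claim directly from the distribution: the only possible values outside this range are 2 and 12, so
\begin{equation*} P(3 \le X \le 11) = 1 - \frac{1}{36} - \frac{1}{36} = \frac{34}{36} \approx 0.944\text{,} \end{equation*}
which is indeed very close to the 95% predicted by the empirical rule.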
Checkpoint 3.1.63.
Consider the following probability distribution of the random variable \(X\text{.}\)
\(x\) | \(P(X=x)\) |
\(-1\) | \(0.3\) |
\(5\) | \(0.6\) |
\(17\) | \(0.1\) |
Question: what is the standard deviation of \(X\text{?}\) Round your answer to two decimal places.
4.98
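Here the mean is \(\mu = -1(0.3) + 5(0.6) + 17(0.1) = 4.4\text{,}\) so the variance is
\begin{align*} \sigma^2 \amp= (-1-4.4)^2(0.3) + (5-4.4)^2(0.6) + (17-4.4)^2(0.1)\\ \amp= 8.748 + 0.216 + 15.876 = 24.84\text{,} \end{align*}
and the standard deviation is \(\sigma = \sqrt{24.84} \approx 4.98\text{.}\)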
Checkpoint 3.1.65.
A game consists of rolling a single fair die and noting the number that comes up. If the number is even, you must pay $2.00. If the number is a 1 or a 3, you win $1.00. Finally, if the number is a 5 you win $5.00. Let the random variable \(Y\) represent the net value of the game (it could be negative if you lose).
Question: what is the standard deviation of \(Y\text{?}\) Round your answer to two decimal places.
2.54
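This is the same game as in Checkpoint 3.1.53, so \(\mu = \frac{1}{6} \approx 0.17\text{.}\) Then
\begin{align*} \sigma^2 \amp= \left(-2-\tfrac{1}{6}\right)^2\left(\tfrac{1}{2}\right) + \left(1-\tfrac{1}{6}\right)^2\left(\tfrac{1}{3}\right) + \left(5-\tfrac{1}{6}\right)^2\left(\tfrac{1}{6}\right)\\ \amp\approx 2.347 + 0.231 + 3.894 = 6.472\text{,} \end{align*}
so \(\sigma \approx \sqrt{6.472} \approx 2.54\text{.}\)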
Checkpoint 3.1.66.
Suppose that you are given a probability distribution for a random variable \(X\) and asked to compute the standard deviation for \(X\text{.}\) Below are several steps you might perform in this computation.
Add a column for \(x\times P(X=x)\)
Add a column for \((x-\mu)^2\times P(X=x)\)
Divide by \(n-1\)
Take the square root of the variance
Question: which of the following is not a step in computing the standard deviation of \(X\text{?}\)
Divide by \(n-1\text{.}\)