Randomness and Simulation

Subsection 2.1.1 Random vs. Pseudo-random Numbers

We have already seen the term “random sample” in this course, and we will see it a lot more as we continue. It is important, therefore, that we understand what randomness means.

Example 2.1.1. Identifying Random Processes.

Which of the following processes is random?

a person thinks of a number between one and ten
a student randomly fills in bubbles on a standardized test sheet
a computer program randomly assigns the winning lottery numbers

Solution

Surprisingly, none of these is a truly random process.

To be truly random, a process must have no predictability: it must show no preference toward any outcome. A person choosing a number is likely to have a “favorite” number or to be influenced by something they just saw or heard. A student filling in bubbles is likely to make a design, or even to “try to be random” by spreading the bubbles out evenly, which is not in fact random. Even a computer program produces its “random numbers” using a predictable algorithm. The computer program is an example of the following.

Definition 2.1.2.

A pseudo-random process is one that appears to be random, but which, when repeated with the same initial inputs, will always produce the same results.
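To make this definition concrete, here is a minimal sketch using Python's standard `random` module. The helper name `first_draws` and the seed value 2024 are our own choices for illustration; any seed would show the same behavior.

```python
import random

def first_draws(seed, count=5):
    """Return `count` integers from 1 to 10 drawn by a generator started at `seed`."""
    rng = random.Random(seed)  # a separate generator, so global state is untouched
    return [rng.randint(1, 10) for _ in range(count)]

# The seed 2024 is arbitrary and used only for illustration.
run_a = first_draws(2024)
run_b = first_draws(2024)

print(run_a)
print(run_b)
print(run_a == run_b)  # True: same initial input, same results
```

The two lists look random, yet they are identical, because the generator was started from the same initial input both times.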

Where then can we get a reliable source of random information? This can actually be a philosophical question. Is anything in the universe truly random, or is everything deterministic, meaning that if we know the initial conditions, we can predict exactly what will happen? In this class, we assume that the physical phenomena we observe are, if not random, so complex that they might as well be random. We can then gather random numbers from sources such as:

the time between decays of radioactive material, or
the time between observations of cosmic rays, or
wind gust speeds and directions.

None of these are terribly practical for us, so instead we use either a pseudo-random number generator on a computer, or a random number table which records digits based on processes similar to those mentioned above.
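As a rough illustration of the first option, the sketch below uses Python's built-in pseudo-random number generator to choose labels without repeats. The function name `draw_sample`, the population size of 100, the sample size of 6, and the seed are all assumptions made for this example.

```python
import random

def draw_sample(population_size, sample_size, seed=None):
    """Choose `sample_size` distinct labels from 0 to population_size - 1."""
    rng = random.Random(seed)  # seed=None lets Python pick its own starting point
    # sample() never returns the same label twice
    return rng.sample(range(population_size), sample_size)

# Fixing the seed (here 1, chosen arbitrarily) makes the sample repeatable.
sample = draw_sample(100, 6, seed=1)
print(sample)
```

Leaving out the seed gives a different sample on each run, which is usually what we want when actually collecting data. Fixing it is useful when someone else needs to reproduce our work.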

Definition 2.1.3.

A random number table is a list of digits recorded based on some random process. For example,

2217726304387410092537086270581997622725849795907032825001108963
3217535822643800292254644943760642389043766557204107354186024508
8906427308645681412198226653885873285801699027843110380420067664
8740522639824530519902027044464984322000946238678577902639002954
8887003319933147508331265192321413908608671496383528968974910533
4943760642389043766557204107354186024508432200094623867858226440

To use a random number table to help us generate a string of random numbers, we first “randomly” select a starting point in the table, and then use the digits that follow.

Example 2.1.4. Using a Random Number Table.

You wish to randomly pick a sample of 6 people from a group of 100 people. Use the random number table provided above to do this.

Solution

We will assign each person in our group of 100 a two-digit number from 00 to 99. This means we will take groups of two digits from the table above, skipping over any repeating numbers since we don't want to pick the same “person” twice.

In order to select our six pairs of digits, we must first pick a starting point. We'll do this by rolling a six-sided die (since there are six rows in the table). Let's say this comes up with the number 3. Then we will start at the beginning of the third row in the table. The first six two-digit numbers from that row are \(89, 06, 42, 73, 08,\) and \(64\text{.}\)

Each of these numbers represents one of our people, and there are no repeats. So the six people we will use in our sample are those assigned numbers 89, 6, 42, 73, 8, and 64.
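The same procedure can be sketched in Python. This is only one way to write it: the names `TABLE` and `pick_from_table` are ours, the table rows are copied from Definition 2.1.3, and the die roll of 3 is fixed to match the example rather than simulated.

```python
# Rows of the random number table from Definition 2.1.3.
TABLE = [
    "2217726304387410092537086270581997622725849795907032825001108963",
    "3217535822643800292254644943760642389043766557204107354186024508",
    "8906427308645681412198226653885873285801699027843110380420067664",
    "8740522639824530519902027044464984322000946238678577902639002954",
    "8887003319933147508331265192321413908608671496383528968974910533",
    "4943760642389043766557204107354186024508432200094623867858226440",
]

def pick_from_table(digits, sample_size, width=2):
    """Read `digits` in groups of `width`, skipping repeats, until the sample is full."""
    chosen = []
    for start in range(0, len(digits) - width + 1, width):
        number = int(digits[start:start + width])
        if number not in chosen:  # do not pick the same person twice
            chosen.append(number)
        if len(chosen) == sample_size:
            break
    return chosen

row = 3  # the die roll assumed in the example
sample = pick_from_table(TABLE[row - 1], 6)
print(sample)  # [89, 6, 42, 73, 8, 64]
```

Note that the code reads only along a single row. If a row ran out before the sample was full, we would need to decide in advance where to continue, for example at the start of the next row.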

Figure 2.1.5. Using Random Number Tables I

Figure 2.1.6. Using Random Number Tables II

Checkpoint 2.1.7.

You wish to collect a sample of 10 individuals from a population of 100. To do this, you assign numbers 0-99 to these individuals, and then use the random number table below, starting with the first entry and taking two digits at a time to select your sample.

52557 13440 30790 31858 28653 38267 09427 95946 09832 68174
93146 91673 22649 29722 35062 19040 67106 96350 82060 51489
16645 21177 60697 15577 24381 51084 70974 11304 37199 12631

Question: What are the numbers of the individuals who will be included in your sample?

Starting Point	Outcomes	Response Variable
row 1, column 1	2217726304 \(\Rightarrow\) bbct	4 fish caught
row 2, column 11	6438002922 \(\Rightarrow\) tbbtc	5 fish caught
row 3, column 21	982266538858732 \(\Rightarrow\) *tbbtttbtttttbc	14 fish caught
row 4, column 31	4984322000 \(\Rightarrow\) b*tbbbbc	7 fish caught
row 5, column 41	6714963835 \(\Rightarrow\) ttcb	4 fish caught
row 6, column 51	23867858226440 \(\Rightarrow\) bbttttttbbtbbc	14 fish caught
row 1, column 21	3708627058 \(\Rightarrow\) btc	3 fish caught
row 2, column 31	0642389043 \(\Rightarrow\) ctb	3 fish caught
row 3, column 41	6990278431 \(\Rightarrow\) t**ccttb	6 fish caught
row 4, column 51	7790263900 \(\Rightarrow\) tt*cb	4 fish caught
