Skip to main content

Section 1.3 Describing Data Numerically: Measures of Center and Spread

Introduction.

While describing data sets using graphs gives a good overview, it does not give us the detail that we need to do more advanced analysis. Graphs are meant to give a picture of the data, but in order to make decisions based on a set of data, we need to have numerical summaries.

In this section, we will look at several types of numerical summaries, or measures, which we can compute from a set of quantitative data. These measures include both:

  1. Measures of Center.

    A measure of center gives a single number that represents the "typical" value in the set of data. One very familiar measure of center is the average. When you say that the average score on an exam is 80%, you are claiming that the typical student scored 80%. This is not, however, the only way to measure the center of a data set, as we shall see in this section.

  2. Measures of Spread.

    Also called a measure of variation, this number or set of numbers indicates how spread out the data is. For example, if three students take an exam and all score 80%, the average would be 80%, but there is no variation between the scores. But if three students scored 60%, 80%, and 100%, the average would still be 80%, but now there is a large amount of variation between their scores–they are much more spread out.

Each of these values can be computed for data drawn from a sample (a small number of values from a larger population), or for a census of an entire population. We give them different names, however, depending on the source.

Definition 1.3.1.

A measurement describing some characteristic of a sample is called a statistic.

Definition 1.3.2.

A measurement describing some characteristic of a population is called a parameter.

The measures of center that we will look at often come with a related measure of variation or spread, so we will examine these together where appropriate.

Subsection 1.3.1 The Mode

We have already seen the term mode used in the context of a graph. Recall that the mode of a histogram or bar graph is the class or category that has the tallest bar. That is, it is the class or category that appears the most in the data set. This seems to be a reasonable way to describe a “typical” element of a data set and leads to the following definition.

Definition 1.3.3.

The mode of a set of values is the value that appears most often. If more than one value is tied for the most appearances, each one is a mode. If no value appears more than once, there is no mode.

The mode is unique among our measures of center as it is the only one that applies to both quantitative variables and qualitative variables. Consider the following examples.

A breakout presentation at a business convention attracts 16 participants. The job title of each of these participants is as follows. Find the mode of this set of data.

manager owner H.R. director owner
manager H.R. director manager manager
owner manager manager H.R. director
manager owner owner manager
Table 1.3.5. Job Titles
Solution

The mode is the most commonly appearing value. One way to find that is to construct a frequency table. Doing so for this data produces the following table, from which we can see that the mode is “manager”.

Value Frequency
manager 8
H.R. director 3
owner 5
Table 1.3.6. Frequency Table

A survey of twelve grocery store shoppers asked how many times a month these shoppers visit their favorite grocery store. Find the mode of the resulting data, shown below.

6 2 5 1 3 5 3 2
7 4 1 5 3 6 4 8
Table 1.3.8. Number of Visits Per Month
Solution

Again, the best starting point is to construct a quick frequency table.

Value 1 2 3 4 5 6 7 8
Frequency 2 2 3 2 3 2 1 1
Table 1.3.9. Frequency Table

Notice that in this data set, both the 5 and the 3 appear three times. Every other value appears two or fewer times. This makes this set of data bimodal with modes 3 and 5.

While the mode is an easy way to measure the center, and is in fact the only way we have to measure the center of a set of qualitative data, it is usually not the best choice for measuring the center of a set of quantitative data. To see why this may be so, consider the next example.

Find the mode of the following sets of numbers.

  1. \(\lbrace 1, 2, 5, 7, 12, 15, 19, 22, 50 \rbrace\)

  2. \(\lbrace 1, 1, 20, 23, 26, 24, 29, 30, 27, 32, 19\rbrace\)

Solution

The modes of these sets of numbers are as follows:

  1. This data set contains no repeated values. Therefore, according to the definition, there is no mode for this set.

  2. This data contains only one repeated value, the 1. So the mode is 1. However, 1 is definitely not the “typical” value in the set since all other values are between 19 and 32.

In summary, you should use the mode to measure the center of any qualitative set of data, or as a quick, but not definitive, measure of center for quantitative data. But be careful! The mode may not exist, there may be multiple modes, and if there is a single mode it may be very different from the “typical” values in the data set.

Figure 1.3.11. Finding the Mode I
Figure 1.3.12. Finding the Mode II

The following colors of cars were observed in the parking lot of the local grocery store.

blue green black white
white green blue yellow
white white blue black
black green red green
Table 1.3.14. Car Colors

Question: Which color(s) is/are the mode(s) for this set of data?

Answer

green and white

Consider the following set of data.

\begin{equation*} \lbrace 4, 6, 3, 9, 7, 4, 2, 6, 9, 3, 8, 5, 1 \rbrace \end{equation*}

Question: What is/are the mode(s) for this set of data?

Answer

4, 6, 9, and 3 are all modes

The following pets were found in 10 randomly selected homes.

cat fish bird hamster ferret
dog rabbit rat gerbil pig
Table 1.3.17. Pets

Question: What is/are the mode(s) for this data set?

Answer

there is no mode

Subsection 1.3.2 Midrange and Range

When we think of the “typical” value in a set of data, we may also think of the word middle. Our next measure of center is exactly that—the middle of the data set.

Definition 1.3.18.

The midrange of a set of data is the value in the exact center of the data. To compute the midrange, use the formula:

\begin{equation*} \text{midrange} = \frac{\text{maximum} + \text{minimum}}{2} \end{equation*}

where maximum is the maximum value in the data set and minimum is the minimum value.

Notice that the midrange does involve some computation using numerical values, so it can only be used with quantitative variables, where such computations make sense. To measure how spread out the data is from the midrange, we use the range.

Definition 1.3.19.

The range of a set of data is the difference between the highest and lowest values in the data set. That is:

\begin{equation*} \text{range} = \text{maximum} - \text{minimum}. \end{equation*}

Let's try out these new definitions with an example.

A certain pond has been well stocked with fish for the upcoming fishing season. In order to keep a fish from this pond, it must be at least 13 inches long. To try to determine if the fish in the pond are long enough to allow fishing, a fish and game warden randomly catches and releases 20 different fish, finding the following lengths, in inches. Find the range and midrange of this data.

13.2 10.8 14.6 17.2 17
18.3 6.5 13.4 16.7 11.3
12.3 9.8 9.4 11.6 12.1
8.7 12.6 13 14.3 13.1
Table 1.3.21. Fish Lengths in Inches
Solution

To find the midrange and range we need to identify the maximum and minimum values in the data set. The largest value is 18.3 and the smallest is 6.5, Thus we compute the midrange using:

\begin{equation*} \text{midrange} = \frac{18.3+6.5}{2} = 12.4 \end{equation*}

and we get the range using:

\begin{equation*} \text{range} = 18.3 - 6.5 = 11.8. \end{equation*}

It looks like the pond is not yet ready to open for fishing. The typical fish, as measured by the midrange, is only 12.4 inches long and could not be kept.

The midrange and range are useful tools for measuring the center and spread of a set of data, but there are several problems with them. The biggest problem is that both the midrange and range are extremely sensitive to outliers.

Definition 1.3.22.

An outlier in a data set is a value that is either much larger than the rest of the data or much smaller than the rest of the data.

To see how an outlier can affect the midrange and range, let's take another look at the fish-length data from the previous example.

The game warden believes that the 6.5 inch fish he caught in the sample above is an outlier. He decides to redo the computation without that fish. Find the new midrange and range.

Solution

Without the 6.5 inch outlier, the new minimum is 8.7. This gives the following values:

\begin{equation*} \text{midrange} = \frac{18.3+8.7}{2} = 13.5 \end{equation*}

and

\begin{equation*} \text{range} = 18.3 - 8.7 = 9.6. \end{equation*}

With this new computation, it looks like the fish are in fact large enough to be caught—the typical fish length, as measured by the midrange, is 13.5 inches.

Notice what a big difference the single 6.5 inch fish made! This is what we mean when we say that the midrange is sensitive to outliers. Another problem with the range and midrange is that they only use the extreme maximum and minimum values of the data set. Ideally, we would want every value in the set of data to have some part in computing the “typical” value and the “spread” of the data. For this reason, we will keep exploring different measures of center and spread.

Figure 1.3.24. Finding the Midrange and Range I
Figure 1.3.25. Finding the Midrange and Range II

Consider the following set of data.

\begin{equation*} \lbrace 4, 6, 3, 9, 7, 4, 2, 6, 9, 3, 8, 5, 1 \rbrace \end{equation*}

Question: What is the midrange for this data?

Consider the following set of data.

\begin{equation*} \lbrace 4, 6, 3, 9, 7, 4, 2, 6, 9, 3, 8, 5, 1\rbrace \end{equation*}

Question: What is the range for this data?

Consider the following set of data.

\begin{equation*} \lbrace 15, 25, 7, 19, 400, 27, 51, 32, 19, 77, 52, 15 \rbrace \end{equation*}

Question: What is the range for this data?

Answer

393

Consider the following set of data.

\begin{equation*} \lbrace 15, 25, 7, 19, 400, 27, 51, 32, 19, 77, 52, 15\rbrace \end{equation*}

Question: What is the midrange for this data?

Answer

203.5

Subsection 1.3.3 Median and Quartiles

To reinforce why we need a measure of center that is resistant to outliers, consider the following example.

The most expensive house in a certain market is a $2,500,000.00 mansion. The least expensive house is an $8,000.00 mobile home. Do you think that the midrange is an accurate measure of the “typical” home value in this market?

Solution

To answer this question, let's first compute the midrange. This is:

\begin{equation*} \text{midrange} = \frac{2500000 + 8000}{2} = 1254000. \end{equation*}

Do you think that the typical house really costs one and a quarter million dollars? Probably not! The midrange is not an appropriate measure of center for this data.

Once again, the outliers in this data set are affecting the midrange. To solve that problem, we introduce a new measure of center that is both resistant to outliers (meaning that they do not affect it very much) and considers more than just the two extreme values in the data set.

Definition 1.3.31.

The median of a data set is the middle number, or average of the two middle numbers, in the set of data once it has been arranged in increasing order.

You may have heard of the median before. It is often used as a measure of center in data sets where we expect outliers, such as housing prices. Another common set of data in which the median is used is household income. The phrase “median household income” may sound familiar. For those of us working in Washington State with Bill Gates, it is a good thing that the median is used to measure typical incomes! Let's practice computing the median.

Find the median of each set of data below.

  1. \(\lbrace 14, 17, 10, 9, 16, 19, 12, 10, 15, 13 \rbrace\)

  2. \(\lbrace 100, 215, 164, 117, 1000, 205, 178 \rbrace\)

Solution

There are two steps to computing the median, and it is important that we do step one first. That first step is to arrange the data in order. The second step is to locate the middle value. Let's try this on both of the data sets above.

  1. Our first step is to re-arrange this set of numbers in order, from smallest to largest:

    \begin{equation*} \lbrace 9, 10, 10, 12, 13, 14, 15, 16, 17, 19\rbrace \end{equation*}

    Then we find the middle number. Since there are an even number of values (ten of them), the exact middle lies half way between the two center values (5th and 6th) which are 13 and 14. So the median is \(\frac{13+14}{2} = 13.5\text{.}\)

  2. Again, we start by ordering the data:

    \begin{equation*} \lbrace 100, 117, 164, 178, 205, 215, 1000 \rbrace \end{equation*}

    As there are an odd number of values in this data set (seven of them), the median is the single value in the exact center (the 4th value), which is 178.

Notice that the 1000 in the second data set is a definite outlier. However, in computing the median, the actual value does not matter. The only thing that matters is its relationship to the rest of the data. This 1000 could be replaced by a more reasonable 220 and the median would not change. Bill Gates could make only $200,000 a year, and the median household income in Washington State would not change! This is what it means to be resistant to outliers.

To measure the variation in a set of data, we can use a slight twist on the median. Notice that the median divides the set of data into two pieces. If we take each of those pieces and divide them in half, we are dividing the data into quarters. We can then look at the range between the first quarter of the data and the third quarter of the data. This gives us a measurement similar to the range. But since we are only looking at the middle half of the data, we will not be affected by any outliers at the extremes.

Definition 1.3.33.

Any quantitative data set can be divided up into quarters using five quartiles. These quartiles are:

  • the zeroth quartile (\(Q_0\)), which is the minimum value in the data set.

  • the first quartile (\(Q_1\)), which divides the bottom quarter of the data from the rest of the data.

  • the second quartile (\(Q_2\)), which divides the bottom half of the data from the top half. This is more commonly called the median.

  • the third quartile (\(Q_3\)), which divides the bottom three-quarters of the data from the top quarter.

  • the fourth quartile (\(Q_4\)), is the maximum value in the data set.

The range between \(Q_1\) and \(Q_3\) is of particular interest and, as mentioned above, it has a special name.

Definition 1.3.34.

The Interquartile Range or IQR is the difference between the 3rd and 1st quartiles. That is,

\begin{equation*} \text{IQR} = Q_3 - Q_1. \end{equation*}

To see how this works, let's revisit the fish-length example.

A certain pond has been well stocked with fish for the upcoming fishing season. In order to keep a fish from this pond, it must be at least 13 inches long. To try to determine if the fish in the pond are long enough to allow fishing, a fish and game warden randomly catches and releases 20 different fish, finding the lengths from Example 1.3.20. Find the median and IQR for this data.

Solution

We first arrange the data in order. One way to make this process quicker is to cut and paste the data into a spreadsheet program and ask it to sort the values for you. This results in:

\begin{equation*} \scriptstyle 6.5, 8.7, 9.4, 9.8, 10.8, 11.3, 11.6, 12.1, 12.3, 12.6, 13, 13.1, 13.2, 13.4, 14.3, 14.6, 16.7, 17, 17.2, 18.3 \end{equation*}

Since there are 20 values in the data set, the median will be the average of the 10th and 11th values. That is:

\begin{equation*} \text{median} = \frac{12.6+13}{2} = 12.8. \end{equation*}

According to the median, the fish are not quite long enough!

To compute the IQR we must find \(Q_1\) and \(Q_3\text{.}\) We find \(Q_1\) by computing the median of the first half of the data set. Those first ten numbers are:

\begin{equation*} 6.5, 8.7, 9.4, 9.8, 10.8, 11.3, 11.6, 12.1, 12.3, 12.6. \end{equation*}

The median of this data, and thus the first quartile, is:

\begin{equation*} Q_1 = \frac{10.8+11.3}{2} = 11.05. \end{equation*}

We can find \(Q_3\) in the same way, but working with the top half of the data.

\begin{equation*} 13, 13.1, 13.2, 13.4, 14.3, 14.6, 16.7, 17, 17.2, 18.3. \end{equation*}

The median of the top half of the data is:

\begin{equation*} Q_3 = \frac{14.3+14.6}{2} = 14.5. \end{equation*}

Therefore, the IQR is \(IQR = 14.5-11.05 = 3.45\text{.}\) This means that the middle half of the fish fall into a range of 3.45 inches.

Figure 1.3.36. Finding Medians, Quartiles, and IQR I
Figure 1.3.37. Finding Medians, Quartiles, and IQR II

Consider the following set of data.

\begin{equation*} \lbrace 15, 25, 7, 19, 400, 27, 51, 32, 19, 77, 52, 15\rbrace \end{equation*}

Question: What is the median for this set of data?

Answer

26

Consider the following set of data.

\begin{equation*} \lbrace 15, 25, 7, 19, 400, 27, 51, 32, 19, 77, 52, 15\rbrace \end{equation*}

Question: What is the first quartile for this data?

Answer

17

Consider the following set of data.

\begin{equation*} \lbrace 4, 6, 3, 9, 7, 4, 2, 6, 9, 3, 8, 5, 1 \rbrace \end{equation*}

Question: What is the median for this data?

Consider the following set of data.

\begin{equation*} \lbrace 4, 6, 3, 9, 7, 4, 2, 6, 9, 3, 8, 5, 1\rbrace \end{equation*}

Question: What is the third quartile for this data?

Subsection 1.3.4 Mean and Standard Deviation

While the median is a good measure of center to use in the case where we know a set of data contains outliers, it does have one major disadvantage. Namely, it does not take the value of every data point into consideration. In many instances, we want each value in the data set to have weight in determining the “typical” value. To achieve this goal, we turn back to an already familiar measure of center, commonly called the average. We will, however, use a more precise name for this measure.

Definition 1.3.42.

The arithmetic mean, which we will call simply the mean, is a measure of center found by taking the sum of all values in the data set and dividing by the number of values in the data set. Symbolically,

\begin{equation*} \text{Population:}\quad\mu = \frac{\sum x_i}{n} \end{equation*}
\begin{equation*} \text{Sample:}\quad\overline{x} = \frac{\sum x_i}{n} \end{equation*}

Where the \(x_i\) represent the values in the data set, \(n\) is the number of values in the data set, and the Greek letter \(\sum\) stands for summation.

Notice that there is a different symbol used for the mean depending on whether it is the mean of an entire population, or just the mean of a sample. For a population, the Greek letter \(\mu\) (pronounced “myoo”) is used for the mean. For a sample, we use \(\overline{x}\) (read “x bar”) to represent the mean. Keep this distinction in mind, as it can be a clue as to what type of data you are dealing with on a homework or exam problem.

Another thing to note is that the name arithmetic mean is important. First, we don't call this the average, even though most people think of it as the average. The reason is simply that the word average means “typical” and we have seen several ways to measure the “typical” value in a data set. Next, we specify that this is the arithmetic mean because there are actually different types of means. We could, for example, use the geometric mean, which is given by the formula:

\begin{equation*} \sqrt[n]{x_1 \times x_2 \times \cdots \times x_n} \end{equation*}

For many data sets the geometric mean and arithmetic mean are very close. But you wouldn't want to have your homework average computed using the geometric mean. To understand why not, think about what would happen if only one of your assignments was not turned in! In this text we will always use the arithmetic mean. For that reason, we can simplify things by stating up front that we mean the arithmetic mean when we say mean.

The measure of spread most commonly used with the mean is called the variance. This can be thought of as the average (or mean) distance that each data point lies away from the mean. If we compute the average, however, we would get zero because of the +/- signs. So we square these differences, resulting in the following definition.

Definition 1.3.43.

The variance measures the average square of the distance of a value from the mean in a set of data. Symbolically,

\begin{equation*} \text{Population:}\quad\sigma^2 = \frac{\sum (x_i - \mu)^2}{n} \end{equation*}
\begin{equation*} \text{Sample:}\quad s^2 = \frac{\sum (x_i -\overline{x})^2}{n-1} \end{equation*}

Again we use a different symbol to represent the variance of a population, the Greek letter \(\sigma^2\) (pronounced “sig-muh squared”), and a sample, the Latin letter \(s^2\) (pronounced “s squared”). But the variance of a population also differs in a more substantial way from that of a sample. Notice that in a population, we divide the sum of the differences squared by \(n\text{,}\) the number of values in the population. But in a sample, we divide by \(n-1\text{.}\) This is an important distinction to remember. The reason for it is, unfortunately, beyond the scope of the course, but involves the fact that in a sample there is less variation than in the entire population.

One problem with using the variance as a measure of spread is that the units are different from the units of the data set. For example, if we say that the mean number of children in a US household is 2.2 with a variance of 0.4, then the mean is in “children”. That is, there are an average of 2.2 children in US households. The variance, however, has units “square children.” What does it mean to say that the number of children varies by 0.4 square children? To remedy that, we often take the square root of the variance to get a measure of spread that is in the same units as the mean.

Definition 1.3.44.

The standard deviation of a set of data is the square root of the variance. Symbolically,

\begin{equation*} \text{Population:}\quad\sigma = \sqrt{\frac{\sum(x_i - \mu)^2}{n}} \end{equation*}
\begin{equation*} \text{Sample:}\quad s = \sqrt{\frac{\sum (x_i - \overline{x})^2}{n-1}} \end{equation*}

Let's test out these new formulas on an example.

Find the mean and standard deviation of sample data \(\lbrace 4, 6, 3, 8, 4\rbrace\text{.}\)

Solution

We will do this in three steps.

  1. We first compute the mean as follows.

    \begin{equation*} \overline{x} = \frac{4 + 6 + 3 + 8 + 4}{5} = \frac{25}{5} = 5. \end{equation*}
  2. Next, we compute the variance. Using a table can make this computation easier.

    \(x_i\) \((x_i - \overline{x})\) \((x_i - \overline{x})^2\)
    \(4\) \((4 - 5) = -1\) \((-1)^2 = 1\)
    \(6\) \((6 - 5) = 1\) \((1)^2 = 1\)
    \(3\) \((3 - 5) = -2\) \((-2)^2 = 4\)
    \(8\) \((8 - 5) = 3\) \((3)^2 = 9\)
    \(4\) \((4 - 5) = -1\) \((-1)^2 = 1\)
    \(s^2 =\) \(\frac{16}{5-1} = 4\)
    Table 1.3.46.
  3. Finally, the standard deviation is \(s = \sqrt{4} = 2\text{.}\)

The number for this example turned out very nice. This is not always the case. Usually the standard deviation will involve decimals. In general, you should not round your answers during any intermediate steps, and then round your standard deviation to one decimal more than the number of decimals in the original data set. One quick way to estimate the standard deviation is based on the range of the data.

In the above example, our estimate of \(\frac{8-3}{4} = 1.25\) is not terribly good, but this can be a quick and easy way to approximate the standard deviation if all you have are the minimum and maximum values of your population data.

Figure 1.3.48. Finding Means and Standard Deviations I
Figure 1.3.49. Finding Means and Standard Deviations II

A population of long tailed cats has the following tail-lengths, in inches.

\begin{equation*} \lbrace 22, 19, 28, 23, 31\rbrace \end{equation*}

Question: Find the standard deviation for this population data. Round your final answer to one decimal place.

Answer

4.3

A sample of newly manufactured widgets showed the following number of manufacturing defaults.

\begin{equation*} \lbrace 2, 3, 0, 1, 0\rbrace \end{equation*}

Question: Find the standard deviation for this sample data. Round your final answer to one decimal place.

Answer

1.3

A sample of driving speeds from five randomly selected cars yielded the following values, in miles per hour.

\begin{equation*} \lbrace 68, 45, 71, 32, 55\rbrace \end{equation*}

Question: Find the standard deviation for this sample data. Round your final answer to one decimal place.

Answer

16.2

Subsection 1.3.5 Selecting Appropriate Summaries

We have now seen four different sets of numerical summaries:

But how do we know when to use a particular measure of center? To better answer this question, let's review the strengths and weaknesses of each of these numerical summaries in tabular format.

Qualitative Quantitative Sensitive Considers
Data Data to Outliers every Value
Mode sometimes sometimes no no
Midrange and Range no yes yes no
Median and IQR no yes no no
Mean and Standard Deviation no yes somewhat yes
Table 1.3.53. Comparison of Measures of Center and Spread

As we can see, each method has its own strengths and weaknesses. Based on these, we can summarize when to use which measures of center and spread, as outlined in the observation below.

Observation 1.3.54.
  • Mode.

    Use the mode when dealing with qualitative data, or for a quick measure of center for small sets of quantitative data. The mode may not exist, and should not be used in general for larger sets of quantitative data.

  • Midrange and Range.

    The midrange and range may be used for a quick estimate of center and variation, but should not be used if there are any indications of outliers.

  • Median and Inter-Quartile Range.

    Use the median and inter-quartile range if there are indications of strong outliers in your data. Data sets which with strong outliers would include income or housing price data.

  • Mean and Standard Deviation.

    The mean and standard deviation are the best all-purpose measures of center and variation for quantitative data. Unless your data set has strong outliers, these are the summary values you should use.

Determine which measure of center would be best in each of the following data sets.

  1. The salaries of all employee's at IBM.

  2. A quick assessment of how well a group of athletes did in a 100-meter dash.

  3. Determine the typical hair color in a group of teenagers.

  4. Finding the typical height of a pine tree in a given acre of forest.

Solution
  1. Salaries of all employees at a large firm such as IBM are likely to include outliers, such as the CEO's salary. For this reason, the median is probably the best measure of center.

  2. The midrange may be best. This set of data should not include many outliers, and the values will be small and relatively close together. This makes the midrange an acceptable way to quickly measure the center.

  3. Hair color is a qualitative variable, so our only choice is to use the mode.

  4. Pine tree heights on an acre of forest land should not have large outliers, and there are likely to be quite a few trees. Since there is no reason to use any of the other measures of center, the mean would be best here.

Often it is helpful to compute several summaries. One useful comparison relates the mean and median together. When the mean is larger than the median, the data will have a histogram that is skewed right. When the mean is smaller, the histogram will be skewed left. Finally, if the mean and median are equal, the histogram will be exactly symmetric.

Figure 1.3.56. Using Appropriate Measures I
Figure 1.3.57. Using Appropriate Measures II

A teacher wishes to summarize the score of a “typical” student on the most recent exam. The teacher is aware that one very gifted student received a perfect score, but that all other students struggled on the exam.

Question: Which measure of center should the teacher use to summarize the score of a typical student?

Answer

median

A basketball coach wishes to summarize the height of a group of students at basketball camp. Most of the students are in the same height range, with no unusually short or tall students whose heights would be considered outliers.

Question: Which measure of center should the coach use?

Answer

mean

A travel agent wishes to summarize the most common hotel choice in Walla Walla. She collects data from her most recent 250 clients, recording the hotel in which each client stayed.

Question: What measure of center should the travel agent use to summarize this data?

Answer

mode