Practice: Summarizing Numerical Data
1) Which of the following is most affected by an outlier in a dataset?
- Mean
- Median
- Mode
2) The median is the best measure of center for which type of data?
- Symmetrical and normally distributed
- Skewed and containing outliers
- Categorical and nominal
- Discrete and uniform
3) In a dataset, if every value is increased by 10, what happens to the mean and median?
- Both remain the same.
- Both increase by 10.
- The mean increases by 10, but the median remains the same.
- The mean remains the same, but the median increases by 10.
4) Which measure of center is best to describe the typical hair color of students in a class?
- Mean
- Median
- Mode
- All of the above
5) Which of the following statements about the mode is true?
- The mode is highly affected by outliers.
- The mode is the only measure of center that can be used for nominal data.
- Every dataset has exactly one mode.
- The mode is always a unique number in the dataset.
6) Mean vs Median
A college statistics class conducted a survey of how students spend their money. They asked 25 students to estimate how much money they typically spend each week on fast food. They determined that the mean amount spent on fast food each week is $31.52 and the median is $32. Later they realized that a value entered as $2 should have been $20. They recalculate the mean and the median.
Which of the following is true?
- The mean and median will increase.
- The mean will increase, but the median will remain the same.
- The mean will stay the same, but the median will increase.
- Both the mean and median will remain the same.
7) Preferring the Median
For which one of the following distributions will the median be a better measure of center than the mean?
Salary data for players in the National Basketball Association (NBA) where most of the players earn the league minimum and a few superstars earn very high salaries in comparison.
Repeated weight measurements of the same 1.6-ounce bag of Peanut M&Ms by students in a large chemistry class using a non-digital balance scale.
Height data from a large random sample of men.
Exam scores with a central peak around an average test score and a few students scoring lower than the average and a few scoring higher.
8) If the standard deviation of a dataset is small, what does this tell you about the data points?
- They are widely dispersed.
- They are clustered closely around the mean.
- The data is skewed.
- The data is symmetrical.
9) The Interquartile Range (IQR) is the difference between:
- The maximum and minimum values.
- The mean and the median.
- The first quartile (\(Q_1\)) and the third quartile (\(Q_3\)).
- The mode and the mean.
10) What is the standard deviation of the dataset {10, 10, 10, 10}?
- 10
- 1
- 0
- Undefined
11) The variance of a dataset is always:
- Greater than the standard deviation.
- Less than the standard deviation.
- Negative.
- Non-negative.
12) What does a large Interquartile Range (IQR) indicate?
- The dataset is symmetrical.
- The middle 50% of the data is widely spread out.
- The dataset has no outliers.
- The mean and median are the same.
13) Which of the following is not a measure of spread?
- Range
- Interquartile Range (IQR)
- Standard deviation
- Mean
- Part a)
- Part b)
- Part b)
- Part c)
- Part b)
- Part b)
- Part a)
- Part b)
- Part c)
- Part c)
- Part d)
- Part b)
- Part d)