HW 02

Class: STAT-211


Notes:

1.

There are four different data sets. Match them to the correct boxplot and histogram (both of them need to be matched). And each data set matches exactly one of the four boxplots and histograms.

Hint: You may try to match boxplot first, then try to match histogram.

Data Set 1:
Mean: 52.32
Median: 53
Std Dev: 38.297
Data Set 2:
Mean: 67.8
Median: 74
Std Dev: 17.049
Data Set 3:
Mean: 49.4
Median: 50
Std Dev: 20.265
Data Set 4:
Mean: 47.88
Median: 41
Std Dev: 19.020
A
boxplotcoll3B
boxplotcoll1
C
boxplotcoll2
D
boxplotcoll4
I
Histogramcoll2II
Histogramcoll1
III
Histogramcoll3
IV
Histogramcoll4
Data Set Boxplot Histogram
Data Set 1 B IV
Data Set 2 C I
Data Set 3 A III
Data Set 4 D II

2.

Phytopigments are a marker of the amount of organic matter that settles in sediments. Phytopigment concentrations in deep-sea sediments collected worldwide showed a very strong right-skew. Of these two summary statistics, 0.01 and 0.017 grams of phytopigment per square meter of bottom surface, which one is the mean and which one is the median and why?

3.

Provided below are a histogram and the five number summary for salaries (in $) for a random sample of U.S. marketing managers.
fig10

Minimum Q1 Median Mean Q3 Maximum
46360 69693 77020 80183 91750 129420

The IQR for these data is

Process:

4.

A sample of students from Texas A&M University were asked how many hours they slept on Wednesday night, to the nearest half. The results are as follows:
7, 9.5, 8, 9, 5, 3.5, 10.5, 7, 10, 12

We asked the same ten college students how much sleep they got on Friday night, the results look like this:
9, 7, 4, 9, 3.5, 7, 9, 4.5, 9, 19.5

What could you say about the difference in sleep for Friday night verses Wednesday night?

HW02-4.png|600

5.

This question is about reading the mathematical formulas. Try doing the computations using the formulas for this small data set: 11, 20, 5, 17, 31. Parts (a)--(i) below, guide you through this step-by-step.

(a) number of observations, n = 5

(b) sum of the observations, xi = 84

(c) mean, = xi/n = 84/5 = 16.8

(d) x1x¯ = -5.8

(e) (x1x¯)2 = 33.64

(f) (xix¯) = 0

(g) (xix¯)2 = 384.2

(h) variance, s2 = 96.2

(i) standard deviation, s = 9.81

6.

Identify the correct match of the Upper, Middle and Lower boxplots with the estimated densities 1, 2 and 3.

Boxplots test exam 2
Densities test exam 2

7.

I asked four of my coworkers how many pushups they could do. After a brief contest the results were:
12
32
72
96

Find the standard deviation of these numbers.
Use two decimals

s = 38

8.

Consider the following dataset:

13, 9, 18, 10, 7, 11, 8, 21, 12, 13, 8, 10, 12, 14, 14, 11, 12, 16, 15, 17, 15, 14, 18, 19, 20, 21, 23, 25, 24, 16, 18, 19.

(a) Compute the median for the above data. 14.5

(b) Compute the first quartile for the above data. 11.75

(c) Compute the third quartile for the above data. 18.25

(d) Find the interquartile range (IQR) for the above data. 6.5

9.

The CNN article found here ​says, "A proven way to ease anxiety naturally is with a bout of​ cardio." A friend of yours commented that she tried running the morning of tough exams but experienced no difference in her anxiety​ levels, so clearly, the study must be a fraud. Which of the following statements is​ correct?

10.

The boxplot below shows the amount of time it takes a student to run a mile. A summary of that data is one of the options below. Mark which data set summary matches the boxplot.

Pasted image 20250907194543.png|350

Mean=5.5
Std Dev=5
Median=6.5

Mean=5.5
Std Dev=8
Median=6.5

Mean=6
Std Dev=1
Median=5.5

Mean=7.5
Std Dev=6
Median=7.5

Mean=5.5
Std Dev=4
Median=5.5

Mean=5
Std Dev=7
Median=9

11.

Below is plotted a histogram of the lengths of 44 sharks (including some not yet fully mature). Lengths are in feet, and no shark was measured to be exactly an integer number of inches: none are on the border. Which of the following is true?

Pasted image 20250907194813.png|350

12.

Based on the following plot, how would you describe the data?

Pasted image 20250907195107.png|350

13.

Which of the following data sets should have the largest standard deviation?

14.

There are two different boxplots.

a) Which of the following terms best describes the shape of the boxplot?
fig9

b) Which of the following terms best describes the shape of the boxplot?
fig8

15.

Sixteen students were asked how many electronic devices they had in their home.
8, 17, 32, 57, 10, 6, 41, 50, 17, 27, 56, 17, 16, 7, 53, 10

Calculate the five number summary
a) Minimum: 6

b) Q1: 10

c) Median: 17

d) Q3: 45.5

e) Maximum: 57

f) What is the IQR? 35.5

16.

The histogram below shows the results from a survey asking 100 random kindergarten Americans "How many states in the US have you visited?"

Histogramstates

a) Based on the histogram, what percentage of kindergartners have visited 4 or 5 states? (answer to 2 decimals)

Answer: 20 (visually)

b) Based on the histogram, what percentage of kindergartners have visited at least two states? (answer to 2 decimals - and use the percentage as a decimal between 1 and 0)

Answer: 0.85 (visually)

17.

The picture below has three lines marked A, B, and C. One is the mean, one is the median, and one is the mode. Which one is which?

plotmeanmedianmode

A Mode
B Median
C Mean

The curve is right-skewed (positively skewed): the long tail extends to the right.
For skewed distributions, the relationship is:
Mode < Median < Mean

Looking at the vertical lines:

18.

Match each histogram/boxplot with one of the following descriptions: Skewed to the left, symmetric and bimodal, symmetric and unimodal, skewed to the right. Remark: Please ignore the diamonds in the boxplots, this is something odd that the statistical software puts in.

Agresti_2_86_1_7_SJS

19.

A study records the sex and weight (in kilograms) of 30 recently born bear cubs in Alaska. Which of the following statements is true?

The Statistical Abstract of the United States, prepared by the Census Bureau, provides the number of single-organ transplants for 2003 by organ. Assume that all types of single-organ transplants are presented below. The following two exercises are based on this table:

Heart 2034 Kidney 15146
Lung 1094 Pancreas 468
Liver 5047 Intestine 140

The data on single-organ transplants can be displayed in

Kidney transplants represented what percent of single-organ transplants in 2003?

about 63%
about 37%
about 58%
This percent cannot be calculated from the information provided in the table.

20.

The average salary of all female workers at a company is ​$44​,000. The average salary of all male workers is ​$52,000. What must be true about the average salary of all​ workers?

21.

Consider the following dataset:

10, 12, 12, 14, 14, 16, 16, 20, 22, 23, 23, 24, 25, 25, 27, 30, 35.

(a) Compute the median for the above data. 22

(b) Compute the first quartile for the above data. 14

(c) Compute the third quartile for the above data. 25

(d) Find the interquartile range (IQR) for the above data. 11

22.

Identify the correct match of the upper, middle and lower boxplots with their corresponding Histograms 1, 2, and 3.

Boxplots test exam 1
Picture1

23.!

Variability is a measure of the variance or diversity in the values of a data set.
A survey noted the color of random cars and random boats. The categorical data was graphed using the bar charts shown. Which one has greater variability?

barchartstable barchartvariable
Answer: Right

A survey noted the net worth of random college students and random retirees. The numerical data was graphed using the histograms shown. Which one has greater variability?

histogramstable histogramvariable
Answer: Right

College Students (Left Graph)

Retirees (Bottom Graph)

Conclusion

24.

Identify the correct match of Boxplots A, B, and C with their corresponding Histograms i., ii., and iii.

Exam 1 boxes and hists