a The shop's owner measured the rotations per minute of one of its lathe at various times over the course of a month. The obtained measurements are listed below.

250254261251257263253257265253259270253259291

We want to draw a combination of histogram and boxplot. Let's begin by drawing a histogram.

Histogram

A histogram is a graphical illustration of a data set. The data is grouped into specific intervals, which are called the bin width. This grouping is marked on a horizontal line.

We need to choose an appropriate bin width for each of the bars on our histogram. Remember all widths must have the same size. An approximation of how wide the bin width is the square root of the number of values in the data set. Let's count how many measurements we have.

250254261251257263253257265253259270253259291

We can see there are total of

15

measurements, therefore we can find the bin width.

15 = 3.872983 \dots \approx 4

Using a bin width of

4,

we are able to identify the number of observations in each interval starting from the minimum data value

250 .

Interval	Observations
$250 - 254$	$250,$ $251,$ $253,$ $253,$ $253,$ $254$
$254 - 258$	$257,$ $257$
$258 - 262$	$259,$ $259,$ $261$
$262 - 266$	$263,$ $265$
$266 - 270$	$270$
$270 - 274$	$-$
$274 - 278$	$-$
$278 - 282$	$-$
$282 - 286$	$-$
$286 - 290$	$-$
$290 - 294$	$291$

The histogram is the collection of rectangles drawn above the intervals. The height of these rectangles are proportional to the frequency of the data in the corresponding interval. Let's find the frequency of the data.

Interval	Observations	Frequency
$250 - 254$	$250,$ $251,$ $253,$ $253,$ $253,$ $254$	$6$
$254 - 258$	$257,$ $257$	$2$
$258 - 262$	$259,$ $259,$ $261$	$3$
$262 - 266$	$263,$ $265$	$2$
$266 - 270$	$270$	$1$
$270 - 274$	$-$	$0$
$274 - 278$	$-$	$0$
$278 - 282$	$-$	$0$
$282 - 286$	$-$	$0$
$286 - 290$	$-$	$0$
$290 - 294$	$291$	$1$

Now we have all the information we need to draw the histogram.

Boxplot

As a next step, we want to place a boxplot on top of the histogram. To create the boxplot, we need to determine the following five-number summary of the data set.

Minimum value 1^{st} Quartile Median 3^{rd} Quartile Maximum value

Examining the observations, we notice that they have been ordered from least to greatest.

250254261251257263253257265253259270253259291

Therefore, we can immediately identify the minimum and maximum value as

250

and

291 .

Also, the number of values in the data set is

15

, an odd number, which means the median must be the

8^{th}

observation.

250254261251257263253257265253259270253259291

To find the

1^{st}

and

3^{rd}

Quartile, we have to identify the middle value of the lower and upper half, which will be the average of the

4^{th}

and

5^{th}

value for the lower half and of the

1 1^{th}

and

1 2^{th}

value for the upper half.

250254261251257263253257265253259270253259291

Having identified the relevant values, we can calculate the quartiles.

1^{st} Quartile : 3^{rd} Quartile : \frac{2 5 3 + 2 5 3}{2} = 253 \frac{2 6 1 + 2 6 3}{2} = 262

Let's summarize what we have found.

Minimum value = 250 1^{st} Quartile = 253 Median = 257 3^{rd} Quartile = 262 Maximum value = 291

Notice that the measurement

291

is far away from the bulk of data distribution. Therefore, it is an outlier. We will mark it on a modified boxplot with a dot. This results in the right segment ending on measurement equal to

270,

which is the second highest obtained measurement.

Exercise