Sign In
| 11 Theory slides |
| 9 Exercises - Grade E - A |
| Each lesson is meant to take 1-2 classroom sessions |
Here are a few recommended readings before getting started with this lesson.
Match the Finnish name with the Latin name.
The following box plots show the distribution of the heights (in feet and inches) of the players on the Ohio State Buckeyes men's basketball and football teams in the 2020–2021 season.
Of the two sports, which tends to place more importance on a player's height? Identify the maximum and the median heights of each team.
The box plots show that the range of heights is similar for both teams, but Team B, on average, has taller players.
Height tends to be more advantageous in basketball than in football. Therefore, it is reasonable to conclude from the box plots that Team B is the basketball team and Team A is the football team.
Represented using a histogram, the same data set is used to show the distribution of the height of the players on the two teams.
The table below shows the average monthly high temperatures across three small towns.
One town is located in the State of Alaska, another in Florida, and the other in Nebraska.
Analyzing the data set and map, try to match each town with the correct corresponding state. Note that, generally, northern states tend to be colder than southern states.
Think about the relationship between the location and climate of each state of Alaska, Nebraska, and Florida. Then, consider each month's average high temperatures as shown in the data set. Which town is the warmest? Which town is the coldest?
Investigating the given data set and map, the following observations can be made.
Considering each observation from the data set and map, it is likely that Noma is in Florida, Mekoryuk is in Alaska, and Nehawka is in Nebraska.
The following applet shows the histograms of two data sets. Move the slider to investigate the observations separately.
(a−b)2=a2−2ab+b2
LHS+a2+2ab+b2≤RHS+a2+2ab+b2
LHS/4≤RHS/4
Factor out 2
a2+2ab+b2=(a+b)2
Simplify quotient
LHS≤RHS
The table below shows the average monthly low temperatures of two cities — Kansas City and Seattle. The two cities given are not necessarily in order.
According to the data set, City A and B have annual average low temperatures around 45∘F and 43∘F, respectively. Referencing the map below, Seattle is located much further north than Kansas City. It is typical that northern states — on average — are colder than southern states. Nevertheless, Seattle experiences less variance in temperature changes during each season due to the ocean's tempering effect on the climate.
Think of the climate similarities and differences of coastal areas compared to inland areas. Find the range and standard deviation of the temperatures.
Using the range and standard deviation — measures of spread — will help compare the two cities' average low temperatures. A graphing calculator can be used to find the standard deviation.
Range | Standard Deviation | |
---|---|---|
City A | 55−36=19 | 6.7 |
City B | 66−18=48 | 16.4 |
These measures of spread show that the temperature throughout the year changes much less in City A than in City B. Based on that analysis, and considering the information given about the tempering effect of the ocean, it is reasonable to conclude that City A is Seattle and City B is Kansas City. What a cool conclusion to make.
In the US stock market, a measure of how much a stock price fluctuates during a certain period of time is called historical volatility. The following data set from the year 2020 contains information about the daily closing stock price (in dollars) of two companies.
Low | Mean | High | Standard Deviation | |
---|---|---|---|---|
APDN | $2.52 | $6.89 | $15.21 | $2.24 |
DSS | $4.04 | $6.90 | $10.89 | $1.69 |
Which stock price was less volatile in 2020?
Which part of the table gives information about the spread of the stock price?
The numbers in the given table can be interpreted into the following sentences.
Both the range and standard deviation are smaller for DSS. These interpretations indicate that DSS's stock price fluctuated less over the year than the stock price of APDN. Therefore, it can be concluded that the stock price of DSS was less volatile in 2020.
Consider the following two histograms where neither the labels nor scales are specified.
Both of these histograms represent different distributions, and both have 26 columns.
In a lottery, all numbers are drawn with equal probability.
In a mathematics competition, only a few students answer all questions correctly. Still, a lot of students will be able to answer at least some of the questions correctly. Consequently, it is likely that the histogram is shaped like a mountain, with a peak in the center and low ends. The shape of Histogram A reflects this behavior.
In a lottery, all numbers are drawn with equal probability, so in the long run, it can be expected that there is little difference between the frequencies. The shape of Histogram B reflects this.
There is even more fascinating information to be discovered from the shapes of the histograms.
The height of the lone tall bar furthest to the left in Histogram A shows that in the AMC 8 competition, there were plenty of participants in 2020 who did not answer a single question correctly! Well, it is much more likely, however, that these participants registered but did not attend the competition.
Histogram A's peak shows that in 2020, on average, students in the AMC 8 competition answered less than half of the questions correctly.
The fluctuation of the bar heights in Histogram B shows that although an even distribution of the numbers is expected on the Powerball draw, some numbers historically came out fewer times.
The bar corresponding to 24 is more than twice as high as the bar corresponding to 16. However, this does not mean that 24 is twice as likely to come out in a draw. Nor does this mean that players should now play 16 because it will eventually catch up. The data is historical; it does not have any effect on the next draw.
Mean Length to Height Ratio | |
---|---|
Abramis Bjorkna | 2.55 |
Leuciscus Rutilus | 3.75 |
Osmerus Eperlanus | 5.95 |
Esox Lucius | 6.33 |
Next, the actual drawings can be used to find their length to height ratios. This measurement, however, is in pixels instead of centimeters. Most image software on a standard computer can show these measurements. Here, they are given. Recall that the drawings use the Finnish names.
The results, in increasing order, can be summarized as follows.
Length to Height Ratio (Images) | |
---|---|
Pasuri | 120361≈3.01 |
Särki | 106393≈3.71 |
Hauki | 59358≈6.07 |
Norssi | 55358≈6.51 |
The numbers in the two tables do not match exactly, which would make sense given that they are measured using different measurements, and the images are not matching in scale. Still, in both tables, two species have a ratio above 5 and two species have a ratio below 4. That means the following distinction can be made.
Latin Name (Data Set) | Finnish Name (Images) | |
---|---|---|
Longer Fishes | Osmerus eperlanus and Esox lucius | Hauki and Norssi |
Taller Fishes | Abramis bjorkna and Leuciscus rutilus | Pasuri and Särki |
Latin Name | Finnish Name | English Name |
---|---|---|
Abramis Bjorkna | Pasuri | Bream |
Leuciscus Rutilus | Särki | Roach |
Osmerus Eperlanus | Norssi | Smelt |
Esox Lucius | Hauki | Pike |
To calculate the standard deviation of a data set, we need to know the individual observations and the mean of the data set. We do not know the numbers nor how many there are. However, what we do know is that they are all equal. Therefore, let's say that we have n numbers and that all of them are equal to x. With this information, we can write an expression for their mean. Mean = x+x+⋯+x/n Notice that since we are adding n equal numbers we can rewrite the sum in the numerator as a product of n and x.
As we can see, the mean is the same as the values themselves. This means the difference between the individual observations and the mean are all 0. Now, let's calculate the standard deviation.
Consequently, the standard deviation is 0. If we think about it, this makes a lot of sense. The standard deviation measures the spread of observations around the mean. But if all observations are the same, then there is no spread at all. Therefore, the standard deviation is 0.
Let's call the middle value n. Since the integers are consecutive, we get the following sequence. n-2, n-1, n, n+1, n+2 If we add these numbers and divide by 5, we get the mean.
When we have an expression for both the mean and the observations, we can calculate the standard deviation.
Consequently, the standard deviation of five consecutive integers is sqrt(2).