Izabella's favorite candy, Frutty, is sold in packs of thirty candies with three different flavors — apple, orange, and banana.
Izabella wants to know how many banana-flavored candies there are in each pack, so she bought ten packs and counted the number of banana candies in each. Her results are as follows.
Begin by finding the range of the data, then draw a number line which covers this range.
The smallest number in the data set is 8 and the largest is 12. This means that the dot plot can be displayed above a horizontal number line that covers at least the numbers from 8 to 12. Here, a number line from 7 to 13 will be used.
The number of dots drawn on the dot plot above a certain number should match the frequency of that number in the data set.
From here, the dot plot can be drawn as follows.
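The two steps above (finding the range, then stacking one dot per occurrence) can be sketched in a few lines of Python. The counts below are hypothetical stand-ins, chosen only so that the smallest value is 8 and the largest is 12; they are not Izabella's actual results, and the helper name `dot_plot` is likewise just for illustration.

```python
from collections import Counter

# Hypothetical counts for ten packs (NOT Izabella's actual results);
# chosen only so that the smallest value is 8 and the largest is 12.
banana_counts = [8, 9, 9, 10, 10, 10, 11, 11, 12, 10]

low, high = min(banana_counts), max(banana_counts)
frequency = Counter(banana_counts)  # value -> number of dots above it

# Number line from one below the minimum to one above the maximum (7 to 13).
axis = range(low - 1, high + 2)


def dot_plot(freq, axis):
    """Return the rows of a text dot plot, top row first, axis labels last."""
    tallest = max(freq.values())
    rows = []
    for level in range(tallest, 0, -1):
        # Draw a dot in this row only if the value occurs at least `level` times.
        rows.append(" ".join(" * " if freq.get(v, 0) >= level else "   " for v in axis))
    rows.append(" ".join(f"{v:^3}" for v in axis))
    return rows


print("\n".join(dot_plot(frequency, axis)))
```

Each column of stars has the same height as the frequency of the number beneath it, which is exactly the rule stated above.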
A multiple-choice test has ten questions. After grading the test, the teacher produced the following dot plot to show how many correct answers each student had on the test.
How many students are there in the class?
Each dot represents the performance of one student on the test.
Each dot represents the performance of a student on the test. For example, the single dot above the number 4 means that one student answered four questions correctly. The rest of the dot plot can be interpreted similarly.
Number | Dots Above the Number | Conclusion |
---|---|---|
0, 1, 2, 3 | 0 | There are no students who answered fewer than four questions correctly. |
4 | 1 | One student answered four questions correctly. |
5 | 3 | Three students answered five questions correctly. |
6 | 2 | Two students answered six questions correctly. |
7 | 4 | Four students answered seven questions correctly. |
8 | 5 | Five students answered eight questions correctly. |
9 | 3 | Three students answered nine questions correctly. |
10 | 2 | Two students answered all ten questions correctly. |
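Since every dot stands for one student, the class size is the total number of dots. A minimal sketch of that sum, with the values copied from the table above (the variable names are only illustrative):

```python
# Dots above each score, copied from the table above.
dots_above = {0: 0, 1: 0, 2: 0, 3: 0, 4: 1, 5: 3, 6: 2, 7: 4, 8: 5, 9: 3, 10: 2}

# Every dot is one student, so the class size is the total number of dots.
class_size = sum(dots_above.values())
print(class_size)  # 20
```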
A college hockey team played 23 games during a season. An enthusiastic fan made a dot plot of the number of goals the team scored in each game.
Group the data in a frequency table using the intervals specified in the prompt. The first interval covers the ages 40–44.
The frequency table below shows the grouping of the data starting at 40 and using 5-year intervals.
Interval | Frequency |
---|---|
40–44 | 2 |
45–49 | 7 |
50–54 | 12 |
55–59 | 13 |
60–64 | 8 |
65–69 | 2 |
70–74 | 1 |
Use these intervals and frequencies to draw the histogram.
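As a rough illustration of how the grouping and the bars relate, the sketch below rebuilds the interval labels from the starting value 40 and the width 5, then prints one text bar per interval using the frequencies from the table. It assumes whole-number ages, and the helper name `interval_label` is hypothetical.

```python
# Frequencies copied from the table above (5-year intervals starting at 40).
START, WIDTH = 40, 5
frequencies = [2, 7, 12, 13, 8, 2, 1]


def interval_label(age, start=START, width=WIDTH):
    """Return the label of the interval containing a whole-number age, e.g. '45–49'."""
    lower = start + ((age - start) // width) * width
    return f"{lower}–{lower + width - 1}"


# One label per row of the table: 40–44, 45–49, ..., 70–74.
labels = [interval_label(START + i * WIDTH) for i in range(len(frequencies))]

# The bar for each interval is as long as the frequency of that interval.
for label, count in zip(labels, frequencies):
    print(f"{label} | {'#' * count}")

total = sum(frequencies)  # number of data points summarized by the histogram
print(total)  # 45
```

Adding up the bar heights recovers the size of the data set, which is the same idea used to read a total off any histogram.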
In 1936, Sir Ronald Aylmer Fisher published a paper entitled The Use of Multiple Measurements in Taxonomic Problems.
Fisher investigated several measurements of three species of flowers.
The histogram below shows the summary of the data about the sepal length of the Iris virginica flowers.
How many Iris virginica flowers did Fisher investigate in this paper?
Consider the height of the rectangles in the histogram.
In a histogram, the height of the rectangles shows the frequency of the data elements in the corresponding interval.
A ranger is surveying a forest. He randomly selected 40 loblolly pines (Pinus taeda) and measured their heights. The histogram below is the summary of the data.
Rearrange the data in increasing order and find the five-number summary.
The box plot is built using these points.
Putting all this together gives the box plot.
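The five-number summary behind a box plot can be computed as sketched below. The sketch assumes the convention that the quartiles are the medians of the lower and upper halves, with the middle value left out of both halves when the number of data points is odd. The sample values are hypothetical and are not taken from the lesson.

```python
from statistics import median


def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum).

    Quartiles are taken as the medians of the lower and upper halves;
    the middle value is excluded from both halves when n is odd.
    """
    values = sorted(data)  # rearrange in increasing order
    if len(values) < 2:
        raise ValueError("need at least two data points")
    half = len(values) // 2
    lower = values[:half]
    upper = values[-half:]
    return (values[0], median(lower), median(values), median(upper), values[-1])


# Hypothetical measurements, not taken from the lesson.
sample = [7, 3, 9, 4, 6, 1, 8, 5, 2]
print(five_number_summary(sample))  # (1, 2.5, 5, 7.5, 9)
```

The box spans from Q1 to Q3 with a line at the median, and the whiskers reach to the minimum and the maximum.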
In the 1994 report The Population Biology of Abalone (Haliotis species) in Tasmania, the authors presented and investigated the measurements of 4177 blacklip abalones.
The lengths of the shells in millimeters are summarized in the box plot below.
How many blacklip abalones had a length shorter than 90 millimeters in this experiment?
Which part of the box plot is at 90?
The left side of the box is at 90, so the first quartile of the lengths is 90 millimeters.
The problem is now to find out how many data points are less than the first quartile. The first quartile is the median of the lower half of the data set. In this experiment there are 4177 data points, so dividing this by 2 and rounding down gives 2088 data points in the lower half. The first quartile is therefore the mean of the 1044th and 1045th smallest lengths.
Note that from the box plot alone, the only conclusion that can be drawn is that the number of blacklip abalones shorter than 90 millimeters is less than 1045.
In fact, there were 60 blacklip abalones with a length of 90 millimeters in the experiment. The answer option 1007 reflects the actual answer to the question, but to get this value, the full data is needed — the box plot is not enough.
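The counting argument can be checked with a short calculation. This is only a sketch, and it assumes the convention used above: the first quartile is the median of the lower half, and the overall median is not counted in either half.

```python
n = 4177                        # number of abalones reported in the lesson
lower_half = n // 2             # 2088 values lie below the median position

# The lower half has an even number of values, so Q1 is the mean of its
# two middle values: the 1044th and 1045th smallest lengths.
left_middle = lower_half // 2   # 1044
right_middle = left_middle + 1  # 1045

# The 1045th smallest length is at least Q1, so only the values before
# that position can be strictly shorter than Q1.
max_below_q1 = right_middle - 1
print(max_below_q1)  # 1044

# The actual count stated in the lesson is consistent with this bound.
actual_below_90 = 1007
print(actual_below_90 <= max_below_q1)  # True
```

The bound is not reached here because several abalones measure exactly 90 millimeters, and those values are not shorter than the first quartile.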
The heights, in feet, of red alder (Alnus rubra) trees in a forest are summarized in the following box plot.
In some cases, scientists use visual representations that go beyond the three types of plots discussed in this lesson. For example, the report about the blacklip abalones also contains data about their sex. This can be used to present a summary of the length in a stacked histogram.
In addition to representing the lengths, this stacked histogram has colored bars that indicate the distribution of male, female, and infant blacklip abalones.