{{ 'ml-label-loading-course' | message }}
{{ toc.name }}
{{ toc.signature }}
{{ tocHeader }} {{ 'ml-btn-view-details' | message }}
{{ tocSubheader }}
{{ 'ml-toc-proceed-mlc' | message }}
{{ 'ml-toc-proceed-tbs' | message }}
Lesson
Exercises
Recommended
Tests
An error ocurred, try again later!
Chapter {{ article.chapter.number }}
{{ article.number }}. 

{{ article.displayTitle }}

{{ article.intro.summary }}
Show less Show more expand_more
{{ ability.description }} {{ ability.displayTitle }}
Lesson Settings & Tools
{{ 'ml-lesson-number-slides' | message : article.intro.bblockCount }}
{{ 'ml-lesson-number-exercises' | message : article.intro.exerciseCount }}
{{ 'ml-lesson-time-estimation' | message }}
 Bivariate Quantitative Data
Reference

Data Displays

Concept

Line Graph

A line graph is used to show how a set of data changes over a period of time. To make a line graph, a scale and interval should be chosen. Then the pairs of data should be graphed and a line connecting the points should be drawn. Consider a table of values that represents the growth of a plant over several weeks.

Plant Growth
Week
Height,

The height data includes values from to so a scale from to inches with an interval of inch are reasonable. The horizontal axis can represent time in weeks and the vertical axis can represent the plant height in inches. Now, the points can be plotted on a coordinate plane and connected.

Line graph
By observing the upward and downward slant of lines connecting the points, the trends in the data can be described and future events can be predicted.
Concept

Dot Plot

A dot plot, also known as line plot, is a way to represent numerical data in which each data point is represented with a dot above a horizontal number line. The dots representing the same measurements are stacked above each other. Consider the following data set.
  • There are two s in this data set, so on the corresponding dot plot two dots are stacked above the number of the number line.
  • There is one in this data set, so on the corresponding dot plot a single dot is drawn above the number of the number line.
  • There are five s in this data set, so on the corresponding dot plot five dots are stacked above the number of the number line.
A dot plot illustrating the data set given in the text.
Dot plots are normally used for discrete data. For data sets containing more than data points, dot plots are often inconvenient and other representations are preferred.
Concept

Box Plot

A box plot or box and whisker plot can be used to illustrate the distribution of a data set. A box plot has three parts.

  • A rectangular box that extends from the first to the third quartiles and with a line between and indicating the position of the median.
  • A segment attached to the left of the box that extends from the first quartile to the minimum of the data set.
  • A segment attached to the right of the box that extends from the third quartile to the maximum of the data set.

A box plot is a scaled figure, usually presented above a number line. The set of numbers used to draw the box plot is called the five-number summary of the data set. Each of the five numbers is labeled accordingly.

Boxplot shown above a number line with a five-number summary from left to right as 1, 3, 5, 8, 10.
A box plot provides a visual illustration of the distribution of a data set. Each segment of the chart contains one quarter, or of the data, and the center of the data lies inside the box. The further apart the segments are, the greater the spread is for that quarter of the data.
Concept

Histogram

A histogram is a graphical illustration of a frequency distribution of a data set that contains numerical data. Histograms have several defining characteristics.

  • The data is grouped into specific ranges of values known as intervals.
  • All intervals in a histogram must be the same size.
  • Interval data is marked in groups along the horizontal axis.
  • A histogram is a collection of rectangles drawn above intervals.
  • The height of each rectangle is proportional to the frequency of the data in the corresponding interval.
Consider an example situation. A fruit store wants to examine the weights of the apples they sell. To see the distribution, it is not necessary to show each apple's weight individually. Instead, the apples can be grouped by their weights in intervals of and so on.
A histogram showing the distribution of the weight of apples
A histogram looks similar to a bar graph. The difference is that a histogram has numbers on the horizontal axis and the bars cannot have a space between each other because the data is continuous.
Concept

Bar Graph

A bar graph is a graphical representation of a frequency distribution of a categorical data. It is made of rectangular bars and each bar represents a category and its frequency in a data set. A bar graph is most often created using the data in a frequency table.

Bar graph with 5 categories: Category 1 has a value of 1, Category 2 has a value of 2, Category 3 has a value of 3, Category 4 has a value of 1, and Category 5 has a value between 3 and 4, closer to 4.
A bar graph can be oriented either horizontally or vertically by swapping which axis shows the categories and which contains the frequency of each category. For example, consider a group of students who are asked what their favorite color is. The categories and values will be adjusted accordingly.
Vertical and Horizontal Bar Graphs
The graphical representation of a frequency distribution for numerical data is called a histogram.
Concept

Pie Chart

A pie chart is a circular chart used to represent the relative frequencies of a data set. It is also called a circle chart. These charts are divided into several slices — each representing a group of the whole data set. The following characteristics are typical of pie charts.

  • Each slice is drawn using a different color to differentiate the groups.
  • The central angle of each group, as well as its area, is proportional to its relative frequency.

A pie chart allows the visualization of each individual data group when compared to the whole. Alone, however, the chart does not give information about the frequency of each group.

Pie Chart with Side Labels

Pie charts might also include the relative frequency of each group written as a percentage. It is also possible to include labels to represent each group with matching colors.

Pie Chart with Side Labels
Concept

Scatter Plot

A scatter plot is a graph that shows each observation of a bivariate data set as an ordered pair in a coordinate plane. Consider the following example, where a scatter plot illustrates the results gathered at a local ice cream parlor. This study records the number of ice creams sold and the corresponding air temperature.

Scatter plot of the number of ice creams sold based on temperatures with a positive correlation.
Among other insights, the graph shows that when the temperature is about approximately ice creams are sold. Additionally, as the temperature increased, the number of sales also increased. In this case, it can be said that there is a positive correlation between the data sets — the number of ice creams sold and the air temperature.
Concept

Stem-and-Leaf Plot

A stem-and-leaf plot is a table that orders numerical data, which can be either discrete or continuous, and shows how they are distributed. A stem-and-leaf plot is constructed by breaking each number from the data into a stem and a leaf. The stem of the number is all but the last digit and the leaf is always the last digit. Stem-and-leaf plots include a key that defines how the numbers in the set are to be interpreted.

Examples

The following stem-and-leaf plot represents the set of numbers and Since appears in the data set twice, the number will appear twice in the leaf of the line.
Here is a stem-and-leaf plot representing the numbers and The decimal point does not need to be included in the plot as it can be concluded from the key where it should be placed.
Note how the leaf part of the table is left empty when the stem is Since the role of the plot is to also show how the data is distributed, empty rows should not be omitted.
Loading content