{{ 'ml-label-loading-course' | message }}
{{ toc.name }}
{{ toc.signature }}
{{ tocHeader }} {{ 'ml-btn-view-details' | message }}
{{ tocSubheader }}
{{ 'ml-toc-proceed-mlc' | message }}
{{ 'ml-toc-proceed-tbs' | message }}
Lesson
Exercises
Recommended
Tests
An error ocurred, try again later!
Chapter {{ article.chapter.number }}
{{ article.number }}. 

{{ article.displayTitle }}

{{ article.intro.summary }}
Show less Show more expand_more
{{ ability.description }} {{ ability.displayTitle }}
Lesson Settings & Tools
{{ 'ml-lesson-number-slides' | message : article.intro.bblockCount }}
{{ 'ml-lesson-number-exercises' | message : article.intro.exerciseCount }}
{{ 'ml-lesson-time-estimation' | message }}
 Bivariate Quantitative Data
Reference

Data Displays

Concept

Chart

A chart is a graphical representation of statistical information, often in the form of a graph or diagram. Charts are used to visually represent numerical or categorical data, making it easier to understand patterns, trends, and relationships within the data. For instance, line graphs are often used to show trends over time, such as the rate of deforestation.

A line graph showing the number of Tropical Rainforests remaining over time
Other common types of charts include bar graphs, pie charts, scatter plots, or histograms. Each type of these charts is suited to represent different types of data and relationships.
Concept

Line Graph

A line graph is used to show how a set of data changes with respect to another quantity, often a period of time. To make a line graph, a scale and intervals for the coordinate axes are chosen. The data points are then graphed and a line connecting the points drawn. Consider a table of values that represents the growth of a plant over several weeks.

Plant Growth
Week
Height (in.)

The height data includes values from to so a scale from to inches with an interval of inch is reasonable. The horizontal axis can represent time in weeks and the vertical axis can represent the plant height in inches. Now the points can be plotted on a coordinate plane and connected with a line.

Line graph
By observing the upward and downward slant of the lines connecting the points, the trends in the data can be described and future events can be predicted.
Concept

Dot Plot

A dot plot, also known as a line plot, is a way to represent numerical or categorical data in which each data point is represented with a dot above a horizontal line with numbers or categories. Dots representing the same measurement are stacked above each other. Consider the following data set.
  • There are two in this data set, so two dots are stacked above the number on the number line of the corresponding dot plot.
  • There is one in this data set, so a single dot is drawn above number
  • There are five in this data set, so five dots are stacked above number on the number line.
A dot plot illustrating the data set given in the text.
Dot plots are normally used for discrete data. For data sets containing more than data points, dot plots are often inconvenient and other representations are preferred.
Concept

Box Plot

A box plot, or box and whisker plot, can be used to illustrate the distribution of a data set. A box plot has three parts.

  • A rectangular box that extends from the first to the third quartiles and with a line between and indicating the position of the median.
  • A segment attached to the left of the box that extends from the first quartile to the minimum of the data set.
  • A segment attached to the right of the box that extends from the third quartile to the maximum of the data set. The two segments are called whiskers.

If a data set has outliers, they are marked as separate points to the left and/or right of the whiskers. A box plot is a scaled figure and is usually presented above a number line. The set of numbers used to draw the box plot is called the five-number summary of the data set. Each of the five numbers is labeled below.

Boxplot shown above a number line with a five-number summary from left to right as 1, 3, 5, 8, 10.
A box plot provides a visual illustration of the distribution of a data set. Each segment of the plot contains one quarter, or of the data, and the center of the data lies inside the box. The further apart the segments are, the greater the spread is for that quarter of the data.
Concept

Histogram

A histogram is a graphical illustration of a frequency distribution of a data set that contains numerical data. Histograms have several defining characteristics.

  • The data is grouped into specific ranges of values known as intervals.
  • All intervals in a histogram must be the same size.
  • Interval data is marked in groups along the horizontal axis.
  • A histogram is a collection of rectangles drawn above the intervals.
  • The height of each rectangle is proportional to the frequency of the data in the corresponding interval.
Consider an example situation. A grocery store wants to examine the weights of the apples they sell. To read the distribution, it is not necessary to show each apple's weight individually. Instead, the apples can be grouped by their weights in intervals of grams: and so on.
A histogram showing the distribution of the weight of apples
A histogram looks similar to a bar graph. The difference is that a histogram always has intervals of numbers on the horizontal axis and the bars cannot have a space between each other because the data is continuous.
Concept

Bar Graph

A bar graph is a graphical representation of a categorical data. It is made of rectangular bars and each bar represents a category and its corresponding value. Bar graphs are commonly used to show frequency distributions, in which case they are often created using the data in a frequency table.

Bar graph with 5 categories: Category 1 has a value of 1, Category 2 has a value of 2, Category 3 has a value of 3, Category 4 has a value of 1, and Category 5 has a value between 3 and 4, closer to 4.
A bar graph can be oriented either horizontally or vertically by swapping which axis shows the categories and which shows the value of each category. For example, consider a group of students who are asked what their favorite color is. The categories and values will be adjusted accordingly.
Vertical and Horizontal Bar Graphs
The graphical representation of a frequency distribution for numerical data is called a histogram.
Concept

Pie Chart

A pie chart is a circular chart used to represent the relative frequencies of a data set. It is also called a circle chart. These charts are divided into several slices — each representing a group of the whole data set. The following characteristics are typical of pie charts.

  • Each slice is drawn using a different color to differentiate the groups.
  • The central angle of each group, as well as its area, is proportional to its relative frequency.

A pie chart allows the visualization of each individual data group when compared to the whole. Alone, however, the chart does not give information about the frequency of each group.

Pie Chart with Side Labels

Pie charts might also include the relative frequency of each group written as a percentage. It is also possible to include labels to represent each group with matching colors.

Pie Chart with Side Labels
Concept

Scatter Plot

A scatter plot is a graph that shows each observation of a bivariate data set as an ordered pair in a coordinate plane. Consider the following example, where a scatter plot illustrates the results gathered at a local ice cream parlor. This study records the number of ice creams sold and the corresponding air temperature.

Scatter plot of the number of ice creams sold based on temperatures with a positive correlation.
Among other insights, the graph shows that when the temperature is about approximately ice creams are sold. Additionally, as the temperature increased, the number of sales also increased. In this case, it can be said that there is a positive correlation between the variables of the data set — the number of ice creams sold and the air temperature.
Concept

Stem-and-Leaf Plot

A stem-and-leaf plot is a table that orders numerical data, which can be either discrete or continuous, and shows how they are distributed. A stem-and-leaf plot is constructed by breaking each number from the data into a stem and a leaf. The stem of the number is all but the last digit and the leaf is always the last digit. Stem-and-leaf plots include a key that defines how the numbers in the set are to be interpreted.

Examples

The following stem-and-leaf plot represents the set of numbers and Since appears in the data set twice, the number will appear twice in the leaf of the line.
Here is a stem-and-leaf plot representing the numbers and The decimal point does not need to be included in the plot as it can be concluded from the key where it should be placed.
Note how the leaf part of the table is left empty when the stem is Since the role of the plot is to also show how the data is distributed, empty rows should not be omitted.
Loading content