Chapter 4 Powerpoint
Chapter 4 Powerpoint
Describing Data: Displaying and Exploring Data Describing Data: Displaying and Exploring Data
GOALS FIVE
When you have completed this chapter, you will be able to: Compute and understand the coefficient of variation and the
coefficient of skewness.
ONE
Develop and interpret a dot plot. SIX
TWO Draw and interpret a scatter diagram.
Develop and interpret a stem-and-leaf display.
THREE SEVEN
Compute and interpret quartiles, deciles, and percentiles. Set up and interpret a contingency table.
FOUR
Construct and interpret box plots.
HSK/2021-2022 HSK/2021-2022
Goals Goals
4-3 4-4
Dot plots:
Report the details of each observation
Are useful for comparing two or more data sets
HSK/2021-2022 HSK/2021-2022
Dot Plot Example 1
4-5 4-6
4-7 4-8
4-9 4-10
4-11 4-12
Quartiles Quartiles
4-13 4-14
Quartiles Quartiles
Lp = (n+1)
HSK/2021-2022 HSK/2021-2022
Quartiles (continued) Quartiles (continued)
4-15 4-16
4-17 4-18
For a set of
The Interquartile This distance will observations the third
range is the distance include the middle 50 quartile is 24 and the
between the third percent of the first quartile is 10.
quartile Q3 and the observations. What is the quartile
first quartile Q1.
deviation?
The interquartile range is
24 - 10 = 14. Fifty
percent of the observations
Interquartile range = Q3 - Q1 will occur between 10 and
24.
4-19 4-20
Based on a sample of 20
A box plot is a graphical
deliveries,
display, based on quartiles,
that helps to picture a set of Buddy’s Pizza determined the
data. following information. The
minimum delivery time was 13
Five pieces of data minutes and the maximum 30
are needed to minutes. The first quartile was
construct a box 15 minutes, the median 18
plot: the Minimum minutes, and the third quartile
Value, the First 22 minutes. Develop a box plot
Quartile, the for the delivery times.
Median, the Third
Quartile, and the
Maximum Value.
HSK/2021-2022 HSK/2021-2022
Box Plots Example 4
4-21 4-22
HSK/2021-2022 HSK/2021-2022
Example 4 continued Example 4 continued
4-23 4-24
sk
3 X Median in a wider range for the
coefficient.
s
HSK/2021-2022 Coefficient of Variation HSK/2021-2022
Movie
4-25 4-26
Using the twelve stock prices, we find the mean to be Scatter Variables must be at least interval scaled.
84.42, standard deviation, 7.18, median, 84.5. diagram: A
technique
Coefficient of variation used to show Relationship can be positive (direct) or
the negative (inverse).
relationship
= 8.5% between
variables.
Coefficient of skewness
Example
3 X Median The twelve days of stock prices and the overall market
sk = -.035 index on each day are given as follows:
s
HSK/2021-2022 HSK/2021-2022
Example 2 revisited Scatter diagram
4-27 4-28
A contingency table is
Index
used to classify
(000s) Price
observations according to
8.0 96 two identifiable
7.5 92 characteristics.
7.5 91
Contingency tables are used
7.3 88
7.2 86 when one or both variables are
7.2 85 nominally scaled.
7.1 84
7.1 83 A contingency table is a
7.0 82 cross tabulation that
6.2 79 simultaneously
6.2 78 summarizes two variables
5.1 69 of interest.
HSK/2021-2022 HSK/2021-2022
Example 2 revisited Contingency table
4-29 4-30
Weight Loss
Weight Less 20 up to 40
45 adults, all 60 pounds
overweight, are randomly
Loss than 20 40 pounds
assigned to three weight loss Plan pounds pounds or more
programs. Twenty weeks Plan 1 4 8 3
into the program, a
researcher gathers data on Plan 2
weight loss and divides the 2 12 1
loss into three categories: Plan 3
less than 20 pounds, 20 up 12 2 1
to 40 pounds, 40 or more
pounds. Here are the Compare the weight loss under the three plans.
results.
HSK/2021-2022 Example 5 HSK/2021-2022
Example 5 continued
4-31 4-32
First Quartile (for grouped data) Third Quartile (for grouped data)
• The First Quartile is the value • The Third Quartile is the value
corresponding to the point below which corresponding to the point below which
25% of the observations lie in an ordered 75% of the observations lie in an ordered
data set. data set:
• where L=lower limit of the class containing Q1, CF= – where L=lower limit of the class containing Q3, CF=
cumulative frequency preceding class containing Q1, f= cumulative frequency preceding class containing Q3, f=
frequency of class containing Q1, i= size of class frequency of class containing Q3, i= size of class
containing Q1. HSK/2021-2022 containing Q3. HSK/2021-2022