
Chapter 4 Powerpoint

This chapter discusses methods for describing and exploring data, including dot plots, stem-and-leaf displays, quartiles, box plots, and contingency tables. It provides examples of computing quartiles, percentiles, and coefficients of variation and skewness from datasets. The chapter also demonstrates how to set up and interpret scatter diagrams and contingency tables to compare distributions and examine relationships between variables.

Uploaded by NorakPanha Cheam
Copyright © All Rights Reserved

HSK / 2021-2022

Chapter Four

Describing Data: Displaying and Exploring Data

GOALS
When you have completed this chapter, you will be able to:
ONE
Develop and interpret a dot plot.
TWO
Develop and interpret a stem-and-leaf display.
THREE
Compute and interpret quartiles, deciles, and percentiles.
FOUR
Construct and interpret box plots.
FIVE
Compute and understand the coefficient of variation and the coefficient of skewness.
SIX
Draw and interpret a scatter diagram.
SEVEN
Set up and interpret a contingency table.

Dot Plot

Dot plots:
• Report the details of each observation
• Are useful for comparing two or more data sets

Example 1

This example gives the percentages of men and women participating in the workforce in a recent year for the fifty states of the United States. Compare the dispersions of labor force participation by gender.


Example 1 (continued)

[Figure: percentage of women participating in the labor force for the 50 states.]

[Figure: percentage of men participating in the labor force for the 50 states.]
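A dot plot can be sketched in plain text. The Python sketch below is illustrative only: the function name dot_plot and both data sets are hypothetical, and they are not the state labor force figures from Example 1, which are not reproduced here. It stacks one dot per observation above each integer value so that two groups can be compared on the same scale.

```python
from collections import Counter


def dot_plot(values, low, high):
    """Return a text dot plot: one column per integer from low to high,
    one dot stacked in that column for each observation with that value."""
    counts = Counter(values)
    tallest = max(counts.values())
    rows = []
    for level in range(tallest, 0, -1):
        row = "".join(
            "." if counts[x] >= level else " " for x in range(low, high + 1)
        )
        rows.append(row.rstrip())
    return "\n".join(rows)


# Hypothetical percentages for illustration, NOT the data from Example 1.
group_a = [60, 61, 61, 62, 62, 62, 63]
group_b = [55, 58, 61, 61, 64, 67, 70]

for name, group in (("Group A", group_a), ("Group B", group_b)):
    print(name, "range =", max(group) - min(group))
    print(dot_plot(group, 55, 70))
```

With these made-up values, Group A is tightly clustered and Group B is spread across the whole scale, which is the kind of difference in dispersion a pair of dot plots makes visible.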

Stem-and-leaf Displays

Stem-and-leaf display: A statistical technique for displaying a set of data. Each numerical value is divided into two parts: the leading digits become the stem and the trailing digits the leaf.

Note: an advantage of the stem-and-leaf display over a frequency distribution is that we do not lose the identity of each observation.

Example 2

Stock prices on twelve consecutive days for a major publicly traded company
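The stem-and-leaf split can be sketched in a few lines of Python. This is a minimal sketch, assuming two-digit values so that the tens digit is the stem and the units digit is the leaf. The function name stem_and_leaf is hypothetical, and the twelve prices are taken from the ordered listing that appears later in Example 2, so the day-by-day order is not represented.

```python
from collections import defaultdict


def stem_and_leaf(values):
    """Group each value's trailing digit (leaf) under its leading digit (stem)."""
    stems = defaultdict(list)
    for v in sorted(values):
        stems[v // 10].append(v % 10)
    return dict(sorted(stems.items()))


# The twelve stock prices from Example 2.
prices = [69, 78, 79, 82, 83, 84, 85, 86, 88, 91, 92, 96]

display = stem_and_leaf(prices)
for stem, leaves in display.items():
    print(f"{stem} | {' '.join(str(leaf) for leaf in leaves)}")
```

Every original price can be read back from a stem and one of its leaves, which is the advantage over a frequency distribution noted above.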


Example 2 (continued)

[Figure: stem-and-leaf display of stock prices.]

Quartiles

Location of a percentile:

$L_p = (n + 1) \frac{P}{100}$

where n is the number of observations and P is the desired percentile.
4-15 4-16

Example 2 (continued)

Using the twelve stock prices, we can find the median, 25th, and 75th percentiles as follows:

Observation   Price
12            96
11            92
10            91
 9            88
 8            86
 7            85
 6            84
 5            83
 4            82
 3            79
 2            78
 1            69

Q3 (Quartile 3), 75th percentile:
Price at 9.75 observation = 88 + .75(91 - 88) = 90.25

Q2 (Median), 50th percentile:
Price at 6.50 observation = 84 + .5(85 - 84) = 84.50

Q1 (Quartile 1), 25th percentile:
Price at 3.25 observation = 79 + .25(82 - 79) = 79.75
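The three calculations above follow one pattern: find the location (n + 1)(P/100), then interpolate between the two neighbouring observations. A minimal Python sketch of that pattern follows. The function name percentile is hypothetical, and clamping locations that fall outside the data to the smallest or largest value is an assumption made here, not something the slides specify.

```python
def percentile(values, p):
    """Value at the p-th percentile using the location (n + 1) * p / 100
    and linear interpolation between adjacent ordered observations."""
    data = sorted(values)
    n = len(data)
    location = (n + 1) * p / 100
    whole = int(location)        # observation just below the location
    fraction = location - whole  # how far to move toward the next observation
    if whole < 1:                # assumption: clamp below the first observation
        return data[0]
    if whole >= n:               # assumption: clamp above the last observation
        return data[-1]
    return data[whole - 1] + fraction * (data[whole] - data[whole - 1])


# The twelve stock prices from Example 2.
prices = [69, 78, 79, 82, 83, 84, 85, 86, 88, 91, 92, 96]

q1 = percentile(prices, 25)
median = percentile(prices, 50)
q3 = percentile(prices, 75)
print(q1, median, q3)
```

Statistical packages often use other interpolation rules, so their quartiles for the same data can differ slightly from these.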


Interquartile Range

The interquartile range is the distance between the third quartile Q3 and the first quartile Q1. This distance will include the middle 50 percent of the observations.

Interquartile range = Q3 - Q1

Example 3

For a set of observations the third quartile is 24 and the first quartile is 10. What is the interquartile range?

The interquartile range is 24 - 10 = 14. Fifty percent of the observations will occur between 10 and 24.

Box Plots

A box plot is a graphical display, based on quartiles, that helps to picture a set of data.

Five pieces of data are needed to construct a box plot: the Minimum Value, the First Quartile, the Median, the Third Quartile, and the Maximum Value.

Example 4

Based on a sample of 20 deliveries, Buddy’s Pizza determined the following information. The minimum delivery time was 13 minutes and the maximum 30 minutes. The first quartile was 15 minutes, the median 18 minutes, and the third quartile 22 minutes. Develop a box plot for the delivery times.
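As a rough illustration of how the five values determine the picture, the Python sketch below draws a one-line text box plot for the delivery times. It is a sketch under stated assumptions, not the chapter's drawing: the function name ascii_box_plot is hypothetical, the inputs are assumed to be whole numbers, and the plot uses one character per minute.

```python
def ascii_box_plot(minimum, q1, median, q3, maximum):
    """One-line text box plot, one character per unit:
    '|' marks the minimum and maximum, '=' the box from Q1 to Q3,
    'M' the median, and '-' the whiskers."""
    line = []
    for x in range(minimum, maximum + 1):
        line.append("=" if q1 <= x <= q3 else "-")
    line[0] = "|"
    line[-1] = "|"
    line[median - minimum] = "M"
    return "".join(line)


# Five-number summary of the delivery times in Example 4 (minutes).
minimum, q1, median, q3, maximum = 13, 15, 18, 22, 30

plot = ascii_box_plot(minimum, q1, median, q3, maximum)
iqr = q3 - q1
print(plot)
print("Interquartile range:", iqr)
```

The box covers 15 to 22 minutes, an interquartile range of 7. The right whisker (22 to 30) is much longer than the left (13 to 15), and the median sits closer to the first quartile, which suggests the delivery times are positively skewed.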


Example 4 (continued)

Coefficient of Variation

Relative dispersion

The coefficient of variation is the ratio of the standard deviation to the arithmetic mean, expressed as a percentage:

$CV = \frac{s}{\bar{X}} (100\%)$

Skewness

Skewness is the measurement of the lack of symmetry of the distribution.

The coefficient of skewness can range from -3.00 up to 3.00 when using the following formula:

$sk = \frac{3(\bar{X} - \text{Median})}{s}$

A value of 0 indicates a symmetric distribution.

Some software packages use a different formula which results in a wider range for the coefficient.
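Both coefficients can be checked with Python's standard statistics module. The sketch below applies the two definitions to the twelve stock prices from Example 2. The variable names are illustrative, and the sample standard deviation (statistics.stdev) is assumed to be the s intended here.

```python
import statistics

# The twelve stock prices from Example 2.
prices = [69, 78, 79, 82, 83, 84, 85, 86, 88, 91, 92, 96]

mean = statistics.mean(prices)
median = statistics.median(prices)
s = statistics.stdev(prices)  # sample standard deviation (divides by n - 1)

# Coefficient of variation: standard deviation as a percentage of the mean.
cv = s / mean * 100

# Coefficient of skewness: 3 * (mean - median) / s.
sk = 3 * (mean - median) / s

print(f"mean = {mean:.2f}, median = {median}, s = {s:.2f}")
print(f"CV = {cv:.1f}%, sk = {sk:.3f}")
```

The skewness here is computed from the unrounded mean. Substituting the rounded values 84.42 and 7.18 gives about -.033 instead of -.035.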


Example 2 revisited

Using the twelve stock prices, we find the mean to be 84.42, standard deviation, 7.18, median, 84.5.

Coefficient of variation

$CV = \frac{s}{\bar{X}} (100\%) = \frac{7.18}{84.42} (100\%) = 8.5\%$

Coefficient of skewness

$sk = \frac{3(\bar{X} - \text{Median})}{s} = -.035$

Scatter diagram

Scatter diagram: A technique used to show the relationship between variables.

Variables must be at least interval scaled.

Relationship can be positive (direct) or negative (inverse).

Example

The twelve days of stock prices and the overall market index on each day are given as follows:

Example 2 revisited

Index (000s)   Price
8.0            96
7.5            92
7.5            91
7.3            88
7.2            86
7.2            85
7.1            84
7.1            83
7.0            82
6.2            79
6.2            78
5.1            69

Contingency table

A contingency table is used to classify observations according to two identifiable characteristics.

Contingency tables are used when one or both variables are nominally scaled.

A contingency table is a cross tabulation that simultaneously summarizes two variables of interest.


Example 5

45 adults, all 60 pounds overweight, are randomly assigned to three weight loss programs. Twenty weeks into the program, a researcher gathers data on weight loss and divides the loss into three categories: less than 20 pounds, 20 up to 40 pounds, 40 or more pounds. Here are the results.

Example 5 continued

                         Weight Loss
Plan     Less than 20 pounds   20 up to 40 pounds   40 pounds or more
Plan 1            4                     8                   3
Plan 2            2                    12                   1
Plan 3           12                     2                   1

Compare the weight loss under the three plans.
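One way to compare the three plans is to turn each row of the contingency table into percentages of that plan's participants. The Python sketch below does this for the counts in Example 5. The names table and row_percentages are illustrative, and row percentages are one reasonable choice of comparison rather than a method the slides prescribe.

```python
categories = ["Less than 20 pounds", "20 up to 40 pounds", "40 pounds or more"]

# Counts from Example 5: rows are plans, columns are weight-loss categories.
table = {
    "Plan 1": [4, 8, 3],
    "Plan 2": [2, 12, 1],
    "Plan 3": [12, 2, 1],
}


def row_percentages(counts):
    """Express each count as a percentage of its row total."""
    total = sum(counts)
    return [100 * c / total for c in counts]


row_pct = {plan: row_percentages(counts) for plan, counts in table.items()}
column_totals = [sum(column) for column in zip(*table.values())]
grand_total = sum(column_totals)

for plan, percentages in row_pct.items():
    print(plan, ", ".join(f"{p:.1f}%" for p in percentages))
print("Column totals:", column_totals, "Grand total:", grand_total)
```

Each plan has 15 participants. On these counts, 80% of Plan 3 lost less than 20 pounds, 80% of Plan 2 lost 20 up to 40 pounds, and Plan 1 has the largest share (20%) in the 40 pounds or more category.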

First Quartile (for grouped data)
• The First Quartile is the value corresponding to the point below which 25% of the observations lie in an ordered data set:

$Q_1 = L + \frac{\frac{n}{4} - CF}{f} (i)$

• where n = total number of observations, L = lower limit of the class containing Q1, CF = cumulative frequency preceding class containing Q1, f = frequency of class containing Q1, i = size of class containing Q1.

Third Quartile (for grouped data)
• The Third Quartile is the value corresponding to the point below which 75% of the observations lie in an ordered data set:

$Q_3 = L + \frac{\frac{3n}{4} - CF}{f} (i)$

• where n = total number of observations, L = lower limit of the class containing Q3, CF = cumulative frequency preceding class containing Q3, f = frequency of class containing Q3, i = size of class containing Q3.
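The grouped-data quartiles can be sketched with the standard interpolation formula Q = L + ((k·n/4 − CF) / f)·i, with k = 1 for Q1 and k = 3 for Q3, using the L, CF, f and i defined above. In the Python sketch below the function name grouped_quartile and the frequency distribution are hypothetical, made up only to show the arithmetic.

```python
def grouped_quartile(classes, k):
    """k-th quartile (k = 1, 2 or 3) of grouped data.

    classes: list of (lower_limit, upper_limit, frequency), in ascending order.
    """
    n = sum(f for _, _, f in classes)
    if n == 0:
        raise ValueError("no observations")
    target = k * n / 4  # number of observations lying below the quartile
    cf = 0              # cumulative frequency preceding the current class
    for lower, upper, f in classes:
        if cf + f >= target:
            size = upper - lower  # class size i
            return lower + (target - cf) / f * size
        cf += f
    raise ValueError("k must be 1, 2 or 3")


# Hypothetical frequency distribution (NOT from the chapter), n = 20.
classes = [(10, 20, 4), (20, 30, 8), (30, 40, 6), (40, 50, 2)]

q1 = grouped_quartile(classes, 1)
q3 = grouped_quartile(classes, 3)
print(q1, q3)
```

For this made-up distribution, Q1 falls in the 20 up to 30 class: 20 + ((5 − 4) / 8)(10) = 21.25. Q3 falls in the 30 up to 40 class: 30 + ((15 − 12) / 6)(10) = 35.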
