0% found this document useful (0 votes)
2 views

lecture-5-14102024-102018pm

Uploaded by

Qaiser Hafeez
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views

lecture-5-14102024-102018pm

Uploaded by

Qaiser Hafeez
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 45

Measures of Variation

• For description of data, we need to know, how much is the spread


of our data
• OBJECTIVE 2
Describe data, using measures of variation, such as the range,
variance, and standard
deviation.
EXAMPLE 3–15 Comparison of Outdoor Paint
• A testing lab wishes to test two experimental brands of outdoor
paint to see how long each will last before fading. The testing lab
makes 6 gallons of each paint to test. Since different chemical
agents are added to each group and only six cans are involved,
these two groups constitute two small populations. The results (in
months) are shown. Find the mean of each group.
EXAMPLE: Comparison of Outdoor Paint
• A testing lab wishes to test two experimental brands of outdoor
paint to see how long each will last before fading. The testing lab
makes 6 gallons of each paint to test. Since different chemical
agents are added to each group and only six cans are involved,
these two groups constitute two small populations. The results (in
months) are shown. Find the mean of each group.
• Since the means are equal, you might conclude that both brands of paint last
equally well. However, when the data sets are examined graphically, a somewhat
different conclusion might be drawn.

• Even though the means are the same for both brands, the spread, or variation, is
quite different.
• Brand B performs more consistently; it is less variable.
Measures of Dispersion

• For the spread or variability of a data set, three measures are


commonly used:
• range,
• variance, and
• standard deviation.
Figure
Five starting players on two basketball teams
Figure
Shortest and tallest starting players on the teams
Definition

Range of a Data Set

The range of a data set is given by the formula

Range = Max – Min,

where Max and Min denote the maximum and minimum


observations, respectively.
Range

• Comparison of Outdoor Paint

• For brand A, the range is

• For brand B, the range is

• Make sure the range is given as a single number.

• The range for brand A shows that 50 months separate the largest data value from the smallest data value.
• For brand B, 20 months separate the largest data value from the smallest data value, which is less than one-half of
brand A’s range.
EXAMPLE 3–17 Internet Connections
• The number of cable modem connections in millions for a
select 6-year period are shown. Find the range.

55, 38, 14, 46, 60, 51

SOLUTION
EXAMPLE 3–17 Internet Connections
• The number of cable modem connections in millions for a
select 6-year period are shown. Find the range.

55, 38, 14, 46, 60, 51

SOLUTION
Range R = 60 - 14 = 46 million connections
Variation
• Data variation refers to the difference or distance each data value is from the mean. This difference
or distance is called a deviation.
• In the outdoor paint example, the mean for brand A paint is μ = 35 months, and say a specific can
lasted for 50 months, the deviation is X – μ or 50 - 35 = 15. Hence, the deviation for that data value
is 15 months.
• The sum of the deviations for all data values about the mean (without rounding) will always be
zero. That is, Σ(X - μ) = 0.

• So, we sum the squares, Σ(X - μ)2 and find the mean of these squares by dividing by N (the total
number of data values), symbolically Σ(X - μ)2/N. This measure is called the population variance
and is symbolized by σ2, where σ is the symbol for Greek lowercase letter sigma. Since this
measure (σ2) is in square units and the data are in regular units, statisticians take the square root of
the variance and call it the standard deviation.
Sample Variance and Standard Deviation

Definition
Sample Variance and Standard Deviation

Definition
Key Fact

Variation and the Standard Deviation

The more variation that there is in a data set, the larger is its
standard deviation.
Formula
EXAMPLE 3–20 Air Pollution
• The sulfur dioxide content of air pollution in millions of tons of five randomly selected cities is shown. Find
the variance and standard deviation for the data.

11 90 33 49 27
EXAMPLE 3–20 Air Pollution
• The sulfur dioxide content of air pollution in millions of tons of five randomly selected cities is shown. Find
the variance and standard deviation for the data.

11 90 33 49 27
European Auto
Sales

• Find the sample variance and


standard deviation for the
amount of European auto sales
for a sample of 6 years shown.
The data are in millions of
dollars.
11.2, 11.9, 12.0, 12.8, 13.4, 14.3
Variance and Standard Deviation for Grouped
Data
• Find the variance and the standard deviation for the frequency distribution of the data representing the number
of miles that 20 runners ran during one week.


Variance and Standard Deviation for Grouped
Data
• Find the variance and the standard deviation for the frequency distribution of the data representing the number
of miles that 20 runners ran during one week.


Variance and Standard Deviation for Grouped
Data
• Find the variance and the standard deviation for the frequency distribution of the data representing the number
of miles that 20 runners ran during one week.


steps for finding the variance and standard
deviation for grouped data
Uses of the Variance and Standard Deviation
1. To determine the spread of the data: If the variance or standard deviation is large, the data are
more
dispersed. This information is useful in comparing two (or more) data sets to determine which is more
(most) variable.

2. To determine the consistency of a variable: If the variance or standard deviation is large, the data
are less consistent. For example, in the manufacture of fittings, such as nuts and bolts, the variation in
the diameters must be small, or the parts will not fit together.

3. To determine the shape of the distribution: The variance and standard deviations are used to tell
the number of data values that fall within a specified interval in a distribution. For example,
Chebyshev’s theorem shows that, for any distribution, at least 75% of the data values will fall
within 2 standard deviations of the mean.

4. The variance and standard deviation are used in inferential statistics.


Coefficient of Variation
• Whenever two samples have the same units of measure, the variance and standard deviation for
each can be compared directly.
• For example, suppose an automobile dealer wanted to compare the standard deviation of miles
driven for the cars she received as trade-ins on new cars. She found that for a specific year, the
standard deviation for Buicks was 422 miles and the standard deviation for Cadillacs was 350
miles. She could say that the variation in mileage was greater in the Buicks.
• But what if a manager wanted to compare the standard deviations of two different variables, such
as the number of sales per salesperson over a 3-month period and the commissions made by these
salespeople?
• A statistic to compare standard deviations when the units are different is called the coefficient of
variation.
Section
Shape of Distribution
Tables
Data sets that have different variation

Means and standard deviations of the data sets are


Figures of the data sets
Key Fact

Three-Standard-Deviations Rule

Almost all the observations in any data set lie within three
standard deviations to either side of the mean.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy