0% found this document useful (0 votes)

34 views26 pages

4 - Stat - Measures of Variation 2021

This document discusses measures of variation in data sets, including range, deviation, variance, and standard deviation. It provides examples of calculating each measure and guidelines for finding population and sample standard deviation. The key points are that standard deviation measures how much values typically differ from the mean, and the empirical rule states that about 68% of values lie within 1 standard deviation of the mean, 95% within 2 standard deviations, and 99.7% within 3 standard deviations.

Uploaded by

Kinan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

34 views26 pages

4 - Stat - Measures of Variation 2021

Uploaded by

Kinan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 26

§Week 4

Measures of Variation

1
Range

The range of a data set is the difference between the maximum and minimum date entries
in the set.
Range = (Maximum data entry) – (Minimum data entry)

Example:
The following data are the closing prices for a certain stock on ten successive
Fridays. Find the range.

Stock 56 56 57 58 61 63 63 67 67 67

The range is 67 – 56 = 11.

2
Deviation
The deviation of an entry x in a population data set is the difference between the entry and
the mean μ of the data set.
Deviation of x = x – μ

Example:
The following data are the closing prices for a Stock Deviation
certain stock on five successive Fridays. Find x x–μ
the deviation of each price. 56 56 – 61 = – 5
58 58 – 61 = – 3
61 61 – 61 = 0
63 63 – 61 = 2
The mean stock price is
67 67 – 61 = 6
μ = 305/5 = 61.
Σx = 305 Σ(x – μ) = 0

3
Variance and Standard Deviation
The population variance of a population data set of N entries is
Population variance = (x  μ )2
2
  .
N
“sigma
squared”

The population standard deviation of a population data set of N entries is the square root
of the population variance.

Population standard deviation =

2 (x  μ )2
   .
N
“sigma”

4
Finding the Population Standard Deviation

Guidelines
In Words In Symbols
1. Find the mean of the population data x
μ 
set. N

2. Find the deviation of each entry. x μ

3. Square each deviation. x  μ
2

4. Add to get the sum of squares. SS x  x  μ 

5. Divide by N to get the population x  μ 

2
2
variance.  
N
6. Find the square root of the variance to
x  μ 
2
get the population standard 
deviation. N

5
Finding the Sample Standard Deviation

Guidelines
In Words In Symbols
1. Find the mean of the sample data set. x
x 
n
2. Find the deviation of each entry.
3. Square each deviation. x x

4. Add to get the sum of squares. x x

SS x  x  x 
2
5. Divide by n – 1 to get the sample
variance. x  x 
2
2
s 
6. Find the square root of the variance to n 1
get the sample standard deviation.
x  x 
2
s 
n 1

6
Finding the Population Standard Deviation
Example:
The following data are the closing prices for a certain stock on five successive Fridays.
The population mean is 61. Find the population standard deviation.

Always positive!

Stock Deviation Squared SS2 = Σ(x – μ)2 = 74

x x–μ (x – μ)2
x  μ
2
56 –5 25 2
  
74
 14.8
58 –3 9 N 5
61 0 0
x  μ
2
63 2 4   14.8  3.8
N 3.85
67 6 36
Σx = 305 Σ(x – μ) = 0 Σ(x – μ)2 = 74
σ  $3.85

7
Interpreting Standard Deviation
When interpreting standard deviation, remember that is a measure of the typical amount an
entry deviates from the mean. The more the entries are spread out, the greater the standard
deviation.

14 14
12 x=4 12 x =4
10 s = 1.18 10 s=0
Frequency

Frequency
8 8
6 6
4 4
2 2
0 0
2 4 6 2 4 6
Data value Data value
8
Empirical Rule (68-95-99.7%)
 Empirical Rule
 For data with a (symmetric) bell-shaped distribution, the standard
deviation has the following characteristics.

1. About 68% of the data lie within one standard deviation of the
mean.
2. About 95% of the data lie within two standard deviations of the
mean.
3. About 99.7% of the data lie within three standard deviation of
the mean.

9
Empirical Rule (68-95-99.7%)
99.7% within 3
standard deviations

95% within 2 standard

deviations

68% within 1
standard
deviation

34% 34%
2.35% 2.35%
13.5% 13.5%

–4 –3 –2 –1 0 1 2 3 4

10
Using the Empirical Rule
 Example:
 The mean value of homes on a street is $125 thousand with a
standard deviation of $5 thousand. The data set has a bell shaped
distribution. Estimate the percent of homes between $120 and
$130 thousand.
68%

105 110 115 120 125 130 135 140 145

μ–σ μ μ+σ
68% of the houses have a value between $120 and $130 thousand.
11
Chebychev’s Theorem
The Empirical Rule is only used for symmetric
distributions.

Chebychev’s Theorem can be used for any distribution,

regardless of the shape.

12
Chebychev’s Theorem
 The portion of any data set lying within k standard
deviations (k > 1) of the mean is at least

1  12 .
k
1  1  1 1  3 ,
For k = 2: In any data set, at least 22 4 or475%, of the data lie within 2 standard
deviations of the mean.

1  1 1 1  8 ,
For k = 3: In any data set, at least 32 9 or9 88.9%, of the data lie within 3 standard
deviations of the mean.

13
Using Chebychev’s Theorem

Example:
The mean time in a women’s 400-meter dash is 52.4
seconds with a standard deviation of 2.2 sec. At least 75%
of the women’s times will fall between what two values?
2 standard deviations

45.8 48 50.2 52.4 54.6 56.8 59

At least 75% of the women’s 400-meter dash times will fall between 48 and 56.8
seconds.

14
Standard Deviation for Grouped Data
(x  x )2 f
Sample standard deviation = s 
n 1
where n = Σf is the number of entries in the data set, and x is the data value or the
midpoint of an interval.

Example:
The following frequency distribution represents the ages of 30 students in a statistics
class. The mean age of the students is 30.3 years. Find the standard deviation of the
frequency distribution.

Continued.
15
Standard Deviation for Grouped Data
The mean age of the students is 30.3 years.

Class x f x–x (x – x )2 (x – x )2f

18 – 25 21.5 13 – 8.8 77.44 1006.72
26 – 33 29.5 8 – 0.8 0.64 5.12
34 – 41 37.5 4 7.2 51.84 207.36
42 – 49 45.5 3 15.2 231.04 693.12
50 – 57 53.5 2 23.2 538.24 1076.48
n = 30
  2988.80

(x  x )2 f 2988.8
s    103.06  10.2
n 1 29
The standard deviation of the ages is 10.2 years.

16
§ 2.5
Measures of Position

17
Quartiles
The three quartiles, Q1, Q2, and Q3, approximately divide an ordered data set into four
equal parts.

Median

Q1 Q2 Q3

0 25 50 75 100

Q1 is the median of the data below Q3 is the median of the data

Q2. above Q2.

18
Finding Quartiles
Example:
The quiz scores for 15 students is listed below. Find the first, second and third quartiles
of the scores.

28 43 48 51 43 30 55 44 48 33 45 37 37 42 38

Order the data.

Lower half Upper half

28 30 33 37 37 38 42 43 43 44 45 48 48 51 55

Q1 Q2 Q3

About one fourth of the students scores 37 or less; about one half score
43 or less; and about three fourths score 48 or less.
19
Interquartile Range
The interquartile range (IQR) of a data set is the difference between the third and first
quartiles.
Interquartile range (IQR) = Q3 – Q1.

Example:
The quartiles for 15 quiz scores are listed below. Find the interquartile range.

Q1 = 37 Q2 = 43 Q3 = 48

(IQR) = Q3 – Q1 The quiz scores in the middle portion of the data

set vary by at most 11 points.
= 48 – 37
= 11

20
Box and Whisker Plot
A box-and-whisker plot is an exploratory data analysis tool that highlights the important
features of a data set.

The five-number summary is used to draw the graph.

• The minimum entry
• Q1
• Q2 (median)
• Q3
• The maximum entry
Example:
Use the data from the 15 quiz scores to draw a box-and-whisker plot.

28 30 33 37 37 38 42 43 43 44 45 48 48 51 55
Continued.
21
Box and Whisker Plot
Five-number summary
• The minimum entry 28
• Q1 37

• Q2 (median) 43
48
• Q3
55
• The maximum entry
Quiz Scores

28 37 43 48 55

28 32 36 40 44 48 52 56
22
Percentiles and Deciles
Fractiles are numbers that partition, or divide, an ordered data set.

Percentiles divide an ordered data set into 100 parts. There are 99 percentiles: P1, P2,
P3…P99.

Deciles divide an ordered data set into 10 parts. There are 9 deciles: D1, D2, D3…D9.

A test score at the 80th percentile (P80), indicates that the test score is greater than 80% of
all other test scores and less than or equal to 20% of the scores.

23
Standard Scores
The standard score or z-score, represents the number of standard deviations that a data
value, x, falls from the mean, μ.

va lu e  m ea n x 
z  
st a n da r d devia t ion 
Example:
The test scores for all statistics finals at Union College have a mean of 78 and standard
deviation of 7. Find the z-score for
a.) a test score of 85,
b.) a test score of 70,
c.) a test score of 78.

Continued.
24
Standard Scores
Example continued:

a.) μ = 78, σ = 7, x = 85

x   85  78
z   1.0
  7
This score is 1 standard deviation higher
than the mean.

b.) μ = 78, σ = 7, x = 70

x   70  78
z 
  7  1.14 lower than the mean.
This score is 1.14 standard deviations

c.) μ = 78, σ = 7, x = 78

x   78  78
z  0
  7
This score is the same as the mean.

25
Example:
Relative Z-Scores
John received a 75 on a test whose class mean was 73.2 with a standard deviation of 4.5.
Samantha received a 68.6 on a test whose class mean was 65 with a standard deviation of
3.9. Which student had the better test score?

John’s z-score Samantha’s z-score

x   75  73.2 x   68.6  65
z   z  
 4.5  3.9
 0.4  0.92
John’s score was 0.4 standard deviations higher than the mean, while Samantha’s
score was 0.92 standard deviations higher than the mean. Samantha’s test score was
better than John’s.

Biostatistics & Research Methodology Mcqs 2024
100% (1)
Biostatistics & Research Methodology Mcqs 2024
35 pages
Certiprof Lean Six Sigma White Belt Professional Certification Exam Answers
100% (4)
Certiprof Lean Six Sigma White Belt Professional Certification Exam Answers
13 pages
Assessment of Learning PPT 201012014906
No ratings yet
Assessment of Learning PPT 201012014906
54 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
59 pages
Statistics For Data Science
100% (1)
Statistics For Data Science
27 pages
BRE0009766 Qtegra Algorithms For ICP-OES Rev C
No ratings yet
BRE0009766 Qtegra Algorithms For ICP-OES Rev C
46 pages
Statistics 84
No ratings yet
Statistics 84
4 pages
Lesson 4: Statistics/Data Management Unit 1 - Measures of Central Tendency
No ratings yet
Lesson 4: Statistics/Data Management Unit 1 - Measures of Central Tendency
26 pages
Astm D5
No ratings yet
Astm D5
4 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
22 pages
BUAD 311 Final FA17 Solution
0% (1)
BUAD 311 Final FA17 Solution
9 pages
MMW PPT Weeks 9 12
No ratings yet
MMW PPT Weeks 9 12
31 pages
StatProb11 Q3 Mod1 Random-Variable Version2
No ratings yet
StatProb11 Q3 Mod1 Random-Variable Version2
36 pages
Working Platforms To BRE or Not To BRE Is The Question Mar 16 AG PDF
No ratings yet
Working Platforms To BRE or Not To BRE Is The Question Mar 16 AG PDF
11 pages
Excel For Statistical Data Analysis
No ratings yet
Excel For Statistical Data Analysis
46 pages
Chapter 3
No ratings yet
Chapter 3
17 pages
Ken Black QA ch03
0% (1)
Ken Black QA ch03
61 pages
DDDDDD 2
No ratings yet
DDDDDD 2
5 pages
Continuous Probability Distributions
No ratings yet
Continuous Probability Distributions
59 pages
Unit Five
No ratings yet
Unit Five
23 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
11 pages
CH 5.2 Measure of Dispersion
No ratings yet
CH 5.2 Measure of Dispersion
16 pages
Standard Deviation
No ratings yet
Standard Deviation
37 pages
Ch. 3-2
No ratings yet
Ch. 3-2
42 pages
Biostat Ch-5
No ratings yet
Biostat Ch-5
58 pages
Lecture III-Measures of Dispersion
No ratings yet
Lecture III-Measures of Dispersion
33 pages
Range, SD, QD, Variance
No ratings yet
Range, SD, QD, Variance
14 pages
Sample Final Examination Attempt Review
No ratings yet
Sample Final Examination Attempt Review
19 pages
Mmw-Chapter 1docx-Pdf-Free
No ratings yet
Mmw-Chapter 1docx-Pdf-Free
5 pages
Lecture4 Slides
No ratings yet
Lecture4 Slides
22 pages
Statistics Lec4
No ratings yet
Statistics Lec4
36 pages
B. Data Management
No ratings yet
B. Data Management
61 pages
Unit3 and Unit4 Problem Set
No ratings yet
Unit3 and Unit4 Problem Set
20 pages
Activity 4 - Measures of Dispersion
No ratings yet
Activity 4 - Measures of Dispersion
3 pages
Lecture 3
No ratings yet
Lecture 3
10 pages
G7 Math Q4-Week 7 - Ungrouped Data of Measures of Variability
No ratings yet
G7 Math Q4-Week 7 - Ungrouped Data of Measures of Variability
11 pages
Week 6 Lec and Act
No ratings yet
Week 6 Lec and Act
8 pages
GCE As Level Representation of Dbxbbcata Measures of Central Tendency and Variation
No ratings yet
GCE As Level Representation of Dbxbbcata Measures of Central Tendency and Variation
9 pages
3-Measures of Dispersion
No ratings yet
3-Measures of Dispersion
33 pages
WP Method Collection A4 EN
No ratings yet
WP Method Collection A4 EN
3 pages
Stat 1101 4 7
No ratings yet
Stat 1101 4 7
18 pages
Super 25 Question Bank SMM
No ratings yet
Super 25 Question Bank SMM
3 pages
Research and Practice in Human Resource Management
No ratings yet
Research and Practice in Human Resource Management
12 pages
Lesson 7 6 Answer Key AP Stats Math Medic 0798d9b3ba
No ratings yet
Lesson 7 6 Answer Key AP Stats Math Medic 0798d9b3ba
2 pages
Reading The Candlesticks An OK Estimator For Volatility Jia LI
No ratings yet
Reading The Candlesticks An OK Estimator For Volatility Jia LI
32 pages
FIN 644 Lecture 2
No ratings yet
FIN 644 Lecture 2
55 pages
Cse QB Unit5
No ratings yet
Cse QB Unit5
6 pages
Chapter 6 Processing and Analysis of Data
No ratings yet
Chapter 6 Processing and Analysis of Data
30 pages
FDSA Unit 2
No ratings yet
FDSA Unit 2
44 pages
Full Report - FM
No ratings yet
Full Report - FM
22 pages
Statistics Midterms Notes
No ratings yet
Statistics Midterms Notes
8 pages
1 - 3 Descriptive Measures
No ratings yet
1 - 3 Descriptive Measures
33 pages
Applied Statistics: Normal Distribution
No ratings yet
Applied Statistics: Normal Distribution
13 pages
GE MODMAT Unit 4 Statistics 1
No ratings yet
GE MODMAT Unit 4 Statistics 1
14 pages
Lecture3 Slides
No ratings yet
Lecture3 Slides
20 pages
Measures of Dispersion Tendency
No ratings yet
Measures of Dispersion Tendency
7 pages
AP ECON 2500 Session 2
No ratings yet
AP ECON 2500 Session 2
22 pages
Group 7 MMW Reporting
No ratings yet
Group 7 MMW Reporting
28 pages
Lecture 3 Numerical Measures of Data
No ratings yet
Lecture 3 Numerical Measures of Data
36 pages
4 - Stat - Measures of Variation 2024
No ratings yet
4 - Stat - Measures of Variation 2024
27 pages
Properties of The Normal Distribution
No ratings yet
Properties of The Normal Distribution
16 pages
Chapter 4 QD, SD, Empirical Rule
No ratings yet
Chapter 4 QD, SD, Empirical Rule
25 pages
Part 2-Chapter 3 - Describing Data - Edit
No ratings yet
Part 2-Chapter 3 - Describing Data - Edit
46 pages
Describing Data: Measure Sample Population Mean 'X Stand. Dev. Variance Size
No ratings yet
Describing Data: Measure Sample Population Mean 'X Stand. Dev. Variance Size
10 pages
Chapter 9 - Sampling Distribution
No ratings yet
Chapter 9 - Sampling Distribution
9 pages
Data Management
No ratings yet
Data Management
50 pages
Assignment 3 ISI MBB9 (Corrected)
No ratings yet
Assignment 3 ISI MBB9 (Corrected)
23 pages
M-Learning Exploring Mobile Technologies For Second and Prim School Science Inquiry
No ratings yet
M-Learning Exploring Mobile Technologies For Second and Prim School Science Inquiry
7 pages
SB Mid Term
No ratings yet
SB Mid Term
83 pages
3 - Descriptive Stat
No ratings yet
3 - Descriptive Stat
70 pages
Chapter 6 Lecture 3 of 3
No ratings yet
Chapter 6 Lecture 3 of 3
30 pages
Statistics Lab 10-4
No ratings yet
Statistics Lab 10-4
11 pages
Numerical Measures HANDOUT With Answers
No ratings yet
Numerical Measures HANDOUT With Answers
8 pages
History Reporting
No ratings yet
History Reporting
61 pages
Percentile - Percentile Divide A Data Set Into 100 Equal Parts Having One
No ratings yet
Percentile - Percentile Divide A Data Set Into 100 Equal Parts Having One
9 pages
Decriptive Part 3
No ratings yet
Decriptive Part 3
32 pages
Representation of Data - 1.1.4
No ratings yet
Representation of Data - 1.1.4
6 pages
6927
No ratings yet
6927
12 pages
Lecture of BIOSTATISTICS 12.2022 RMDC
No ratings yet
Lecture of BIOSTATISTICS 12.2022 RMDC
85 pages
Descriptive Stat
No ratings yet
Descriptive Stat
13 pages
Probability Concepts and Applications
No ratings yet
Probability Concepts and Applications
110 pages
QBM 101 Business Statistics: Department of Business Studies Faculty of Business, Economics & Accounting HE LP University
No ratings yet
QBM 101 Business Statistics: Department of Business Studies Faculty of Business, Economics & Accounting HE LP University
62 pages
3) S1 Representation and Summary of Data - Dispersion
No ratings yet
3) S1 Representation and Summary of Data - Dispersion
27 pages
Descriptive Statistics 1
No ratings yet
Descriptive Statistics 1
63 pages
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
No ratings yet
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
4 pages
Discriptive Statistics
No ratings yet
Discriptive Statistics
50 pages
Measures of Variation: Range Variance and Standard Deviation (Ungrouped Data)
No ratings yet
Measures of Variation: Range Variance and Standard Deviation (Ungrouped Data)
24 pages
Standard and Quartile Deviation
No ratings yet
Standard and Quartile Deviation
7 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

4 - Stat - Measures of Variation 2021

Uploaded by

4 - Stat - Measures of Variation 2021

Uploaded by

§Week 4

The range is 67 – 56 = 11.

Population standard deviation =

2. Find the deviation of each entry. x μ

4. Add to get the sum of squares. SS x  x  μ 

5. Divide by N to get the population x  μ 

4. Add to get the sum of squares. x x

Stock Deviation Squared SS2 = Σ(x – μ)2 = 74

95% within 2 standard

105 110 115 120 125 130 135 140 145

Chebychev’s Theorem can be used for any distribution,

45.8 48 50.2 52.4 54.6 56.8 59

Class x f x–x (x – x )2 (x – x )2f

Q1 is the median of the data below Q3 is the median of the data

Order the data.

Lower half Upper half

(IQR) = Q3 – Q1 The quiz scores in the middle portion of the data

The five-number summary is used to draw the graph.

John’s z-score Samantha’s z-score

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.