Measure of Location (Final)
Measure of Location (Final)
Statistics
Muhammad Irfan Malik, Ph.D
School of Social Sciences & Humanities (S3H),
National University of Science and Technology (NUST), Islamabad
Email: irfanmalik@s3h.nust.edu.pk
What are descriptive measures?
Types of descriptive measures
What is measure of central tendency or measure of location?
What are the measures central tendency?
How to find these measures?
What are the advantages and disadvantages of these
measures?
Shape of distribution and relationship between mean, median
and mode
Application on real world data.
Interpretation
Descriptive Measures
Numbers that are used to describe the data set are
called descriptive measures.
Types
Measures of Location (Mean, Median, Mode)
Measures
Arithmetic
Mean Median Mode
Arithmetic Mean
(Mean)
The arithmetic mean (mean) of a data set is obtained by
dividing the sum of values by the number of values. It is usually
denoted by µ (population mean) and (sample mean)
Properties of Mean
It needs not be an element of the collection.
It needs not be an integer even if all the elements of the collection are integers.
It is somewhere between the smallest and largest values in the collection.
It needs not be halfway between the two extremes; in general, it is not true
that half the elements in a collection are above the mean.
If the collection consists of values of a variable measured in specified units,
then the mean has the same units too.
点击添加文本
点击添加文本
Mathematical formula of mean for ungrouped (raw) data
Population Mean
Sample Mean
𝑛
𝑿 𝟏 + 𝑿 𝟐 + 𝑿 𝟑 +…+ 𝑿 𝒏
∑ 𝑿𝒊
𝑿= = 𝑖 =1
𝒏 𝑛
Sample Mean
𝒇 𝑿𝟏+𝒇 𝑿𝟐 + 𝒇 𝑿 𝟑+ … + 𝒇 𝑿𝒌
∑ 𝒇 𝒊 𝑿𝒊
𝟏 𝟐 𝟑 𝒌 𝑖=1
𝑿= =
𝑘
𝑛
𝒏= ∑ 𝒇 𝒊
𝑖 =1
represents the number of classes and represents the mid point of class
How to find mean of grouped data?
What was the reason that the mean of ungrouped data and the mean of grouped data are
different?
The energy consumption of natural gas (in billions of Btu)
by the 50 states and the District of Columbia.
点击添加文本
点击添加文本
Median for ungrouped (raw) data
Then median is
Note:
Mathematical formula of Median for grouped data
𝒉 𝒏
𝑴𝒆𝒅𝒂𝒊𝒏=𝒍 + ( − 𝑪 )
𝒇 𝟐
Where:
is the lower-class boundary of median class
is the width or class interval of median class
frequency of median class
is the total no of observation in the data set or sum of frequency column
is the cumulative frequency of previous class to median class.
How to find median of grouped data?
( )
22 113 42 120 𝒕𝒉
3 105 𝟓𝟎 𝒕𝒉
4 105
23 113 43 120 𝒐𝒃𝒔𝒆𝒓𝒗𝒂𝒕𝒊𝒐𝒏=𝟐𝟓 𝒐𝒃𝒔¿𝟏𝟏𝟒
5 105 24 114 44 120 𝟐
( )
𝟓𝟎 𝒕𝒉
6 106 25 114
45 120
𝟐
8 108
9 109 28 114 47 122
29 115 48 122
10 109
30 116
11 110 49 127 (114+114)=114
31 116
12 110
32 117 50 134
13 110
33 117
14 110
34 117
15 110
35 118
16 111
36 118
17 111
37 118
n=50, which an even number
18 112 38 118
() ( )
𝒕𝒉 𝒕𝒉
19 112 39 118 𝟏 𝒏 𝒏
𝐌𝐞𝐝𝐢𝐚𝐧= [ 𝒐𝒃𝒔𝒆𝒓𝒗𝒂𝒕𝒊𝒐𝒏+ +𝟏 𝒐𝒃𝒔𝒆𝒓𝒗𝒂𝒕𝒊𝒐𝒏]
20 112 40 118 𝟐 𝟐 𝟐
𝒕𝒉
𝟐𝟓 𝒐𝒃𝒔 𝟐𝟔𝒕𝒉 𝒐𝒃𝒔
Median Class
k Classes Frequency Class Boundaries Cf
1 100-104 2 99.5-104.5 2 ¿ 𝟐𝟓 , Condition not satisfied and continue
2 105-109 8 104.5-109.5 10 ¿ 𝟐𝟓 , Condition not satisfied and continue
3 110-114 18 109.5-114.5 28 ≥ 𝟐𝟓 , Condition satisfied and stop
4 115-119 13 114.5-119.5 41
5 120-124 7 119.5-124.5 48 Now
6 125-129 1 124.5-129.5 49 𝒍=𝟏𝟎𝟗 .𝟓 𝐟 =𝟏𝟖 𝒉=𝟏𝟏𝟒 .𝟓−𝟏𝟎𝟗.𝟓=𝟓 𝐂=𝟏𝟎
7 130-134 1 129.5-134.5 50
Sum 50
( )
𝒕𝒉
𝒏
𝐌𝐞𝐝𝐢𝐚𝐧=
𝒕𝒉
𝒐𝒃𝒔𝒆𝒓𝒗𝒂𝒕𝒊𝒐𝒏=𝟐𝟓 𝑶𝒃𝒔𝒆𝒓𝒗𝒂𝒕𝒊𝒐𝒏 𝟓 𝟓𝟎
𝟐 𝑴𝒆𝒅𝒂𝒊𝒏=𝟏𝟎𝟗 . 𝟓+ ( −𝟏𝟎)
We can not find the 25th observation
𝟏𝟖 𝟐
because the data is grouped. To find 𝟓
median, first, we need to find the class in ¿ 𝟏𝟎𝟗 . 𝟓+ (𝟏𝟓)
which 25th observation fall. To do so, in cf 𝟏𝟖
column from the first cell, find the cell in
which cf is greater than or equal to the 25 4.17
and stop. ¿ 𝟏𝟏𝟑. 𝟔𝟕
Class associated with this cell is median
class
What was the reason that the median of ungrouped data and the median of grouped data are
different?
Ages of the Vice Presidents at the time of their death. The ages at the time
of death of those Vice Presidents of the United States who have passed away
are listed below.
Find the median age at the time of death of the vice presidents
Use the data to construct a frequency distribution. Use 6 classes.
Plot the histogram and comment of the symmetry
Draw the Ogive using percentages instead of frequency and find
out the approximate value of age at death below which 50% of the
ages of presidents fall.
Using the frequency distribution, find the median class
Find the median age at the time of death using frequency
distribution.
Comment on the both values of the median
Mode
The mode is a value that appears most frequently in a data set.
Some important properties of Mode
This is a measure that appear more than once.
The mode is not calculated on all observations in a data set
It is not affected by extreme observations (extreme minimum or extreme
maximum
Sometimes it may not be possible to calculate the mode.
What was the reason that the mean of ungrouped data and the mode of grouped data are
different?
Enrollments for Selected Independent Religiously Controlled 4-
Year Colleges Listed below are the enrollments for selected
independent religiously controlled 4-year colleges that offer
bachelor’s degrees only.
Mathematical formula
∑ 𝒙 = 𝟐𝟏𝟔 =𝟒𝟑 . 𝟐 𝒙 𝒘=
∑ 𝒙 𝒘 = 𝟔𝟑𝟔𝟗 =𝟔𝟑 .𝟔𝟗
𝒙=
𝒏 𝟓 ∑ 𝒘 𝟏𝟎𝟎
∑ 𝒙 = 𝟐𝟏𝟔 =𝟒𝟑 . 𝟐 𝒙 𝒘=
∑ 𝒙 𝒘 = 𝟔𝟑𝟔𝟗 =𝟔𝟑 .𝟔𝟗
𝒙=
𝒏 𝟓 ∑ 𝒘 𝟏𝟎𝟎
The shapes of distribution and relationship between
Mean, Median and Mode
A frequency distribution is
symmetric when a vertical
line can be drawn through the
middle of a graph of the
distribution and the resulting
halves are approximately mirror
images. ∑ 𝒙 𝟐𝟏𝟔
𝒙= = =𝟒𝟑 . 𝟐
𝒏 𝟓
The shapes of distribution and relationship between
Mean, Median and Mode
A frequency distribution is skewed if the “tail” of the graph elongates more to one
side than to the other. A distribution is skewed left (negatively skewed) if its tail
extends to the left. A distribution is skewed right (positively skewed) if its tail
extends to the right.
The shapes of distribution and relationship between
Mean, Median and Mode