Unit 4: Descriptive Statistics
Unit 4: Descriptive Statistics
Unit 4: Descriptive Statistics
Introduction
In the previous units, it has been discussed that one of the nature of statistics is
descriptive statistics along with inferential statistics. Descriptive statistics refers to
the field of statistics that includes the methods of collecting, classifying, graphing,
and averaging data with the objective of simply describing the properties or
characteristics of the data on hand. Thus, the task of the statistician in this area is
simply to select a few procedures, do some averaging, and eventually be able to
identify significant features of the given data. This unit will focus on the
commonly used descriptive statistics such as measures of central tendency,
measures of position, measures of variation, and measures of skewness. Both
manual computation and use of digital technology in solving will be introduced
on this unit.
Learning Outcomes
At the end of this unit, you are expected to:
1. Determine when to use the appropriate measure of central tendency,
position, variability, and skewness.
2. Compute and interpret the mean, median, mode, quartile, decile,
percentile, range, quartile deviation, mean deviation, variance, standard
deviation, and skewness of a given set of data.
3. Use digital technology in solving measure of central tendency, position,
variation, and skewness.
1
Unit 4: Descriptive Statistics
2
Unit 4: Descriptive Statistics
Learning Objectives
At the end of the lesson, you are expected to:
1. Calculate the measures of central tendency such as mean, median, and
mode.
2. Provide a sound interpretation of these measures.
3. Discuss the properties of these measures.
Presentation of Content
Measures of Central Tendency
A measure of central tendency is a value at the center or middle of a data
set.
‘The Average’
There are three common measures of central tendency: mean, median, and
mode.
Mean
The most reliable and the most sensitive measure of central tendency.
It is the most widely used measure.
It is commonly known as the “average” although the median and the mode
are also known as averages.
Mean for Ungrouped Data
It comes into 2 different forms: 1) Simple Mean and 2) Weighted Mean
Simple Mean
The simple mean is obtained by adding all the values/ observations of a certain
variable and divide the sum by the total number of values, cases or observations.
Formula:
𝒔𝒖𝒎 𝒐𝒇 𝒂𝒍𝒍 𝒕𝒉𝒆 𝒗𝒂𝒍𝒖𝒆𝒔 𝒊𝒏 𝒕𝒉𝒆 𝒅𝒊𝒔𝒕𝒓𝒊𝒃𝒖𝒕𝒊𝒐𝒏
̅=
𝒙
𝒏𝒖𝒎𝒃𝒆𝒓 𝒐𝒇 𝒗𝒂𝒍𝒖𝒆𝒔 𝒊𝒏 𝒕𝒉𝒆 𝒅𝒊𝒔𝒕𝒓𝒊𝒃𝒖𝒕𝒊𝒐𝒏
∑𝒙
̅=
𝒙 𝒏
3
Unit 4: Descriptive Statistics
Example: The scores of 15 students in a 25-item test are 15, 18, 17, 16, 19, 21,
18, 23, 24, 18, 16, 17, 20, 21, and 19. Determine the mean score of the
students.
Solution:
∑𝑥
𝑥̅ =
𝑛
15 + 18 + 17 + 16 + 19 + 21 + 18 + 23 + 24 + 18 + 16 + 17 + 20 + 21 + 19
𝑥̅ =
15
̅ = 𝟏𝟖. 𝟖𝟎
𝒙
The performance of the class/group (15 students) in the 25-item test can be
represented by their mean score which is 18.80.
Weighted Mean
Formula:
∑ 𝑥𝑤
𝑥̅ =
∑𝑤
where
x = represents the item value
w = represents the weight associated to x
Example: The following represents the final grades obtained by a CTEd
student on one summer term:
Chemistry (3 units) ---------88
Statistics (5 units) ---------93
NSTP (2 units) ---------89
Find the weighted average of the student.
4
Unit 4: Descriptive Statistics
Solution:
∑ 𝑥𝑤
𝑥̅ =
∑𝑤
93(5) + 88(3) + 89(2)
𝑥̅ =
10
̅ = 𝟗𝟎. 𝟕
𝒙
The student’s performance for the summer term can be represented by the
average/mean of his grades in the three subjects which is 90.7.
Solution:
∑ 𝑥𝑤
𝑥̅ =
∑𝑤
5(17) + 4(11) + 3(9) + 2(12) + 1(1)
𝑥̅ =
50
̅ = 𝟑. 𝟔𝟐 (Strongly Agree)
𝒙
5
Unit 4: Descriptive Statistics
Solution:
Step 1. Get the midpoint for each class. The midpoints are shown in the 3 rd
column.
6
Unit 4: Descriptive Statistics
Step 2. Multiply each midpoint by its corresponding frequency. The products are
shown in the 4th column.
Step 4. Divide the result in step 3 by the sample size. The result is the mean of the
distribution. Hence,
∑ 𝑓𝑥
𝑥̅ =
𝑛
3,174
𝑥̅ =
60
̅ = 𝟓𝟐. 𝟗𝟎
𝒙
7
Unit 4: Descriptive Statistics
Formula:
𝑥̃ = 𝑥(𝑛+1) 𝑖𝑓 𝑛 𝑖𝑠 𝑜𝑑𝑑
2
𝑥(𝑛)+𝑥(𝑛+1)
2 2
𝑥̃ = 𝑖𝑓 𝑛 𝑖𝑠 𝑒𝑣𝑒𝑛
2
Example: The following are the daily wages of seven (7) employees of a
certain food chain in Tuguegarao City:
Php108, Php120, Php154, Php118, Php125, Php164, Php135
Determine the median wage of these employees.
Solution:
The value of n=7. Arranging the values in terms of magnitude and using
𝑥̃ = 𝑥(𝑛+1)
2
8
Unit 4: Descriptive Statistics
𝑚𝑒𝑑𝑖𝑎𝑛 = 𝑥4
Php108, Php118, Php120, Php125, Php135, Php154, Php164
̃ = 𝟏𝟐𝟓
𝒙
Example: The following values are the number of students of the first 8 classes
in a certain college taken for inspection:
21, 25, 26, 30, 36, 39, 42, 55
Determine the median.
Solution:
The values are already arranged in terms of magnitude. Since n=8 and is even,
then we shall use
𝑥(𝑛)+𝑥(𝑛+1)
2 2
𝑥̃ =
2
21, 25, 26, 30, 36, 39, 42, 55
𝑥(8)+𝑥(8+1)
2 2
𝑥̃ =
2
𝑥4+𝑥5
𝑥̃ =
2
30 + 36
𝑥̃ =
2
̃ = 𝟑𝟑
𝒙
9
Unit 4: Descriptive Statistics
where
𝑥𝑙𝑏 ------------lower boundary of the median class
𝑛--------------sample size
𝑐𝑢𝑚𝑏𝑓𝑏 ------cumulative frequency before the median class
𝑓𝑚 ------------the frequency of the median class
𝑐---------------class width
10
Unit 4: Descriptive Statistics
The median class refers to the interval that contains the median, that is,
𝑛 𝑡ℎ
where (2 ) value is located.
𝑛
Among the entries under the < 𝑐𝑢𝑚𝑓 column which are greater than 2 , the
smallest shall be the frequency of the median class.
If a distribution contains an interval where the cumulative frequency is
𝑛
exactly 2 , then the upper boundary of that class will be the median and no
interpolation is needed.
2. 𝑐𝑢𝑚𝑏𝑓𝑏 = 19
3. median class: 47 − 58
̃ = 𝟓𝟑. 𝟒𝟓
𝒙
11
Unit 4: Descriptive Statistics
---------------------------------------------------------------------------------------------------
Mode
The most favorite score.
The score having the highest frequency.
The most frequently occurring score.
The least reliable measure of position.
12
Unit 4: Descriptive Statistics
13
Unit 4: Descriptive Statistics
14
Unit 4: Descriptive Statistics
---------------------------------------------------------------------------------------------------
Comparison of Mean, Median, and Mode
Mean
The mean always exist in any distribution. This implies that for any set of
data, the mean can always be computed.
The value of the mean in any distribution is unique. This implies that for
any distribution, there is only one possible value of the mean.
In the computation for this measure, it takes into consideration all the
values in the distribution.
Median
Like the mean, the median also exists in any distribution.
The value of the median is also unique.
This is a positional measure.
Mode
It does not always exist.
If the mode exists, it is not always unique.
In determining the value of the mode, it does not take into account all the
values in the distribution.
15
Unit 4: Descriptive Statistics
Application
Solve the given problem below. Show all pertinent solutions on the box provided.
The rate per hour (in pesos) of the 10 employees of a certain company were
taken and are shown below.
44.17, 44.17, 38, 39.25, 18, 15, 57.17, 65.25, 44.17, 39.5
Solution box
Solution box
Solution box
16
Unit 4: Descriptive Statistics
4. If the value 39.25 was erroneously written whose actual value is 49.25,
then what measure of central tendency will be affected? Support your
answer.
Solution/Reasoning box
The ages of 210 qualified voters in a certain barangay were taken and are shown
below. Compute the value of the mean, median, and mode. Show all your pertinent
solutions on the box provided.
17
Unit 4: Descriptive Statistics
Feedback
Solve the following problems. Show all pertinent solutions on the box provided.
The NCEE scores of 12 students in a certain college were taken and are shown
below.
93, 65, 87, 56, 99, 76, 58, 87, 76, 93, 68, 69
Solution box
18
Unit 4: Descriptive Statistics
Solution box
Solution box
The daily mean sales of ABC Store for the month of November was computed
at Php 5, 386.65. When the figures were reviewed, it was found out that on
November 21, the actual sales was Php 6, 389 and was erroneously written Php
3, 689. Assuming that the store was open for 20 days for the month of
November, then determine the store’s daily mean sales for November.
Solution box
19
Unit 4: Descriptive Statistics
The results of an IQ
test of a group of
students in a certain
college were taken and
are presented in a
frequency distribution.
20
Unit 4: Descriptive Statistics
Learning Objectives
At the end of the lesson, you are expected to:
1. Calculate the measures of variation such as range, semi inter-quartile
range or quartile deviation, mean deviation, variance, and the standard
deviation.
2. Provide a sound interpretation of these measures.
3. Use a digital technology in calculating measures of variation.
Presentation of Content
Measures of Variation
They have the same mean (C+), yet they shaped differently.
21
Unit 4: Descriptive Statistics
How to interpret?
The lesser the value of the measure, the more consistent, the more
homogeneous and the less scattered are the observations in the set of data.
Range
The range is the simplest measure of variation to calculate.
It is the difference between the largest and the smallest value in a given
data set.
A much larger range suggest greater variation or dispersion.
where
R------------range value
HO----------highest observation
LO----------lowest observation
Example:
Find the range of the two groups of score distribution.
𝑅𝐴 = 𝐻𝑂 − 𝐿𝑂
𝑅𝐴 = 35 − 10
𝑅𝐴 = 25
𝑅𝐵 = 𝐻𝑂 − 𝐿𝑂
𝑅𝐵 = 30 − 15
𝑅𝐵 = 15
𝑹𝑩 < 𝑹𝑨
The implication of this is that the scores in group B are less scattered than the
scores in group A.
22
Unit 4: Descriptive Statistics
where
R------------range value
𝐻𝑂𝑈𝐵 -------upper boundary of the highest observation
𝐿𝑂𝐿𝐵 ---------lower boundary of the lowest observation
23
Unit 4: Descriptive Statistics
Solution:
𝑅 = 𝐻𝑂𝑈𝐵 − 𝐿𝑂𝐿𝐵
𝑅 = 94.5 − 10.5
𝑹 = 𝟖𝟒
Properties of Range
The value is always affected by extreme values.
In the process of computing the value of the range, not all values are
considered.
The range does not consider the variation of the items relative to the
central value of the distribution.
---------------------------------------------------------------------------------------------------
Semi Inter-Quartile Range or Quartile Deviation
It indicates the distance we need to go above and below the median to include the
middle 50% of the scores. It is based on the range of the middle 50% of the
scores, instead of the range of the entire set.
Semi Inter-Quartile Range or Quartile Deviation for Ungrouped Data
Formula:
𝑸𝟑 − 𝑸 𝟏
𝑸=
𝟐
where
Q------------value of the quartile deviation
𝑄3 ----------value of the 3rd quartile
𝑄1----------value of the 1st quartile
24
Unit 4: Descriptive Statistics
1 1 𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝑄1 = [4 (9) + (1 − 4)]
12 𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝑄1 = [ ]
4
𝑄1 = 3𝑟𝑑 𝑠𝑐𝑜𝑟𝑒
𝑄1 = 10
3 3 𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝑄3 = [ (9) + (1 − )]
4 4
𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
28
𝑄3 = [ ]
4
𝑄3 = 7𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝑄3 = 15
𝑄3 − 𝑄1
𝑄=
2
15 − 10
𝑄=
2
𝑸 = 𝟐. 𝟓
The amount that deviates from the mean value is 2.5.
25
Unit 4: Descriptive Statistics
where
Q------------value of the quartile deviation
𝑄3 ----------value of the 3rd quartile
𝑄1----------value of the 1st quartile
26
Unit 4: Descriptive Statistics
Solution:
𝑛
− 𝑐𝑢𝑚𝑓𝑏
𝑄1 = 𝑥𝑙𝑏 + (4 )𝑐
𝑓𝑄1
100
− 13
𝑄1 = 78.5 + ( 4 )4
13
𝑄1 = 82.19
3𝑛
− 𝑐𝑢𝑚𝑓𝑏
𝑄3 = 𝑥𝑙𝑏 + ( 4 )𝑐
𝑓𝑄3
3(100)
− 69
𝑄3 = 90.5 + ( 4 )4
19
𝑄3 = 91.76
𝑄3 − 𝑄1
𝑄=
2
91.76 − 82.19
𝑄=
2
𝑸 = 𝟒. 𝟕𝟖
The larger the value of the Q, the more dispersed the scores at the middle 50% of
the distribution. On the other hand, if the Q is small, the scores are less dispersed
at the middle 50%of the distribution. The point of dispersion is the median value.
27
Unit 4: Descriptive Statistics
Mean Deviation
Mean Deviation for Ungrouped Data
Formula:
∑|𝒙 − 𝒙
̅|
𝑴𝑫 =
𝒏
where
MD-------------mean deviation value
x----------------individual score
𝑥̅ ----------------sample mean
n---------------number of cases
Steps:
1. Solve the mean value.
2. Subtract the mean value from each score.
3. Take the absolute value of the difference in step 2.
4. Solve the mean deviation using the formula:
∑|𝑥 − 𝑥̅ |
𝑀𝐷 =
𝑛
∑𝑥
𝑥̅ =
𝑛
212
𝑥̅ =
10
𝑥̅ = 21.2
∑|𝑥 − 𝑥̅ |
𝑀𝐷 =
𝑛
60.4
𝑀𝐷 =
10
𝑀𝐷 = 6.04
The mean deviation of the 10 scores of students is 6.04. This means that on the
average, the value deviated from the mean of 21.2 is 6.04.
28
Unit 4: Descriptive Statistics
∑ 𝒇|𝒙 − 𝒙
̅|
𝑴𝑫 =
𝒏
where
MD-------------mean deviation value
x----------------the midpoint of each class
𝑥̅ ----------------the mean of the distribution
n---------------total number of frequency
Steps:
1. Compute the value of the mean.
2. Get the deviation by using the expression 𝑥 − 𝑥̅ .
3. Multiply the deviation by its corresponding frequency.
4. Add the results in step 3.
5. Divide the sum in step 4 by n.
Example:
Step 1. Compute the value of the mean.
∑ 𝑓𝑥 3174
𝑥̅ = = = 𝟓𝟐. 𝟗𝟎
𝑛 60
29
Unit 4: Descriptive Statistics
∑ 𝑓 |𝑥 − 𝑥̅ |
𝑀𝐷 =
𝑛
750.4
𝑀𝐷 =
60
𝑀𝐷 = 𝟏𝟐. 𝟓𝟏
30
Unit 4: Descriptive Statistics
Variance
It is a measure of variability that uses all the data.
It is the average of the squared differences between the observations and the
mean value.
𝟐
̅ )𝟐
∑(𝒙 − 𝒙
𝒔 =
𝒏
where
x----------represents the individual values in the distribution
𝑥̅ ----------the mean of the distribution
n----------the sample size
Steps:
1. Compute the value of the mean.
2. Get the deviation of each value from the mean.
3. Square the deviations.
4. Calculate the sum of the squared deviations.
5. Divide the sum by the total number of values.
Solution:
∑𝑥
𝑥̅ =
𝑛
88
𝑥̅ =
8
𝑥̅ = 11
∑(𝑥 − 𝑥̅ )2
𝑠2 =
𝑛
114
𝑠2 =
8
𝑠 2 = 𝟏𝟒. 𝟐𝟓
31
Unit 4: Descriptive Statistics
𝟐
̅ )𝟐
∑ 𝒇 (𝒙 − 𝒙
𝒔 =
𝒏
where
x----------midpoint of each class interval
𝑥̅ ----------mean
n----------sample size
Steps:
1. Compute the value of the mean.
2. Determine the deviation 𝑥 − 𝑥̅ by subtracting the mean from the midpoint
of each class interval.
3. Square the deviations obtained in step 2.
4. Multiply the frequencies by their corresponding squared deviations.
5. Add the results in step 4.
6. Divide the result in step 5 by the sample size.
32
Unit 4: Descriptive Statistics
Example:
∑ 𝑓𝑥 3174
𝑥̅ = = = 52.90
𝑛 60
∑ 𝑓 (𝑥 − 𝑥̅ )2
𝑠2 =
𝑛
16406.40
𝑠2 =
60
𝑠 2 = 𝟐𝟕𝟑. 𝟒𝟒
---------------------------------------------------------------------------------------------------
Standard Deviation
It is the square root of the variance.
It is the most commonly used measure of variation. It indicates how
closely the values of a given data set are clustered around the mean.
A lower value of the standard deviation means that the values of that given
data set are spread over a smaller range around the mean.
On the other hand, a large value of the standard deviation means that the
values of that data set are spread over a larger range around the mean.
∑(𝒙 − ̅
𝒙 )𝟐
𝒔=√
𝒏
33
Unit 4: Descriptive Statistics
Solution:
∑𝑥
𝑥̅ =
𝑛
88
𝑥̅ =
8
𝑥̅ = 11
∑(𝑥 − 𝑥̅ )2
𝑠=√
𝑛
114
𝑠=√
8
𝑠 = 𝟑. 𝟕𝟕
34
Unit 4: Descriptive Statistics
∑ 𝑓 (𝑥 − 𝑥̅ )2
𝑠=√
𝑛
16406.40
𝑠=√
60
𝑠 = 𝟏𝟔. 𝟓𝟒
---------------------------------------------------------------------------------------------------
Coefficient of Variation
The standard deviation measures absolute variability and not relative variability.
It can only compare two samples that have the same units of measure.
It allows us to compare two different data sets that have different units of
measurement.
This expresses the standard deviation as a percentage of the mean.
The smaller the value of the coefficient variation, the more homogeneous
the scores in a particular group.
The higher the value of the coefficient of variation, the more dispersed the
scores in a particular distribution.
35
Unit 4: Descriptive Statistics
Formula:
𝒔
𝑪𝑽 = ( ) 𝟏𝟎𝟎%
̅
𝒙
where
𝑥̅ --------------mean value
s--------------standard deviation
Example: The average score of the students in one English class is 110, with a
standard deviation of 5; the average score of students in a Mathematics class is
106, with a standard deviation of 4. Which class is more variable in terms of
score?
Solution:
5
English class: 𝐶𝑉 = (110) 100% = 4.55%
4
Math class: 𝐶𝑉 = (106) 100% = 3.77%
The scores in the Math class are less scattered than the scores in the English
class.
The scores in the English class are more spread out than the scores in the Math
class.
36
Unit 4: Descriptive Statistics
Application
Solve the following problems. Show all pertinent solutions inside the box.
Solution box
Solution box
Solution box
37
Unit 4: Descriptive Statistics
38
Unit 4: Descriptive Statistics
39
Unit 4: Descriptive Statistics
Feedback
Solve the following problems. Show all pertinent solutions inside the box.
The following are the hourly rate of 12 employees in a certain fastfood chain in
Tuguegarao City.
Php 26.30, Php 45.25, Php 18.25, Php 13.50, Php 18.60, Php 25.60, Php 55.81,
Php 13.50, Php 13.50, Php 18.25, Php 18.60, Php 25.60
Solution box
Solution box
Solution box
40
Unit 4: Descriptive Statistics
41
Unit 4: Descriptive Statistics
42
Unit 4: Descriptive Statistics
Learning Objectives
At the end of the lesson, you are expected to:
1. Calculate the measures of position (quantiles) such as quartile, decile, and
percentile.
2. Provide a sound interpretation of these measures.
3. Use a digital technology in calculating measures of position (quantiles).
Presentation of Content
Quantiles
It is treated as extensions of the concept on median.
If in median, the value divides the given distribution into two equal parts,
the distribution in quantiles is divided into:
Quartile
Quartiles refer to the values that divide the distribution into four equal
parts. There are three quartiles represented by 𝑄1 , 𝑄2 ,and 𝑄3 .
𝒌 𝒌 𝒏𝒕𝒉 𝒔𝒄𝒐𝒓𝒆
𝑸𝒌 = [ 𝒏 + (𝟏 − )]
𝟒 𝟒
where
k----------1,2,3
n----------number of cases
43
Unit 4: Descriptive Statistics
Solution:
1 1 𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝑄1 = [ (9) + (1 − )]
4 4
12 𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝑄1 = [ ]
4
𝑄1 = 3𝑟𝑑 𝑠𝑐𝑜𝑟𝑒
The value of 𝑄1 is 10 which is the 3rd score in the distribution. Therefore, 25% of
the scores are below 10.
44
Unit 4: Descriptive Statistics
Steps:
To compute the value of 𝑄𝑘 , follow the procedure used in computing the value of
the median.
𝑘𝑛
1. Get .
4
2. Get the value of the cumulative frequency before the kth quartile class.
3. Determine the kth quartile class.
4. Determine the lower boundary of the kth quartile class.
5. Get the frequency of the kth quartile class.
6. Substitute all the values in the formula:
𝑘𝑛
− 𝑐𝑢𝑚𝑓𝑏
𝑄𝑘 = 𝑥𝑙𝑏 + ( 4 )𝑐
𝑓𝑄𝑘
STEPS:
𝑘𝑛 (1)(60)
1. 4
= 4
= 15
2. 𝑐𝑢𝑚𝑏𝑓𝑏 = 8
3. 1st quartile class: 35 − 46
4. 𝑥𝑙𝑏 = 34.5; 𝑓𝑄1 = 11; 𝑐 = 12
𝑘𝑛
−𝑐𝑢𝑚𝑓𝑏
5. 𝑄𝑘 = 𝑥𝑙𝑏 + ( 4 )𝑐
𝑓𝑄𝑘
(1)(60)
−8
𝑄1 = 34.5 + ( 4
) 12
11
𝑄1 = 42.14
Therefore, 25% of the scores of 60 students who took the Statistics test are less
than 42.14.
45
Unit 4: Descriptive Statistics
Decile
Deciles refer to the values that divide the distribution into 10 equal parts.
There are nine deciles represented by 𝐷1 , 𝐷2 , 𝐷3, . . . 𝐷9
𝒌 𝒌 𝒏𝒕𝒉 𝒔𝒄𝒐𝒓𝒆
𝑫𝒌 = [ 𝒏 + (𝟏 − )]
𝟏𝟎 𝟏𝟎
where
k----------1,2,3,4,5,6,7,8,9
n----------number of cases
Solution:
6 6 𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝐷6 = [ (9) + (1 − )]
10 10
58 𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝐷6 = [ ]
10
𝐷6 = 5.8𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
The value of D6 lies within the sum of the 5th score and 80% of the difference of
6th and 5th scores.
Therefore, 60% of the scores in the distribution are less than 13.60
46
Unit 4: Descriptive Statistics
Solution: (𝐷9 )
9 9 𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝐷9 = [ (9) + (1 − )]
10 10
𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
82
𝐷9 =[ ]
10
𝐷9 = 8.2𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝐷9 = 8𝑡ℎ 𝑠𝑐𝑜𝑟𝑒 + .20(9𝑡ℎ 𝑠𝑐𝑜𝑟𝑒 − 8𝑡ℎ 𝑠𝑐𝑜𝑟𝑒)
𝐷9 = 16 + .20(20 − 16)
𝐷9 = 16.80
Therefore, 90% of the scores in the distribution are less than 16.80
Steps:
To compute the value of 𝐷𝑘 , follow the procedure used in computing the value of
the median.
𝑘𝑛
1. Get 10 .
2. Get the value of the cumulative frequency before the kth decile class.
3. Determine the kth decile class.
4. Determine the lower boundary of the kth decile class.
5. Get the frequency of the kth decile class.
6. Substitute all the values in the formula:
𝑘𝑛
− 𝑐𝑢𝑚𝑓𝑏
𝐷𝑘 = 𝑥𝑙𝑏 + (10 )𝑐
𝑓𝐷𝑘
47
Unit 4: Descriptive Statistics
STEPS:
𝑘𝑛 3(60)
1. = = 18
10 10
2. 𝑐𝑢𝑚𝑏𝑓𝑏 = 8
3. 3rd decile class: 35 − 46
4. 𝑥𝑙𝑏 = 34.5; 𝑓𝐷3 = 11; 𝑐 = 12
𝑘𝑛
−𝑐𝑢𝑚𝑓𝑏
5. 𝐷𝑘 = 𝑥𝑙𝑏 + (10 )𝑐
𝑓𝐷𝑘
(3)(60)
−8
10
𝐷3 = 34.5 + ( ) 12
11
𝑫𝟑 = 𝟒𝟓. 𝟒𝟏
Therefore, 30% of the scores of 60 students who took the Statistics test are less
than 45.41.
---------------------------------------------------------------------------------------------------
Percentile
Percentiles refer to the values that divide the distribution into 100 equal
parts. There are 99 percentiles represented by 𝑃1 , 𝑃2 , 𝑃3 , 𝑃4 , 𝑃5 , . . . 𝑃99
k k nth score
Pk = [ n + (1 − )]
100 100
where
k----------1,2,3,4,5, . . . 99
n----------number of cases
48
Unit 4: Descriptive Statistics
Solution:
99 99 𝑛𝑡ℎ 𝑠𝑐𝑜𝑟𝑒
𝑃99 = [ (9) + (1 − )]
100 100
Therefore, 99% of the scores in the distribution are less than 19.68.
49
Unit 4: Descriptive Statistics
Steps:
To compute the value of 𝑃𝑘 , follow the procedure used in computing the value of
the median.
𝑘𝑛
1. Get 100.
2. Get the value of the cumulative frequency before the kth percentile class.
3. Determine the kth percentile class.
4. Determine the lower boundary of the kth percentile class.
5. Get the frequency of the kth percentile class.
6. Substitute all the values in the formula:
𝑘𝑛
− 𝑐𝑢𝑚𝑓𝑏
𝑃𝑘 = 𝑥𝑙𝑏 + (100 )𝑐
𝑓𝑃𝑘
STEPS:
43𝑛 43(60)
1. = = 25.8
100 100
2. 𝑐𝑢𝑚𝑏𝑓𝑏 = 19
3. 43rd percentile class: 47 − 58
4. 𝑥𝑙𝑏 = 46.5; 𝑓𝑃43 = 19; 𝑐 = 12
𝑘𝑛
−𝑐𝑢𝑚𝑓𝑏
5. 𝑃𝑘 = 𝑥𝑙𝑏 + (100 𝑓 )𝑐
𝑃𝑘
(43)(60)
−19
100
𝑃43 = 46.5 + ( ) 12
19
𝑃43 = 50.8
Therefore, 43% of the scores of 60 students who took the Statistics test are less
than 50.8.
50
Unit 4: Descriptive Statistics
Application
Solve the following problems. Show all pertinent solutions inside the box.
Solution box
Solution box
Solution box
51
Unit 4: Descriptive Statistics
The efficiency ratings of 155 faculty members of a certain college were taken and
are shown below.
52
Unit 4: Descriptive Statistics
53
Unit 4: Descriptive Statistics
Feedback
Solve the following problems. Show all pertinent solutions inside the box.
Solution box
Solution box
Solution box
54
Unit 4: Descriptive Statistics
The ages of residents of a certain zone in a barangay were taken and are shown
below.
55
Unit 4: Descriptive Statistics
56
Unit 4: Descriptive Statistics
Summary
A measure of central tendency is a location measure that pinpoints the
center or middle value.
The three common measures of central tendency are the mean, median,
and mode.
Each measure of central tendency has its own properties that serve as basis
in determining when to use it appropriately.
Measure of variation is used to further describe the distribution of the data
set.
Absolute measures of variation include range, semi-interquartile range or
quartile deviation, mean deviation, variance, and standard deviation.
A relative measure of variation is provided by the coefficient of variation.
There are other measures of location that could further describe the
distribution of the data set.
Quartiles, deciles, and percentiles are measures of location that divide the
distribution into 4, 10, and 100 equal parts, respectively.
Reflection
Congratulations! You are done with the fourth unit of this module. Now, go back
to the activities and lessons you have taken in this unit and answer the following
questions. Limit your answers for each question to 5 to 10 sentences only.
1. What made you successful/unsuccessful with this unit of the module?
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
2. Which among the activities on this unit you enjoyed most? Explain.
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
57
Unit 4: Descriptive Statistics
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
____________________________________________________________
References
Albert, J. R. (2008). Basic statistics for the tertiary level (with applications in
excel). Rex Bookstore.
Asaad, Abubakar S. (2008). Statistics made simple for researchers. Quezon City
Rex Printing Company, Inc.
Devore, J. L. (2004). Probability and statistics for engineers and the sciences.(6th
ed.). Belmont, CA Thompson
Gravetter, F. J., & Wallnau, L. B. (2009). Statistics for the behavioural sciences.
California USA Wadsworth, Cengage Learning.
58
Unit 4: Descriptive Statistics
Electronic Sources:
https://www.youtube.com/watch?v=8WdSJhEIrQk
https://www.youtube.com/watch?v=8U7UUHVaVVo
https://www.youtube.com/watch?v=H9ITfdaX2ZQ
https://www.youtube.com/watch?v=7m51Vzndhdo
https://www3.nd.edu/~rwilliam/stats1/x21.pdf
http://www.mathportal.org/calculators/statistics-calculator/normal-distribution-
calculator.php
https://blog.udemy.com/importance-of-statistics/
https://www.youtube.com/watch?v=095BdbOunPU
https://www.youtube.com/watch?v=be9e-Q-jC-0
https://www.youtube.com/watch?v=0zZYBALbZgg
https://www.academia.edu/34527054/Textual_presentation_of_data
59