3-Data Collection and Presentation
3-Data Collection and Presentation
PRESENTATION
Learning Outcomes:
⮚ Determine how to organize data
through frequency distribution.
⮚ Get oriented with the different
types of data presentation through
graphs
Methods of Collecting Data
• Direct Observation
• Experiments, and
• Surveys
Methods
• Mailing paper questionnaires to respondents, who
fill them out and mail them back
• Having interviewers call to respondents on the
telephone and ask them the question in a
telephone interview
• Sending the interviewers to the respondent’s
home or office to administer the questions in face-
to-face interviews
• Sending survey questionnaires thru google forms
• Depending upon the nature of the variable
data is classified into 2 broad categories:
✔ Qualitative Data
✔ Quantitative Data
Methods
Quantitative Qualitative Surveys
MCPEGOLLO/BASIC STATISTICS/SRSTHS
Presentation of Data
MCPEGOLLO/BASIC STATISTICS/SRSTHS
Textual Presentation of Data
Data can be presented using
paragraphs or sentences. It involves
enumerating important characteristics,
emphasizing significant figures and
identifying important features of data.
MCPEGOLLO/BASIC STATISTICS/SRSTHS
Textual Presentation of Data
Example. You are asked to present the performance of your section in
the Statistics test. The following are the test scores of your class:
34 42 20 50 17 9 34 43
50 18 35 43 50 23 23 35
37 38 38 39 39 38 38 39
24 29 25 26 28 27 44 44
49 48 46 45 45 46 45 46
MCPEGOLLO/BASIC STATISTICS/SRSTHS
Solution
First, arrange the data in order for you to identify the important
characteristics. This can be done in two ways: rearranging from lowest
to highest or using the stem-and-leaf plot.
Below is the rearrangement of data from lowest to highest:
9 23 28 35 38 43 45 48
17 24 29 37 39 43 45 49
18 25 34 38 39 44 46 50
20 26 34 38 39 44 46 50
23 27 35 38 42 45 46 50
MCPEGOLLO/BASIC STATISTICS/SRSTHS
With the rearranged data, pertinent data worth
mentioning can be easily recognized. The following
is one way of presenting data in textual form.
MCPEGOLLO/BASIC STATISTICS/SRSTHS
Tabular Presentation of Data
Below is a sample of a table with all of its parts indicated:
Table Number
Table Title
Column Header
Row Classifier
Body
Source Note
http://www.sws.org.ph/youth.htm
MCPegollo/Basic Statistics/SRSTHS
Frequency Distribution
Tabular Form
MCPegollo/Basic Statistics/SRSTHS
Grouped Frequency Distribution
✔ Categorical
✔ Interval or Ratio
Constructing Frequency Distribution
Grouped Frequency
Categorical Frequency
Examples:
Gender Political affiliation
Business type Year level
Example 1
Twenty applicants were given a performance evaluation
appraisal. The data set is
Frequency of
Percentage Formula:
the class
Total number
Percentage of values
Sample of a Frequency Distribution Table
for Grouped Data
Table 1.2
Frequency Distribution Table for the Quiz Scores of 50
Students in Geometry
Score Frequency
s
0-2 1
3-5 2
6-8 13
9 - 11 15
12 - 14 19
MCPegollo/Basic Statistics/SRSTHS
Basic Terms in a Frequency Distribution
Table
MCPegollo/Basic Statistics/SRSTHS
Lower Class Limits
are the smallest numbers that can actually belong to
different classes
Rating Frequency
0-2 1
3-5 2
6-8 13
9 - 11 15
12 - 14 19
Lower Class Limits
are the smallest numbers that can actually
belong to different classes
Rating Frequency
0-2 1
Lower Class 3-5 2
Limits
6-8 13
9 - 11 15
12 - 14 19
Upper Class Limits
are the largest numbers that can actually belong to
different classes
Rating Frequency
0-2 1
3-5 2
6-8 13
9 - 11 15
12 - 14 19
Upper Class Limits
are the largest numbers that can actually belong to
different classes
Rating Frequency
Rating Frequency
- 0.5
0-2 20
2.5
3-5 14
5.5
6-8 15
8.5
9 - 11 2
11.5
12 - 14 1
14.5
Class Boundaries
number separating classes
Rating Frequency
- 0.5
0-2 20
2.5
Class 5.5
3-5 14
Boundaries 6-8 15
8.5
9 - 11 2
11.5
12 - 14 1
14.5
MCPegollo/Basic Statistics/SRSTHS
Class Midpoints
The Class Mark or Class Midpoint is the
respective average of each class limits
Class Midpoints
midpoints of the classes
Rating Frequency
0- 1 2 20
Class
3- 4 5 14
Midpoints
6- 7 8 15
9 - 10 11 2
12 - 13 14 1
Class Width
is the difference between two consecutive lower class limits
or two consecutive class boundaries
Rating Frequency
0-2 20
3-5 14
6-8 15
9 - 11 2
12 - 14 1
Class Width
is the difference between two consecutive lower class limits
or two consecutive class boundaries
Rating Frequency
3 0-2 20
3 3-5 14
Class Width 3 6-8 15
3 9 - 11 2
3 12 - 14 1
Relative Frequency Table
Relative
Rating Frequency Rating Frequency
Total frequency = 52
Table 2-5
Cumulative Frequency Table
Rating Frequency <cf >cf
0-2 20 20 52
3–5 14 34 32
Cumulative
6–8 15 49 18 Frequencies
9 – 11 2 51 3
12 – 14 1 52 1
Table 2-6
Frequency Tables
Relative Cumulative
Rating Frequency Rating Frequency Rating Frequency
9 - 11 2 9 - 11 3.8% 9 – 11 51
12 - 14 1 12 - 14 1.9% 12 – 14 52
2. Determine the class width by dividing the range by the number of classes
(range = highest score - lowest score) and round up.
range
class width ≈ round up of
number of classes
3. Select for the first lower limit either the lowest score or a
convenient value slightly less than the lowest score.
4. Add the class width to the starting point to get the second lower
class limit, add the width to the second lower limit to get the
third, and so on.
5. List the lower class limits in a vertical column and enter the
upper class limits.
6. Represent each score by a tally mark in the appropriate class.
Total tally marks to find the total frequency for each class.
Determining Class Interval
Rule: 2k ≥ n
18.80 22.00 23.40 24.30 27.00 27.90 31.00 26.00 20.80 17.00
20.00 22.60 23.40 24.50 27.00 29.30 32.10 26.10 21.00 17.30
20.25 22.75 23.70 24.70 27.40 30.10 33.70 26.30 21.60 17.80
18.40 21.90 23.00 23.85 26.80 27.80 30.80 25.00 20.40 15.50
18.70 21.90 23.20 24.10 26.90 27.90 30.90 25.20 20.50 15.70
17.95 21.75 22.90 23.70 26.50 27.50 30.60 24.75 20.25 14.10
18.35 21.80 22.90 23.70 26.50 27.60 30.75 25.00 20.30 14.30
20.20 22.80 23.50 24.60 27.30 29.50 32.90 26.20 21.30 17.40
Example 2
Range Percentage
Interval Midpoints
Relative frequencies
Step 1
✍ When k = 6
2k ≥ n ⇒ 26 ≤ 80 ⇒ 64 ≤ 80
✍ When k = 7
2k ≥ n ⇒ 27 ≥ 80 ⇒ 128 ≥ 80
14
14.10 17.95 20.25 21.75 22.90 23.70 24.75 26.50 27.50 30.60
14.30 18.35 20.30 21.80 22.90 23.70 25.00 26.50 27.60 30.75
15.50 18.40 20.40 21.90 23.00 23.85 25.00 26.80 27.80 30.80
15.70 18.70 20.50 21.90 23.20 24.10 25.20 26.90 27.90 30.90
17.00 18.80 20.80 22.00 23.40 24.30 26.00 27.00 27.90 31.00
17.30 20.00 21.00 22.60 23.40 24.50 26.10 27.00 29.30 32.10
17.40 20.20 21.30 22.75 23.50 24.60 26.20 27.30 29.50 32.90
17.80 20.25 21.60 22.80 23.70 24.70 26.30 27.40 30.10 33.70
Step 2 (continuation)
Determine Lower and Upper class limits
Class Limits
14 - 16
17 - 19
20 - 22
Lower Limit 23 - 25 Upper Limit
26 - 28
29 - 31
32 - 34
Step 3
Convert the tallied data to numerical frequencies
14 - 16 IIII 4
17 - 29 IIII-IIII 9
20 - 22 IIII-IIII-IIII-I 16
23 - 25 IIII-IIII-IIII-IIII-III 23
26 - 28 IIII-IIII-IIII-II 17
29 - 31 IIII-III 8
32 - 34 III 3
Step 4
Determine the relative frequency (rf)
14 - 16 4 0.05
17 - 29 9 0.11
20 - 22 16 0.20
23 - 25 23 0.29
26 - 28 17 0.21
29 - 31 8 0.10
32 - 34 3 0.04
Total 80 1.00
3 ÷ 80 ≈ 0.04
Step 5
Determine the percentage
14 - 16 4 5
17 - 29 9 11
20 - 22 16 20
23 - 25 23 29
26 - 28 17 21
29 - 31 8 10
32 - 34 3 4
Total 80 100
14 - 16 4 4 4
17 - 29 9 13 4+9
20 - 22 16 29 4 + 9 +16
23 - 25 23 52 4 + 9 +16 + 23
26 - 28 17 69 4 + 9 +16 + 23 + 17
29 - 31 8 77 4 + 9 +16 + 23 + 17 + 8
32 - 34 3 80 4 + 9 +16 + 23 + 17 + 8 + 3
Total 80
Step 6
18 29 42 57 61 67 37 49 53 47
24 34 45 58 63 70 39 51 54 48
28 36 46 60 66 72 40 52 56 49
19 31 44 58 62 68 38 50 54 48
27 36 46 59 64 72 39 51 55 48
Stem Leaf
1 8, 9
2 4, 7, 8, 9
3 1, 4, 6, 6, 7, 8, 9, 9
4 0, 2, 4, 5, 6, 6, 7, 8, 8, 8, 9, 9
5 0, 1, 1, 2, 3, 4, 4, 5, 6, 7, 8, 8, 9
6 0, 1, 2, 3, 4, 6, 7, 8
7 0, 2, 2
✔ Frequency Polygon
✔ Histogram
MCPegollo/Basic Statistics/SRSTHS
Graphical Presentation of Data
Frequency Polygon
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
Graphical Presentation of Data
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
Graphical Presentation of Data
Cumulative Frequency Polygon or Ogive
⚫ An ogive (oh-jive), sometimes called a cumulative
frequency polygon, is a type of frequency polygon that
shows cumulative frequencies. In other words, the
cumulative percents are added on the graph from left to
right.
⚫ An ogive graph plots cumulative frequency on the y-
axis and class boundaries along the x-axis. It’s very
similar to a histogram, only instead of rectangles, an
ogive has a single point marking where the top right of
the rectangle would be. It is usually easier to create this
kind of graph from a frequency table.
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
Graphical Presentation of Data
Cumulative Frequency Polygon or Ogive
There are two types of ogives :
1.Less than ogive : Plot the points with the upper limits of
the class as abscissa and the corresponding less than
cumulative frequencies as ordinates. The points are joined
by free hand smooth curve to give less than cumulative
frequency curve or the less than Ogive. It is a rising curve.
2.Greater than ogive : Plot the points with the lower limits
of the classes as abscissa and the corresponding Greater
than cumulative frequencies as ordinates. Join the points
by a free hand smooth curve to get the “More than Ogive”.
It is a falling curve.
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
Other Types of Graphs/ Charts
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
Example 4
Products Sales
Junk Foods 135
Candy 250
Ice Cream 185
Chocolate 210
Others 90
Pareto Chart
It represents a frequency distribution for a categorical data
(or nominal-level) & frequencies are displayed by the heights
of vertical bars, which are arranged in order from highest to
lowest.
Bar Chart (Bar Graph)
The bases of the rectangles are arbitrary intervals whose
centers are the codes. The height of each rectangle represents
the frequency of that category. It is also applicable for
categorical data (or nominal-level).
Pie Chart (Circle Graph)
A circle divided into portions that represent the relative
frequencies (or percentages) of the data belonging to different
categories. The data in a pie chart should be categorical or
nominal-level.
Time-Series Graph
Peso/US Dollar 41 42 43 46 44 45
Exchange Rate
Month July August Sept Oct Nov Dec
Peso/US Dollar 43 42 45 44 45 43
Exchange Rate
800
700
600
No. of houses
500
400
300
200
100
2012 2013 2014 2015 2016
Year
Legend: = 100 houses
MCPegollo/Basic Statistics/SRSTHS
Consider the following test scores in a
Statistics class. Construct a frequency
distribution table out of it.
9 23 28 35 38 43 45 48
17 24 29 37 39 43 45 49
18 25 34 38 39 44 46 50
20 26 34 38 39 44 46 50
23 27 35 38 42 45 46 50
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
Exercise:
For each of the following class intervals, give the
class width(i), class mark (x), and class
boundary (cb)
Class interval (ci) Class Width Class Mark Class
Boundary
a. 4 – 8
b. 35 – 44
c. 17 – 21
d. 53 – 57
e. 8 – 11
f. 108 – 119
g. 10 – 19
h. 25 – 29
i. 175 – 225
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS
MCPegollo/Basic Statistics/SRSTHS