Part 1 Descriptive
Part 1 Descriptive
1
List of symbols used
2
RAW DATA
ID GENDER COURSE QUIZ SCORE (10%)
1 M BBB 8
2 M BBB 7
3 M BBC 8
4 F BBC 6
5 F BBC 7
6 M BBE 5
7 M BBE 5
8 F BBC 7
9 F BBB 8
10 F BBC 7
11 F BBE 7
12 M BBB 7
3
Outline
4
Overview
Descriptive Statistics
• Describes the important characteristics of a set of
data.
• Organize, present, and summarize data:
1. Graphically
2. Numerically
5
Important Characteristics of
Quantitative Data
“Shape, Center, and Spread”
• Center: A representative or average value that
indicates where the middle of the data set is located.
Symmetric
• Data is symmetric if the left half of its histogram is
roughly a mirror image of its right half.
Skewed
• Data is skewed if it is not symmetric and if it
extends more to one side than the other.
Uniform
• Data is uniform if it is equally distributed (on a
histogram, all the bars are the same height or
approximately the same height).
The Shape of Distributions
Symmetric Uniform
Outliers
• Unusual data values as compared to the rest of the set.
They may be distinguished by gaps in a histogram.
Section 2.1
Frequency Distributions
and Their Graphs
10
Frequency Distributions
Frequency Distribution
• A table that organizes data values into classes or
intervals along with number of values that fall in
each class (frequency, f ).
1. Ungrouped Frequency Distribution – for data
sets with few different values. Each value is in
its own class.
Ungrouped Grouped
6 5 4 5 5 3 5
6 2 3 5 5
4 9
5 5 7 4 3
4 5 4 5 6 5 18
5 1 6 2 6 6 12
6 6 6 6 4
7 3
4 5 4 5 3
5 5 7 6 5
Graphs of Frequency Distributions:
Frequency Histograms
Frequency Histogram
• A bar graph that represents the frequency distribution.
• The horizontal scale is quantitative and measures the
data values.
• The vertical scale measures the frequencies of the
classes.
• Consecutive bars must touch.
15
Frequency, f
2 2
3 5 10
4 9 5
5 18
0
6 12 1 2 3 4 5 6 7
Number of Peas
7 3
Relative Frequency Distributions and
Relative Frequency Histograms
Relative Frequency Distribution
• Shows the portion or percentage of the data that falls
in a particular class.
class frequency f
• relative frequency
Sample size n
19
Labeling Grouped Frequency
Distributions
• Class midpoints: the value halfway between LCL
and UCL:
(Lower class limit) (Upper class limit)
2
22
4 RULES: Constructing a Frequency
Distribution
23
4 RULES: Example
24
4 RULES: Example
1. We note that the maximum is 94 and the minimum is
53. Thus we have a range of 94 - 53 + 1 = 42 potential
scores.
2. We divide this range into about 10 class intervals (rule
1), each of which is a "simple" number (rule 2). This
leads to 9 intervals with a width of 5 each.
3. We make the bottom interval start at 50 since this is
the first number below the minimum that is a multiple
of the interval width (rule 3).
4. The bottom inteval would be 50-54, the next one 55-
59, etc. This makes all intervals the same width (rule
4). 25
4 RULES: Example
26
Other Graphs
Dot plot
• Consists of a graph in which each data value is
plotted as a point along a scale of values
Figure 2-5
Time Series
(Paired data)
Time Series
• Data set is composed of quantitative entries taken at
regular intervals over a period of time.
e.g., The amount of precipitation measured each
day for one month.
• Use a time series chart to graph.
Quantitative
data time
Larson/Farber 4th ed. 32
Time-Series Graph
Figure 2-8
Ex. www.eia.doe.gov/oil_gas/petroleum/
Graphing Qualitative Data Sets
Pie Chart
• A circle is divided into sectors
that represent categories.
Pareto Chart
• A vertical bar graph in which the
Frequency
height of each bar represents
frequency or relative frequency.
Categories
Figure 2-6
Contoh:
k = 1 + 3.3 log N
k = 1 + 3.3 log N
= 1 + 3.3 log 60
= 1.3.3 (1.7782)
= 1 + 5.8681
= 6.8681
7.0
.
37
Setiap kelas mengandungi 6 unit
7
Talli untuk kepada
adalah seperti berikut:
20
15
f 10
0
1 2 3 4 5 6 7
Kelas M en giku t Berat
THANK YOU.