Statistics
Statistics
Chapter summary
• Discrete data can take only par ticular values, and each value is distinct from the others.
• Continuous data is not restricted to cer tain xed values, but it can occupy any value within a
continuous range.
• Data is reliable if you can repeat the data collection process and obtain similar results.
• The population consists of every member in the group that you want to nd out about.
• A sample is a subset of the population that will give you information about the population as a whole.
• Dierent sampling methods include convenience, simple random, systematic, stratied or quota
sampling
• A bar char t is used to display discrete data, whereas a histogram is used to display continuous data.
Measures of dispersion
• The range is the dierence between the largest and smallest values.
• Quar tiles divide a sample of data into four groups containing equal numbers of obser vations.
• Q is the median of the lower 50% of the data, and Q is the median of the upper 50% of the data.
1 3
• The interquar tile range is the value of the third quar tile minus the value of the rst quar tile.
• Dierent methods for nding quar tiles exist and therefore the values obtained using technology
Range
Middle 50%
Whisker Whisker
Min X Max X
Q m Q
1 3
(Median)
• Ex treme values are called outliers and might be considered for removal from a sample. An outlier
3 1
• Adding up the frequencies of the data values as we go along is called calculating the cumulative
frequency (CF).
80
x f CF
70
0 0
0 < x ≤ 10
ycneuqerF
60
19 19
10 < x ≤ 20
50
30 49
20 < x ≤ 30
evitalumuC
40
20 69
30 < x ≤ 40
30
6 75
40 < x ≤ 50 20
10
Q Q
1 3
5 10 15 20 25 30 35 40 45 50
310
/
6.4
6
• Standard deviation is a measure of how spread out the items of a data set are from the mean.
x x
2
• Adding a constant c to every value in a data set increases the mean and the median by c, and has
• Multiplying every value in a data set by a constant d means that the mean and standard deviation
graph.
dna scitsitatS
ytilibaborp
5 What would happen to the mean, median and standard deviation if the teacher decided to multiply all of
Chapter review
review exercise
a father owns.
month was 420. From October to
per month.
Ties 0 1 2 3 4 5 6
f 4 8 10 20 4 3 1
experiment.
311
/
6 REPRE SENTING D ATA : S TAT I S T I C S FOR U N I V A R I AT E D ATA
a Find the mean number of minutes the 8 The test results for a group of children
students spent doing the experiment. in a school district are shown on this
800
15 minutes.
700
setadidnac
600
500
400
fo
300
rebmuN
6 A data set has a mean of 48 and a standard
200
deviation of 5.
100
10 20 30 40 50 60 70 80 90 100
Marks
deviation.
and variance.
marks.
that a class of 8 year-olds could jump.
d 60 74 c
40
range is 24 cm.
1 8
2 8
3
8
6 6 6
4 4 4
2 2 2
0 0 0
E X AM HINT
deviation and b nd the interquartile
for working.
312
/
6
11 Mr M a t e o ’s class were wearing rings for a How many bats were weighed?
table.
Rings 2 3 4 5 6 7
The data is also presented in this table.
f 3 4 10 3 2 2
30 < w ≤ 60 45
60 < w ≤ 90 b
assessment about the number of friends
Friends (p) f
0 < p ≤ 5 15
10 < p ≤ 15 9
15 < p ≤ 20 12
x f Cumulative frequency
20 < p ≤ 25 6
10 3 3
15 11 14
median and standard deviation.
20 16 n
b Explain why you are only able to nd
estimates in part a 25 42
dna scitsitatS
m
ytilibaborp
30 8 50
13 This table shows the number of watches
Watches 0 1 2 3 4 5
f 11 7 6 k 8 10
Mobile devices 1 2 3 4 5 6
a small cave produced this cumulative
frequency graph. f 41 60 52 32 15 8
80
a State whether the data is discrete or
continuous.
70
ycneuqerf
50
evitalumuC
40
30
20
10
20 40 60 80 100 120
Weights (g)
313
/
6 REPRE SENTING D ATA : S TAT I S T I C S FOR U N I V A R I AT E D ATA
Exam-style questions
a set of statistical data. (1 mark)
continuous data.
nd the
a A student’s grade on an IB SL
i mode
Mathematics exam. (1 mark)
ii median
b The volume of water that a person
(2 marks)
21 P2: Sue has collected continuous data on the heights of owers and has represented it in the
table below.
Frequency 40 45 50 60 5
d Sue’s calculator states that the median is 25. Find a better estimate than this by
considering the median’s position within the interval that it belongs to. Give your
Frequency 2 5 8 40 50 20 15
1 grade. (3 marks)
a State whether this data is discrete or f Identify with justication any outliers.
deviation. (4 marks)
are shown in the table below.
314
/
6
x Frequency Cumulative
numbers. (4 marks)
frequency
0 10 10
1 7
2 11 28
(3 marks)
3 41
4 15 56
has a unique mode. (2 marks)
5 71
6 12
7 10 93
9 2 99
10 1 100
24 P1: Grouped, continuous data for the mass, w kg, of a group of adults is given in the table
below.
Mass
40 < w ≤ 50 50 < w ≤ 60 60 < w ≤ 70 70 < w ≤ 80 80 < w ≤ 90 90 < w ≤ 100 100 < w ≤ 110 110 < w ≤ 120
Frequency 5 15 25 30 50 35 25 15
dna scitsitatS
c On graph paper draw a cumulative frequency curve, with 1 cm representing
ytilibaborp
10 kg on the x-axis and 1 cm representing 10 adults on the y-axis. (5 marks)
d Hence, estimate values for the i median ii lower quartile iii upper quartile.
Draw lines on your graph to indicate how you obtained these values. (4 marks)
25 P2: Sally and Rob each teach an IB SL 26 P2: Discrete data on an IB exam taken by
Grade 1 2 3 4 5 6 7
1, 1, 4, 7, 8, 8, 10, 10
Frequency 4 8 16 20 16 8 4
a For the data from Sally’s class nd the represent 1 grade on the x-axis and
that her class did better than Rob’s c Explain your answers to parts
class. (1 mark)
315