0% found this document useful (0 votes)
39 views

5 1 Representation of Data Hard

Uploaded by

Aaron Bvitira
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
39 views

5 1 Representation of Data Hard

Uploaded by

Aaron Bvitira
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 29

Probability and Statistics 1

5.1 Representation of Data - Hard

Subject: Mathematics
Syllabus Code: 9709
Level: AS Level
Component: Probability and Statistics 1
Topic: 5.1 Representation of Data
Difficulty: Hard
Questions
1. The numbers of chocolate bars sold per day in a cinema over a period of 100 days are summarised
in the following table. (9709/51/M/J/20 number 7)

No. of chocolate bars sold 1 − 10 11 − 15 16 − 30 31 − 50 51 − 60


No. of days 18 24 30 20 8

(a) Draw a histogram to represent this information.


(b) What is the greatest value of the interquartile range for this data?
(c) Calculate estimates of the mean and standard deviation of the number of chocolate bars sold.

2. The annual salaries, in thousands of dollars, for 11 employees at each of two companies A and B
are shown below. (9709/53/M/J/20 number 6)

Company A 30 32 35 41 41 42 47 49 52 53 64
Company B 26 47 30 52 41 38 35 42 49 31 42

(a) Represent the data by drawing a back-to-back stem-and-leaf diagram with company A on the
left-hand side of the diagram.
(b) Find the median and the interquartile range of the salaries of the employees in company A.
A new employee joins company B. The mean salary of the 12 employees is now $38 500.
(c) Find the salary of the new employee.

3. The times, t minutes, taken by 150 students to complete a particular challenge are summarised in
the following cumulative frequency table. (9709/51/O/N/20 number 6)

Time taken (t minutes) t ≤ 20 t ≤ 30 t ≤ 40 t ≤ 60 t ≤ 100


Cumulative Frequency 12 48 106 134 150

(a) Draw a cumulative frequency graph to illustrate the data.


(b) 24% of the students take k minutes or longer to complete the challenge. Use your graph to
estimate the value of k.
(c) Calculate estimates of the mean and standard deviation of the time taken to complete the
challenge.

4. A particular piece of music was played by 91 pianists and for each pianist, the number of incorrect
notes was recorded. The results are summarised in the table. (9709/53/O/N/20 number 7)

Number of incorrect notes 1 − 5 6 − 10 11 − 20 21 − 40 41 − 70


Frequency 10 5 26 32 18

(a) Draw a histogram to represent this information.


(b) State which class interval contains the lower quartile and which class contains the upper
quartile. Hence find the greatest possible value of the interquartile range.
(c) Calculate an estimate for the mean number of incorrect notes.

5. The distances, xm, travelled to school by 140 children was recorded. The results are summarised
in the table below. (9709/52/O/N/21 number 7)

1
Distance, x m x ≤ 200 x ≤ 300 x ≤ 500 x ≤ 900 x ≤ 1200 x ≤ 1600
Cumulative Frequency 16 46 88 122 134 140

(a) On the grid, draw a cumulative frequency graph to represent these results.
(b) Use your graph to estimate the interquartile range of these distances.
(c) Calculate estimates of the mean and standard deviation of the distances.

6. The time, t minutes, taken to complete a walking challenge by 250 members of a club are summarised
in the table. (9709/53/O/N/22 number 3)

Time taken (t minutes) t ≤ 20 t ≤ 30 t ≤ 35 t ≤ 40 t ≤ 50 t ≤ 60


Cumulative Frequency 32 66 112 178 228 250

(a) Draw a cumulative frequency graph to illustrate the data.


(b) Use your graph to estimate the 60th percentile of the data.
It is given that an estimate for the mean time taken to complete the challenge by these 250
members is 34.4 minutes.
(c) Calculate an estimate for the standard deviation of the times taken to complete the challenge
by these 250 members.

7. The times, to the nearest minute, of 150 athletes taking part in a charity run are recorded. The
results are summarised in the table. (9709/51/O/N/23 number 4)

Time in minutes 101 − 120 121 − 130 131 − 135 136 − 145 146 − 160
Frequency 18 48 34 32 18

(a) Draw a histogram to represent this information.


(b) Calculate estimates for the mean and standard deviation of the times taken by the athletes.

8. The heights, in cm, of the 11 players in each of two teams, the Aces and the Jets, are shown in the
following table. (9709/52/O/N/23 number 4)

Aces 180 174 169 182 181 166 173 182 168 171 164
Jets 175 174 188 168 166 174 181 181 170 188 190

(a) Draw a back-to-back stem-and-leaf diagram to represent this information with the Aces on the
left-hand side of the diagram.
(b) Find the median and interquartile range of the heights of the players in the Aces.
(c) Give one comment comparing the spread of the heights of the Aces with the spread of the
heights of the Jets.

9. The weights, x kg, of 120 students is a sports college are recorded. The results are summarised in
the following table. (9709/53/O/N/23 number 4)

Weight (x kg) x ≤ 40 x ≤ 60 x ≤ 65 x ≤ 70 x ≤ 85 x ≤ 100


Cumulative Frequency 0 14 38 60 106 120

(a) Draw a cumulative frequency graph to represent this information.


(b) It is found that 35% of the students weigh more than W kg. Use your graph to estimate the
value of W .

2
(c) Calculate estimates for the mean and standard deviation of the weights of the 120 students.

10. Helen measures the lengths of 150 fish of a certain species in a large pond. These lengths, correct
to the nearest centimetre, are summarised in the following table. (9709/52/F/M/20 number 7)

Length (cm) 0 − 9 10 − 14 15 − 19 20 − 30
Cumulative Frequency 15 48 66 21

(a) Draw a cumulative frequency graph to illustrate the data.


(b) 40% of these fish have a length of dcm or more. Use your graph to estimate the value of d.
The mean length of these 150 fish is 15.295cm.
(c) Calculate an estimate for the variance of the lengths of the fish.

11. The following table gives the weekly snowfall, in centimetres, for 11 weeks in 2018 at two ski resorts,
Dados and Linva. (9709/52/O/N/20 number 5)

Dados 6 8 12 15 10 36 42 28 10 22 16
Linva 2 11 15 16 0 32 36 40 10 12 9

(a) Represent the information on a back-to-back stem-and-leaf diagram.


(b) Find the median and the interquartile range for the weekly snowfall in Dados.
(c) The median, lower quartile and upper quartile of the weekly snowfall for Linva are 12, 9 and
32cm respectively. Use this information and your answers to part (b) to compare the central
tendency and the spread of the weekly snowfall in Dados and Linva.

12. The times taken, in minutes, by 360 employees at a large company to travel from home to work are
summarised in the following table. (9709/53/O/N/21 number 3)

Time, t minutes 0≤t<5 5 ≤ t < 10 10 ≤ t < 20 20 ≤ t < 30 30 ≤ t < 50


Frequency 23 102 135 76 24

(a) Draw a histogram to represent this information.


(b) Calculate an estimate of the mean time taken by an employee to travel to work.

3
Answers
1. The numbers of chocolate bars sold per day in a cinema over a period of 100 days are summarised
in the following table. (9709/51/M/J/20 number 7)

No. of chocolate bars sold 1 − 10 11 − 15 16 − 30 31 − 50 51 − 60


No. of days 18 24 30 20 8

(a) Draw a histogram to represent this information.

Notice how there are gaps between our classes. Let’s apply continuity correction to
get rid of the gaps,

No. of chocolate bars sold 0.5 − 10.5 10.5 − 15.5 15.5 − 30.5 30.5 − 50.5 50.5 − 60.5
No. of days 18 24 30 20 8

Note: To apply correction, add 0.5 to the upper bounds and subtract 0.5 from the
lower bounds.

Draw a table of class width and frequency density,


Class Width 10 5 15 20 10
Frequency Density 1.8 4.8 2.0 1.0 0.8

Label the y-axis with frequency density and the x-axis with number of chocolate bars
sold. Plot frequency density against the number of chocolate bars sold, ensuring
that you use the correct class width for each class,

4
Frequency density

0.5 10.5 15.5 30.5 50.5 60.5

Number of chocolate bars sold

(b) What is the greatest value of the interquartile range for this data?

The formula for interquartile range is,

IQR = q3 − q1

4
To find the greatest possible value of the interquartile range we need the largest
value of the upper quartile and the smallest value of the lower quartile. Let’s find
the largest value of the upper quartile,
3
q3 = n
4
3
q3 = × 100
4
q3 = 75

If you add up the frequencies, 75 lies between,

31 − 50

The largest value in that class is,


50

Now let’s find the smallest value of the lower quartile,


1
q1 = n
4
1
q1 = × 100
4
q1 = 25

If you add up the frequencies, 25 lies between,

11 − 15

The smallest value in that class is,


11

Substitute into the formula for interquartile range,

IQR = 50 − 11

IQR = 39

Therefore, the final answer is,


IQR = 39
(c) Calculate estimates of the mean and standard deviation of the number of chocolate bars sold.
No. of chocolate bars sold 1 − 10 11 − 15 16 − 30 31 − 50 51 − 60
No. of days 18 24 30 20 8

Let’s start by finding the midpoints of each class,


Mid Interval 5.5 13 23 40.5 55.5
Number of players 18 24 30 20 8

5
The formula for grouped mean is,
Σxf
x=
Σx

Let’s find Σxf ,

Σxf = 18 × 5.5 + 24 × 13 + 30 × 23 + 20 × 40.5 + 8 × 55.5

Σxf = 2355

Substitute into the formula for mean,


2355
x=
100
x = 23.55

The mean is,


x = 23.55

Now let’s find the standard deviation,


r
Σx2 f
σ= − x2
Σx

Let’s find Σx2 f ,

Σx2 f = 18 × 5.52 + 24 × 132 + 30 × 232 + 20 × 40.52 + 8 × 55.52

Σx2 f = 77917.5

Substitute into the formula for standard deviation,


r
77917.5
σ= − (23.55)2
100

Simplify,
σ = 15.0

Therefore, the final answer is,

x = 23.55 σ = 15.0

2. The annual salaries, in thousands of dollars, for 11 employees at each of two companies A and B
are shown below. (9709/53/M/J/20 number 6)

Company A 30 32 35 41 41 42 47 49 52 53 64
Company B 26 47 30 52 41 38 35 42 49 31 42

(a) Represent the data by drawing a back-to-back stem-and-leaf diagram with company A on the
left-hand side of the diagram.

6
We need to determine a scale for our stem. Our data ranges from 26000 to 64000
so our stem will run from 2 to 6 to represent 2000 to 6000. Now let’s draw our
stem-and-leaf diagram and don’t forget to add a key,
A B
2 6
5 2 0 3 0 1 5 8
9 7 2 1 1 4 1 2 2 7 9
3 2 5 2
4 6

Key: 1 | 4 | 2 means $41000 for A and $42000 for B


(b) Find the median and the interquartile range of the salaries of the employees in company A.

Let’s start by finding the median. The median position is,


n+1
q2 =
2
11 + 1
q2 =
2
q2 = 6

Identify the data point in position 6,

q2 = $42000

Now let’s find the interquartile range. The formula for interquartile range is,

IQR = q3 − q1

The position for the upper quartile is,


3
q3 = (n + 1)
4
3
q3 = (11 + 1)
4
q3 = 9

Find the data point in position 9,

q3 = 52000

Now let’s find the lower quartile,


1
q1 = (n + 1)
4
1
q1 = (11 + 1)
4
q1 = 3

7
Find the data point in position 3,

q1 = 35000

Let’s substitute into the formula for interquartile range,

IQR = 52000 − 35000

IQR = $17000

Therefore, the final answer is,

q2 = $42000 IQR = $17000

A new employee joins company B. The mean salary of the 12 employees is now $38 500.
(c) Find the salary of the new employee.

Let’s construct an equation using the formula for mean,


Σx + x12
x=
n+1

Note: Where x12 represents the salary of the new employee.

We are given that the mean is 12 and the original number of cyclists is 11,
Σx + x12
38500 =
11 + 1
Σx + x12
38500 =
12

Let’s find Σx,

Σx = 26000+47000+30000+52000+41000+38000+35000+42000+49000+31000+42000

Σx = 433000

Substitute into our equation,


433000 + x12
38500 =
12

Solve for x20 ,


462000 = 433000 + x12
x12 = 462000 − 433000
x12 = 29000

8
Therefore, the final answer is,
$29000

3. The times, t minutes, taken by 150 students to complete a particular challenge are summarised in
the following cumulative frequency table. (9709/51/O/N/20 number 6)

Time taken (t minutes) t ≤ 20 t ≤ 30 t ≤ 40 t ≤ 60 t ≤ 100


Cumulative Frequency 12 48 106 134 150

(a) Draw a cumulative frequency graph to illustrate the data.

Label the y-axis as cumulative frequency and the x-axis as times (t minutes). Plot
the upper bounds against the cumulative frequency. Join the plots to form an
s-shaped curve,
150

140

130

120

110

100
Cumulative Frequency

90

80

70

60

50

40

30

20

10

0
0 10 20 30 40 50 60 70 80 90 100
Time taken (t minutes)

(b) 24% of the students take k minutes or longer to complete the challenge. Use your graph to
estimate the value of k.

9
This means 76% of the students take less than k minutes. Let’s find 76% of 150,
76
× 150 = 114
100

Draw construction lines at a cumulative frequency of 114 and read off the time,

k = 44 minutes

Therefore, the final answer is,


k = 44 minutes
(c) Calculate estimates of the mean and standard deviation of the time taken to complete the
challenge.
Time taken (t minutes) t ≤ 20 t ≤ 30 t ≤ 40 t ≤ 60 t ≤ 100
Cumulative Frequency 12 48 106 134 150

Let’s start by finding the midpoints and frequencies of each class,


Midpoint 10 25 35 50 80
Frequency 12 36 58 28 16

The formula for grouped mean is,


Σxf
x=
Σx

Let’s find Σxf ,

Σxf = 10 × 12 + 25 × 36 + 35 × 58 + 50 × 28 + 80 × 16

Σxf = 5730

Substitute into the formula for mean,


5730
x=
150
x = 38.2

The mean is,


x = 38.2

Now let’s find the standard deviation,


r
Σx2 f
σ= − x2
Σx

Let’s find Σx2 f ,

Σx2 f = 102 × 12 + 252 × 36 + 352 × 58 + 502 × 28 + 802 × 16

Σx2 f = 267150

10
Substitute into the formula for standard deviation,
r
267150
σ= − (38.2)2
150

Simplify,
σ = 17.9

Therefore, the final answer is,

x = 38.2 σ = 17.9

4. A particular piece of music was played by 91 pianists and for each pianist, the number of incorrect
notes was recorded. The results are summarised in the table. (9709/53/O/N/20 number 7)

Number of incorrect notes 1 − 5 6 − 10 11 − 20 21 − 40 41 − 70


Frequency 10 5 26 32 18

(a) Draw a histogram to represent this information.

Notice how there are gaps between our classes. Let’s apply continuity correction to
get rid of the gaps,

Number of incorrect notes 0.5 − 5.5 5.5 − 10.5 10.5 − 20.5 20.5 − 40.5 40.5 − 70.5
Frequency 10 5 26 32 18

Note: To apply correction, add 0.5 to the upper bounds and subtract 0.5 from the
lower bounds.

Draw a table of class width and frequency density,


Class width 5 5 10 20. 30
Frequency density 2 1 2.6 1.6 0.6

Label the y-axis with frequency density and the x-axis with number of incorrect
notes. Plot frequency density against the number of incorrect notes, ensuring that
you use the correct class width for each class,

11
3

Frequency density
2

0.5 5.5 10.5 20.5 40.5 70.5

Number of incorrect notes

(b) State which class interval contains the lower quartile and which class contains the upper
quartile. Hence find the greatest possible value of the interquartile range.

Let’s start by finding the class that contains the upper quartile,
3
q3 = n
4
3
q3 = × 91
4
q3 = 68.25

If you add up the frequencies, 68.25 lies within,

21 − 40

The class that contains the upper quartile is,

21 − 40

Now let’s find the class that contains the lower quartile,
1
q1 = n
4
1
q1 = × 91
4
q1 = 22.75

If you add up the frequencies, 22.75 lies within,

11 − 20

The class that contains the lower quartile is,

11 − 20

12
The formula for the interquartile range is,

IQR = q3 − q1

To find the greatest possible value of the interquartile range we have to find the
largest value of the upper quartile and smallest value of the lower quartile. The
upper quartile lies within,
21 − 40

The largest value in that class is,


40

The lower quartile lies within,


11 − 20

The smallest value in that class is,


11

Substitute into the formula for interquartile range,

IQR = 40 − 11

IQR = 29

Therefore, the final answer is,


IQR = 29
(c) Calculate an estimate for the mean number of incorrect notes.

Number of incorrect notes 0.5 − 5.5 5.5 − 10.5 10.5 − 20.5 20.5 − 40.5 40.5 − 70.5
Frequency 10 5 26 32 18

Let’s start by finding the midpoints of each class,


Midpoint 3 8 15.5 30.5 55.5
Frequency 10 5 26 32 18

The formula for grouped mean is,


Σxf
x=
Σx

Let’s find Σxf ,

Σxf = 3 × 10 + 8 × 5 + 15.5 × 26 + 30.5 × 32 + 55.5 × 18

Σxf = 2448

13
Substitute into the formula for mean,
2448
x=
91
x = 26.9

Therefore, the final answer is,


x = 26.9

5. The distances, xm, travelled to school by 140 children was recorded. The results are summarised
in the table below. (9709/52/O/N/21 number 7)

Distance, x m x ≤ 200 x ≤ 300 x ≤ 500 x ≤ 900 x ≤ 1200 x ≤ 1600


Cumulative Frequency 16 46 88 122 134 140

(a) On the grid, draw a cumulative frequency graph to represent these results.

Label the y-axis as cumulative frequency and the x-axis as distance (x m). Plot the
upper bounds against the cumulative frequency. Join the plots to form an s-shaped
curve,
140

130

120

110

100

90
Cumulative Frequency

80

70

60

50

40

30

20

10

0
0 200 400 600 800 1,000 1,200 1,400 1,600
Distance (x m)

(b) Use your graph to estimate the interquartile range of these distances.

14
The formula for interquartile range is,

IQR = q3 − q1

Let’s start by finding the upper quartile,


3
q3 = n
4
3
q3 = × 140
4
q3 = 105

Draw construction lines at a cumulative frequency of 105 and read off the distance,

q3 = 685

Now let’s find the lower quartile,


1
q1 = n
4
1
q1 = × 140
4
q1 = 35

Draw construction lines at a cumulative frequency of 35 and read off the distance,

q1 = 260

Substitute into the formula for interquartile range,

IQR = 685 − 260

IQR = 425

Therefore, the final answer is,


IQR = 425
(c) Calculate estimates of the mean and standard deviation of the distances.
Distance, x m x ≤ 200 x ≤ 300 x ≤ 500 x ≤ 900 x ≤ 1200 x ≤ 1600
Cumulative Frequency 16 46 88 122 134 140

Let’s start by finding the midpoints and frequencies of each class,


Midpoint 100 250 400 700 1050 1400
Frequency 16 30 42 34 12 6

The formula for grouped mean is,


Σxf
x=
Σx

15
Let’s find Σxf ,

Σxf = 16 × 100 + 30 × 250 + 42 × 400 + 34 × 700 + 12 × 1050 + 6 × 1400

Σxf = 70700

Substitute into the formula for mean,


70700
x=
140
x = 505

The mean is,


x = 505

Now let’s find the standard deviation,


r
Σx2 f
σ= − x2
Σx

Let’s find Σx2 f ,

Σx2 f = 16 × 1002 + 30 × 2502 + 42 × 4002 + 34 × 7002 + 12 × 10502 + 6 × 14002

Σx2 f = 50405000

Substitute into the formula for standard deviation,


r
50405000
σ= − (505)2
140

Simplify,
σ = 324

Therefore, the final answer is,

x = 505 σ = 324

6. The time, t minutes, taken to complete a walking challenge by 250 members of a club are summarised
in the table. (9709/53/O/N/22 number 3)

Time taken (t minutes) t ≤ 20 t ≤ 30 t ≤ 35 t ≤ 40 t ≤ 50 t ≤ 60


Cumulative Frequency 32 66 112 178 228 250

(a) Draw a cumulative frequency graph to illustrate the data.

16
Label the y-axis as cumulative frequency and the x-axis as time (t minutes). Plot the
upper bounds against the cumulative frequency. Join the plots to form an s-shaped
curve,
250

225

200

175
Cumulative Frequency

150

125

100

75

50

25

0
0 10 20 30 40 50 60
Time taken (t minutes)

(b) Use your graph to estimate the 60th percentile of the data.

Let’s find 60% of 250,


60
× 250 = 150
100

Draw construction lines at a cumulative frequency of 150 and read off the time,

t = 38 minutes

Therefore, the final answer is,


t = 38 minutes
It is given that an estimate for the mean time taken to complete the challenge by these 250
members is 34.4 minutes.
(c) Calculate an estimate for the standard deviation of the times taken to complete the challenge
by these 250 members.
Midpoint 10 25 32.5 37.5 45 55
Frequency 32 34 46 66 50 22

17
The formula for standard deviation is,
s
Σx2 f
σ= − x2
Σf

We need to find Σx2 f ,

Σx2 f = 32 × 102 + 34 × 252 + 46 × 32.52 + 66 × 37.52 + 50 × 452 + 22 × 552

Σx2 f = 333650

Substitute into the formula for standard deviation,


r
333650
σ= − (34.4)2
250

Simplify,
σ = 12.3

Therefore, the final answer is,


σ = 12.3

7. The times, to the nearest minute, of 150 athletes taking part in a charity run are recorded. The
results are summarised in the table. (9709/51/O/N/23 number 4)

Time in minutes 101 − 120 121 − 130 131 − 135 136 − 145 146 − 160
Frequency 18 48 34 32 18

(a) Draw a histogram to represent this information.

Notice how there are gaps between our classes. Let’s apply continuity correction to
get rid of the gaps,

Time in minutes 100.5 − 120.5 120.5 − 130.5 130.5 − 135.5 135.5 − 145.5 145.5 − 160.5
Frequency 18 48 34 32 18

Note: To apply correction, add 0.5 to the upper bounds and subtract 0.5 from the
lower bounds.

Draw a table of class width and frequency density,


Class width 20 10 5 10 15
Frequency density 0.9 4.8 6.8 3.2 1.2

18
Label the y-axis with frequency density and the x-axis with time (minutes). Plot
frequency density against the time, ensuring that you use the correct class width for
each class,

Frequency density 5

100.5 120.5 130.5 135.5 145.5 160.5

Time (minutes)

(b) Calculate estimates for the mean and standard deviation of the times taken by the athletes.

Time in minutes 100.5 − 120.5 120.5 − 130.5 130.5 − 135.5 135.5 − 145.5 145.5 − 160.5
Frequency 18 48 34 32 18

Let’s start by finding the midpoints of each class,


Midpoint 110.5 125.5 133 140.5 153
Frequency 18 48 34 32 18

The formula for grouped mean is,


Σxf
x=
Σx

Let’s find Σxf ,

Σxf = 18 × 110.5 + 48 × 125.5 + 34 × 133 + 32 × 140.5 + 18 × 153

Σxf = 19785

Substitute into the formula for mean,


19785
x=
150
x = 131.9

The mean is,


x = 131.9

19
Now let’s find the standard deviation,
r
Σx2 f
σ= − x2
Σx

Let’s find Σx2 f ,


Σx2 f = 18 × 110.52 + 48 × 125.52 + 34 × 1332 + 32 × 140.52 + 18 × 1532
Σx2 f = 2630272.5

Substitute into the formula for standard deviation,


r
2630272.5
σ= − (131.9)2
150

Simplify,
σ = 11.7

Therefore, the final answer is,


x = 131.9 σ = 11.7

8. The heights, in cm, of the 11 players in each of two teams, the Aces and the Jets, are shown in the
following table. (9709/52/O/N/23 number 4)

Aces 180 174 169 182 181 166 173 182 168 171 164
Jets 175 174 188 168 166 174 181 181 170 188 190

(a) Draw a back-to-back stem-and-leaf diagram to represent this information with the Aces on the
left-hand side of the diagram.

We need to determine a scale for our stem. Our data ranges from 164 to 190 so our
stem will run from 16 to 19 to represent 160 to 190. Now let’s draw the stem-and-leaf
diagram, don’t forget to add the key,
Aces Jets
9 8 6 4 16 6 8
4 3 1 17 0 4 4 5
2 2 1 0 18 1 1 8 8
19 0 6

Key: 1 | 17 | 0 means 171 cm for Aces and 170 cm for Jets


(b) Find the median and interquartile range of the heights of the players in the Aces.

Let’s start by finding the median. The median position is,


n+1
q2 =
2
11 + 1
q2 =
2
q2 = 6

20
Identify the data point in position 6,

q2 = 173 cm

Now let’s find the interquartile range. The formula for interquartile range is,

IQR = q3 − q1

The position for the upper quartile is,


3
q3 = (n + 1)
4
3
q3 = (11 + 1)
4
q3 = 9

Find the data point in position 9,


q3 = 181

Now let’s find the lower quartile,


1
q1 = (n + 1)
4
1
q1 = (11 + 1)
4
q1 = 3

Find the data point in position 3,


q1 = 168

Let’s substitute into the formula for interquartile range,

IQR = 181 − 168

IQR = 13 cm

Therefore, the final answer is,

q2 = 173 cm IQR = 13 cm

(c) Give one comment comparing the spread of the heights of the Aces with the spread of the
heights of the Jets.

Our answer would be,

21
If you look at the stem and leaf diagram above, you will notice that the Jets have
a wider range (166 to 190), than the

The Jets have a wider range of heights, (190 − 160) = 24, than the Aces, (182 − 164) = 18.

9. The weights, x kg, of 120 students is a sports college are recorded. The results are summarised in
the following table. (9709/53/O/N/23 number 4)

Weight (x kg) x ≤ 40 x ≤ 60 x ≤ 65 x ≤ 70 x ≤ 85 x ≤ 100


Cumulative Frequency 0 14 38 60 106 120

(a) Draw a cumulative frequency graph to represent this information.

Label the y-axis as cumulative frequency and the x-axis as weights (x kg). Plot the
upper bounds against the cumulative frequency. Join the plots to form an s-shaped
curve,
120

110

100

90

80
Cumulative Frequency

70

60

50

40

30

20

10

0
40 50 60 70 80 90 100
Weight (x kg)

(b) It is found that 35% of the students weigh more than W kg. Use your graph to estimate the
value of W .

This means that 65% of the students weight less than W kg. Let’s find 65% of 120,
65
× 120 = 78
100

22
Draw construction lines at a cumulative frequency of 78 and read off the weight,

W = 76 kg

Therefore, the final answer is,


W = 76 kg
(c) Calculate estimates for the mean and standard deviation of the weights of the 120 students.
Weight (x kg) x ≤ 40 x ≤ 60 x ≤ 65 x ≤ 70 x ≤ 85 x ≤ 100
Cumulative Frequency 0 14 38 60 106 120

Let’s start by finding the midpoints and frequencies of each class,


Midpoint 20 50 62.5 67.5 77.5 92.5
Frequency 0 14 24 22 46 14

The formula for grouped mean is,


Σxf
x=
Σx

Let’s find Σxf ,

Σxf = 0 × 20 + 14 × 50 + 24 × 62.5 + 22 × 67.5 + 46 × 77.5 + 14 × 92.5

Σxf = 8545

Substitute into the formula for mean,


8545
x=
120
x = 71.2

The mean is,


x = 71.2

Now let’s find the standard deviation,


r
Σx2 f
σ= − x2
Σx

Let’s find Σx2 f ,

Σx2 f = 0 × 202 + 14 × 502 + 24 × 62.52 + 22 × 67.52 + 46 × 77.52 + 14 × 92.52

Σx2 f = 625062.5

Substitute into the formula for standard deviation,


s  2
625062.5 8545
σ= −
120 120

23
Simplify,
σ = 11.8

Therefore, the final answer is,


x = 71.2 σ = 11.8
10. Helen measures the lengths of 150 fish of a certain species in a large pond. These lengths, correct
to the nearest centimetre, are summarised in the following table. (9709/52/F/M/20 number 7)
Length (cm) 0 − 9 10 − 14 15 − 19 20 − 30
Frequency 15 48 66 21
(a) Draw a cumulative frequency graph to illustrate the data.

Notice how there are gaps between our classes. Let’s apply continuity correction to
get rid of the gaps, and let’s also find the cumulative frequency,
Length (cm) 0 − 9.5 9.5 − 14.5 14.5 − 19.5 19.5 − 30.5
Cumulative Frequency 15 63 129 150

Label the y-axis as cumulative frequency and the x-axis as Length (cm). Plot the
cumulative frequency against the upper bounds of the length. Join the plots to form
an s-shaped curve,
150
140
130
120
110
100
Cumulative Frequency

90
80
70
60
50
40
30
20
10
0
0 5 10 15 20 25 30
Length (cm)

(b) 40% of these fish have a length of dcm or more. Use your graph to estimate the value of d.

24
This means that 60% of these fish have a length less than d. Let’s find 60% of 150,
60
× 150 = 90
100

Draw construction lines at a cumulative frequency of 90 and read off the length,

d = 16.5 cm

Therefore, the final answer is,


d = 16.5 cm
The mean length of these 150 fish is 15.295cm.
(c) Calculate an estimate for the variance of the lengths of the fish.
Length (cm) 0 − 9 10 − 14 15 − 19 20 − 30
Frequency 15 48 66 21

Let’s start by drawing a table with the midpoints of the classes against their fre-
quencies,
Mid Interval 4.75 12 17 25
Frequency 15 48 66 21

The formula for variance is,


Σx2 f
σ2 = − x2
Σf

We need to find Σx2 f ,

Σx2 f = 4.752 × 15 + 122 × 48 + 172 × 66 + 252 × 21

Σx2 f = 39449.4375

Substitute into the formula for standard deviation,


39449.4375
σ2 = − (15.295)2
150

Simplify,
σ 2 = 29.1

Therefore, the final answer is,


σ 2 = 29.1

11. The following table gives the weekly snowfall, in centimetres, for 11 weeks in 2018 at two ski resorts,
Dados and Linva. (9709/52/O/N/20 number 5)

Dados 6 8 12 15 10 36 42 28 10 22 16
Linva 2 11 15 16 0 32 36 40 10 12 9

(a) Represent the information on a back-to-back stem-and-leaf diagram.

25
We need to determine a scale for our stem. Our data ranges from 0 to 42 so our
stem will run from 0 to 4 to represent 0 to 40. Now let’s draw the stem-and-leaf
diagram, don’t forget to add the key,
Dados Linva
8 6 0 0 2 9
6 5 2 0 0 1 0 1 2 5 6
8 2 2
6 3 2 6
2 4 0

Key: 6 | 3 | 2 means 36 cm for Dados and 32 cm for Linva


(b) Find the median and the interquartile range for the weekly snowfall in Dados.

Let’s start by finding the median. The median position is,


n+1
q2 =
2
11 + 1
q2 =
2
q2 = 6

Identify the data point in position 6,

q2 = 15 cm

Now let’s find the interquartile range. The formula for interquartile range is,

IQR = q3 − q1

The position for the upper quartile is,


3
q3 = (n + 1)
4
3
q3 = (11 + 1)
4
q3 = 9

Find the data point in position 9,


q3 = 28

Now let’s find the lower quartile,


1
q1 = (n + 1)
4
1
q1 = (11 + 1)
4
q1 = 3

26
Find the data point in position 3,
q1 = 10

Let’s substitute into the formula for interquartile range,

IQR = 28 − 10

IQR = 18 cm

Therefore, the final answer is,

q2 = 15 cm IQR = 18 cm

(c) The median, lower quartile and upper quartile of the weekly snowfall for Linva are 12, 9 and
32cm respectively. Use this information and your answers to part (b) to compare the central
tendency and the spread of the weekly snowfall in Dados and Linva.

Let’s start by comparing the central tendency,


The median for Dados, 15cm, is higher than the median for Linva, 12. This means that the
snowfall is generally higher in Dados than in Linva.

To compare the spread, we need the interquartile range. Let’s find the interquartile
range for Linva,
IQR = q3 − q1
IQR = 32 − 9
IQR = 23

Now let’s compare the spread,


The interquartile range for Dados, 18cm, is smaller than the interquartile range for Linva,
23cm. This means that the snowfall for Dados is less spread than that of Linva.

12. The times taken, in minutes, by 360 employees at a large company to travel from home to work are
summarised in the following table. (9709/53/O/N/21 number 3)

Time, t minutes 0≤t<5 5 ≤ t < 10 10 ≤ t < 20 20 ≤ t < 30 30 ≤ t < 50


Frequency 23 102 135 76 24

(a) Draw a histogram to represent this information.

Draw a table of class width and frequency density,


Class width 5 5 10 10 20
Frequency density 4.6 20.4 13.5 7.6 1.2

27
Label the y-axis with frequency density and the x-axis with time taken (minutes).
Plot the frequency density against the classes, ensuring each class has its respective
class width,

22

20

18

Frequency density
16

14

12

10

5 10 20 30 40 50

Time taken (minutes)

(b) Calculate an estimate of the mean time taken by an employee to travel to work.
Time, t minutes 0 ≤ t < 5 5 ≤ t < 10 10 ≤ t < 20 20 ≤ t < 30 30 ≤ t < 50
Frequency 23 102 135 76 24

Let’s start by finding the midpoints of each class,


Midpoint 2.5 7.5 15 25 40
Frequency 23 102 135 76 24

The formula for grouped mean is,


Σxf
x=
Σx

Let’s find Σxf ,

Σxf = 2.5 × 23 + 7.5 × 102 + 15 × 135 + 25 × 76 + 40 × 24

Σxf = 5707.5

Substitute into the formula for mean,


5707.5
x=
360
x = 15.9

Therefore, the final answer is,


x = 15.9

28

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy