MATH2016 - 2021 - S2 Notes Week 1

This document provides definitions and examples of key concepts in statistics including populations, samples, variables, descriptive and inferential statistics, frequency distributions, histograms, cumulative frequencies, measures of central tendency, and variance. Formulas for calculating variances and means for both populations and samples are also presented.

Uploaded by

Forza Bee

MATH2026

Week 1 & 2

Statistics is a collection of methods for collecting, analyzing, presenting and interpreting data and for
making decisions.

Definition A population consists of all elements whose characteristics are of interest.

Definition A sample is a portion of the population selected for study.

Definition An element or member of a sample is a specific subject or object about which data is
collected.

Example If we are interested in the set of all cars in a city, this set will be the population. To obtain
information about this population, we may select some cars from the population and study them. This
subset of cars would be the sample.

Definition Descriptive statistics consists of methods for organizing, displaying and describing data by
using tables, graphs and summary measures.

Definition Inferential statistics consists of methods that use sample results to help make decisions or
predictions about a population.

Example In the previous example, we may obtain the ages of cars in the sample to get an idea of the
ages of cars in the population.

Definition A variable is a characteristic under study that assumes different values for different
elements.

Example Age of a car in the first example.

Definition The value of a variable for an element is called an observation or measurement.

Definition A data set is a collection of observations on one or more variables.

Definition A quantitative variable is one that can be measured numerically.

Example The weight of a bolt.

Definition A qualitative variable cannot assume a numerical value, but can be classified into two or more
nonnumeric categories.

Example The colour of an object.

Definition A variable whose values are countable is called a discrete variable.

Example The number of students in a class.

Definition A variable that can assume any numerical value over a certain interval or intervals is called a
continuous variable.

Example The mass of a tyre.

Frequency Distribution

Example In a test, students obtained the following marks out of a maximum of 10: 5, 6, 9, 3, 8, 8, 9, 10,
6, 8, 5, 8, 9, 3, 3, 3, 3, 8, 8, 3

Frequency distribution

Mark f

3 6
4 0
5 2
6 2
7 0
8 6
9 3
10 1

Note The frequency of 3 is 6, the frequency of 4 is 0 and so on.

Note For a small number of discrete values, a frequency distribution like the above one is suitable.
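
The tally above can be reproduced with a short Python sketch. This is only an illustration, not part of the original notes; the names `marks` and `frequency_table` are chosen here for convenience.

```python
from collections import Counter

# Marks from the example above (maximum possible mark is 10).
marks = [5, 6, 9, 3, 8, 8, 9, 10, 6, 8, 5, 8, 9, 3, 3, 3, 3, 8, 8, 3]

counts = Counter(marks)

# List every mark from the smallest to the largest observed value, so that
# marks which never occur (4 and 7) still appear with frequency 0.
frequency_table = {
    mark: counts.get(mark, 0) for mark in range(min(marks), max(marks) + 1)
}

for mark, f in frequency_table.items():
    print(mark, f)
```

The frequencies add up to 20, the number of students who wrote the test.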

Frequency distribution for grouped data

Histogram A histogram is a graph with the classes on the horizontal axis and the frequencies (or relative
frequencies or percentages) on the vertical axis.

Definition The relative frequency of a class is

$$\text{relative frequency} = \frac{\text{frequency of that class}}{\text{sum of all frequencies}}$$

Percentage = (relative frequency) × 100%

Example Students wrote a test, for which the maximum possible mark was 25. The marks obtained by
the students were 5,6,6,8,11, 11, 11, 12, 12, 13, 14, 14, 14, 14, 15, 16, 16, 16, 16, 16, 16, 17, 18, 18, 18,
18, 19, 19, 22, 22, 22, 23, 24

Classes    Class Boundaries          Frequency    Relative Frequency
5 – 9      4.5 to less than 9.5      4            4/33 ≈ 0.121
10 – 14    9.5 to less than 14.5     10           10/33 ≈ 0.303
15 – 19    14.5 to less than 19.5    14           14/33 ≈ 0.424
20 – 24    19.5 to less than 24.5    5            5/33 ≈ 0.152

The histogram corresponding to the above table is drawn below

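[Figure: histogram with the class boundaries 4.5, 9.5, 14.5, 19.5, 24.5 on the horizontal axis and frequency on the vertical axis.]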
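
As a rough check of the grouped table, the following Python sketch counts the marks in each class and converts the counts to relative frequencies and percentages. The helper name `class_frequencies` is an assumption made for this illustration; each class is treated as "lower boundary to less than upper boundary", as in the table.

```python
marks = [5, 6, 6, 8, 11, 11, 11, 12, 12, 13, 14, 14, 14, 14, 15, 16, 16,
         16, 16, 16, 16, 17, 18, 18, 18, 18, 19, 19, 22, 22, 22, 23, 24]

# Class boundaries taken from the table.
boundaries = [4.5, 9.5, 14.5, 19.5, 24.5]


def class_frequencies(values, edges):
    """Count the values falling in each class [lower, upper)."""
    return [
        sum(1 for v in values if lower <= v < upper)
        for lower, upper in zip(edges, edges[1:])
    ]


frequencies = class_frequencies(marks, boundaries)
total = sum(frequencies)

# Relative frequency = frequency of the class / sum of all frequencies.
relative = [f / total for f in frequencies]
percentages = [100 * r for r in relative]

for (lower, upper), f, r in zip(zip(boundaries, boundaries[1:]), frequencies, relative):
    print(f"{lower} to less than {upper}: f = {f}, relative frequency = {r:.3f}")
```

The relative frequencies always sum to 1, which is a quick way to catch a miscounted class.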

Definition A frequency polygon is a graph formed by joining the midpoints of the tops of successive bars in
a histogram with straight lines.

Using the above histogram, we can insert the frequency polygon as shown below

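[Figure: the histogram with the frequency polygon superimposed. Horizontal axis: class boundaries 4.5, 9.5, 14.5, 19.5, 24.5; polygon points at 2, 7, 12, 17, 22, 27.]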

Cumulative Frequency

A cumulative frequency distribution gives the total number of values that fall below the upper
boundary of each class.

Example From previous example, we obtain the following cumulative frequency distribution.

Class      Class Boundaries          Cumulative Frequency
5 – 9      4.5 to less than 9.5      4
10 – 14    4.5 to less than 14.5     14
15 – 19    4.5 to less than 19.5     28
20 – 24    4.5 to less than 24.5     33
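
The cumulative column is a running total of the class frequencies. A minimal Python sketch, assuming the class frequencies 4, 10, 14, 5 from the previous example (the variable names are illustrative only):

```python
from itertools import accumulate

upper_boundaries = [9.5, 14.5, 19.5, 24.5]
frequencies = [4, 10, 14, 5]

# Running total: each entry counts the values below that upper boundary.
cumulative = list(accumulate(frequencies))

# Pair each upper boundary with its cumulative frequency.
points = list(zip(upper_boundaries, cumulative))

for boundary, total in points:
    print(f"4.5 to less than {boundary}: {total}")
```

The last cumulative frequency equals the total number of observations, here 33.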

Definition An ogive is a graph drawn by joining with straight lines the dots marked above the upper
boundaries of the classes at heights equal to the cumulative frequencies of the respective classes.

Example An ogive corresponding to the example above is drawn below.

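[Figure: ogive with the class boundaries 4.5, 9.5, 14.5, 19.5, 24.5 on the horizontal axis and cumulative frequency on the vertical axis.]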

Stem and Leaf Display

Example Students got the following marks in a test: 75, 52, 80, 96, 71, 53, 78, 81, 75, 59, 57, 52

The stem and leaf display for this data is

5 | 2 3 9 7 2
6 |
7 | 5 1 8 5
8 | 0 1
9 | 6
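
The display can be built mechanically by splitting each mark into a tens digit (the stem) and a units digit (the leaf). The sketch below is one possible way to do this in Python; the function name `stem_and_leaf` is an assumption, and the leaves are kept in the order in which the marks were listed, as in the display above.

```python
from collections import defaultdict

marks = [75, 52, 80, 96, 71, 53, 78, 81, 75, 59, 57, 52]


def stem_and_leaf(values):
    """Map each tens digit (stem) to its units digits (leaves), in data order."""
    leaves = defaultdict(list)
    for v in values:
        stem, leaf = divmod(v, 10)
        leaves[stem].append(leaf)
    # Include stems with no leaves (such as 6) so the display has no gaps.
    return {s: leaves.get(s, []) for s in range(min(leaves), max(leaves) + 1)}


display = stem_and_leaf(marks)
for stem, row in display.items():
    print(stem, "|", " ".join(str(leaf) for leaf in row))
```

Sorting each row of leaves would give the ranked version of the display.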

Bar Graphs
Example 30 employees of a company were asked how stressful their job was and the following
frequency distribution drawn up to illustrate their responses:

Stress on Job         Frequency
Very stressful        10
Somewhat stressful    14
Not stressful         6

Bar chart

[Figure: bar chart of the frequencies above, with bars labelled "very" (very stressful), "somewhat" (somewhat stressful) and "none" (not stressful).]

The bars are of the same width and with equal spacing.

Definition The mean of a list of numbers is the arithmetic average.

Definition The mode of a list of numbers is the value that occurs most often. There may be more than
one mode if more than one value occurs the maximum number of times.

Definition The median of an odd number of values is the one in the middle when the numbers are
written in ascending order. The median of an even number of values is the average of the two in the
middle when the numbers are written in ascending order.

Example 3, 4, 9, 9, 9, 10

median = $\frac{9 + 9}{2} = 9$
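
The three measures can be checked with Python's standard `statistics` module. This is a small illustration using the values from the example; `multimode` is used because it returns every value that ties for the highest frequency.

```python
import statistics

values = [3, 4, 9, 9, 9, 10]

mean = statistics.mean(values)        # arithmetic average
median = statistics.median(values)    # average of the two middle values here
modes = statistics.multimode(values)  # list of all most frequent values

print(f"mean = {mean:.2f}, median = {median}, mode(s) = {modes}")
```

For a list such as 1, 1, 2, 2, 3 the same call to `multimode` would return both 1 and 2.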

Variance of Population Given a population $\{x_1, x_2, \ldots, x_N\}$, the population variance is

$$\sigma^2 = \frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}$$

where $\mu$ represents the mean of the population.

Standard deviation for population = $\sigma$ = square root of variance for population

Sample Variance Given a sample $\{x_1, x_2, \ldots, x_n\}$ from some population, the sample variance is

$$s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}$$

where $\bar{x}$ is the sample mean.

Standard deviation for sample = $s$ = square root of variance for sample

http://www.uvm.edu/~dhowell/SeeingStatisticsApplets/N-1.html

Shortcut Formulae for population variance and sample variance for ungrouped data

$$\sigma^2 = \frac{\sum x^2 - \frac{\left(\sum x\right)^2}{N}}{N}$$

$$s^2 = \frac{\sum x^2 - \frac{\left(\sum x\right)^2}{n}}{n - 1}$$

Example Consider the sample 82, 95, 67, 92.
$\bar{x} = 84$

x     $x - \bar{x}$
82    82 - 84 = -2
95    95 - 84 = 11
67    67 - 84 = -17
92    92 - 84 = 8

Variance = $s^2 = \frac{(-2)^2 + (11)^2 + (-17)^2 + (8)^2}{4 - 1} = 159.33$

The standard deviation for a population is the square root of the population variance, and the standard
deviation for a sample is the square root of the sample variance.
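
The following Python sketch works through the sample 82, 95, 67, 92 using both the defining formula and the shortcut formula for the sample variance, and compares the result with the standard library function `statistics.variance`, which also divides by n - 1. The variable names are illustrative only.

```python
import math
import statistics

sample = [82, 95, 67, 92]
n = len(sample)
mean = sum(sample) / n

# Defining formula: sum of squared deviations from the mean, divided by n - 1.
var_definition = sum((x - mean) ** 2 for x in sample) / (n - 1)

# Shortcut formula: needs only the sum of x and the sum of x squared.
var_shortcut = (sum(x * x for x in sample) - sum(sample) ** 2 / n) / (n - 1)

s = math.sqrt(var_definition)

print(f"mean = {mean}, s^2 = {var_definition:.2f}, s = {s:.2f}")
print("library value:", statistics.variance(sample))
```

If the four values were treated as a whole population instead, the divisor would be N = 4 rather than n - 1 = 3, and `statistics.pvariance` would be the matching library call.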

Mean for Grouped Data

Mean for population data: $\mu = \frac{\sum mf}{N}$

Mean for sample data: $\bar{x} = \frac{\sum mf}{n}$

where m is the midpoint and f is the frequency of the class.

Example
The following table gives the daily commuting times in minutes from home to work for all 25 employees
of a company.

Daily commuting time Number of employees

0 to less than 10 4
10 to less than 20 9
20 to less than 30 6
30 to less than 40 4
40 to less than 50 2

Daily commuting time    f    m     mf
0 to less than 10       4    5     20
10 to less than 20      9    15    135
20 to less than 30      6    25    150
30 to less than 40      4    35    140
40 to less than 50      2    45    90
                        N = 25     $\sum mf = 535$

$$\mu = \frac{\sum mf}{N} = \frac{535}{25} = 21.4 \text{ minutes}$$
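
The same calculation can be sketched in Python. Each class is represented here by a hypothetical tuple of (lower limit, upper limit, frequency), and the midpoint m is taken as the average of the two limits.

```python
# (lower limit, upper limit, frequency) for each commuting-time class.
classes = [(0, 10, 4), (10, 20, 9), (20, 30, 6), (30, 40, 4), (40, 50, 2)]


def grouped_mean(rows):
    """Return (mean, sum of m*f, sum of f) for grouped data."""
    total_f = sum(f for _, _, f in rows)
    total_mf = sum((lo + hi) / 2 * f for lo, hi, f in rows)
    return total_mf / total_f, total_mf, total_f


mean, sum_mf, N = grouped_mean(classes)
print(f"N = {N}, sum of mf = {sum_mf}, mean = {mean}")
```

Because every observation in a class is replaced by the class midpoint, the result approximates the mean of the raw commuting times rather than reproducing it exactly.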

Variance and Standard Deviation for Grouped Data

$$\sigma^2 = \frac{\sum f (m - \mu)^2}{N}$$

$$s^2 = \frac{\sum f (m - \bar{x})^2}{n - 1}$$

Shortcut Formulae

$$\sigma^2 = \frac{\sum m^2 f - \frac{\left(\sum mf\right)^2}{N}}{N}$$

$$s^2 = \frac{\sum m^2 f - \frac{\left(\sum mf\right)^2}{n}}{n - 1}$$

Daily commuting time    f    m     mf     $m^2 f$
0 to less than 10       4    5     20     100
10 to less than 20      9    15    135    2025
20 to less than 30      6    25    150    3750
30 to less than 40      4    35    140    4900
40 to less than 50      2    45    90     4050
                        N = 25     $\sum mf = 535$    $\sum m^2 f = 14{,}825$

$$\sigma^2 = \frac{\sum m^2 f - \frac{\left(\sum mf\right)^2}{N}}{N} = \frac{14{,}825 - \frac{(535)^2}{25}}{25} = \frac{3376}{25} = 135.04$$

standard deviation = $\sigma = \sqrt{135.04} = 11.62$
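
As a check, the sketch below computes the population variance of the commuting-time data in two ways, from the defining formula and from the shortcut formula, and confirms that they agree. The list `classes` of (midpoint, frequency) pairs is a representation chosen for this illustration.

```python
import math

# (midpoint m, frequency f) for each commuting-time class in the table.
classes = [(5, 4), (15, 9), (25, 6), (35, 4), (45, 2)]

N = sum(f for _, f in classes)
sum_mf = sum(m * f for m, f in classes)
sum_m2f = sum(m * m * f for m, f in classes)
mu = sum_mf / N

# Defining formula: weighted squared distances of the midpoints from the mean.
var_definition = sum(f * (m - mu) ** 2 for m, f in classes) / N

# Shortcut formula: uses only the column totals from the table.
var_shortcut = (sum_m2f - sum_mf ** 2 / N) / N

sigma = math.sqrt(var_shortcut)
print(f"variance = {var_shortcut:.2f}, standard deviation = {sigma:.2f}")
```

For sample data the only change would be the final divisor, n - 1 in place of N.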

Quartiles

Quartiles are three summary measures that divide a ranked data set into four equal parts. The second
quartile is the same as the median of a data set. The first quartile is the value of the middle term among
the observations that are less than the median, and the third quartile is the value of the middle term
among the observations that are greater than the median.

First quartile = Q1

Second quartile = Q2

Third quartile = Q3

Interquartile range = Q3 - Q1
Example Consider the values 2, 4, 5, 6, 8, 10, 14

Second quartile = 6
First quartile = 4
Third quartile = 10

Example Consider the values 2, 4, 5, 6, 8, 10, 14, 15

Second quartile = (6+8)/2 = 7

First quartile = (4+5)/2 = 4.5
Third quartile = (10+14)/2 = 12

Example Consider the values 2, 4, 5, 6, 8, 10, 14, 15, 17

Second quartile = 8
First quartile = (4+5)/2 = 4.5
Third quartile = (14+15)/2 = 14.5
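
The three examples follow the same recipe: rank the data, find the median, then take the median of the lower half and of the upper half. The Python sketch below implements that recipe under one assumption made for this illustration: the halves are split by position, and the middle value is left out of both halves when the number of values is odd.

```python
import statistics


def quartiles(values):
    """Return (Q1, Q2, Q3) using the median-of-halves method."""
    ordered = sorted(values)
    half = len(ordered) // 2
    lower = ordered[:half]
    # With an odd count, skip the middle value itself.
    upper = ordered[half + 1:] if len(ordered) % 2 else ordered[half:]
    return (
        statistics.median(lower),
        statistics.median(ordered),
        statistics.median(upper),
    )


for data in ([2, 4, 5, 6, 8, 10, 14],
             [2, 4, 5, 6, 8, 10, 14, 15],
             [2, 4, 5, 6, 8, 10, 14, 15, 17]):
    q1, q2, q3 = quartiles(data)
    print(data, "->", q1, q2, q3, "IQR =", q3 - q1)
```

Statistical software often uses interpolation rules instead, so library functions such as `statistics.quantiles` may return slightly different quartiles for the same data.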

Box and Whisker Plot

This is a plot that shows the centre, spread and skewness of a data set. It is constructed by drawing a
box and two whiskers that use the median, the first quartile, the third quartile and the smallest and the
largest values in the data set between the lower and upper inner fences.
Example Consider the sample {35, 29, 44, 72, 34, 64, 41, 50, 54, 104, 39, 58}
We are going to draw a box plot for this sample.

First quartile = Q1 = 37

Second quartile = Q2 = 47

Third quartile = Q3 = 61

Interquartile range = IQR = Q3 - Q1 = 24
Upper inner fence = Q3 + 1.5(IQR) = 61 + 36 = 97
Lower inner fence = Q1 - 1.5(IQR) = 37 - 36 = 1
Smallest value within the two inner fences = 29
Largest value within the two inner fences = 72

Upper outer fence = Q3 + 3.0(IQR) = 133

Lower outer fence = Q1 - 3.0(IQR) = -35

A mild outlier lies outside the inner fences but within the outer fences.
An extreme outlier lies outside the outer fences.

104 is a mild outlier in this example. It is represented by the asterisk.

[Figure: box-and-whisker plot on a scale from 25 to 105, with labels for the smallest value within the two inner fences, the first quartile, the median, the third quartile, the largest value within the two inner fences, and an outlier.]
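
The quantities needed for the box plot can be computed with a short Python sketch. It uses the median-of-halves quartiles described earlier. How to treat a value lying exactly on a fence is not stated in the notes, so counting such a value as inside the fence is an assumption made here.

```python
import statistics

sample = [35, 29, 44, 72, 34, 64, 41, 50, 54, 104, 39, 58]
ordered = sorted(sample)

# Quartiles by the median-of-halves method.
half = len(ordered) // 2
lower = ordered[:half]
upper = ordered[half + 1:] if len(ordered) % 2 else ordered[half:]
q1 = statistics.median(lower)
q2 = statistics.median(ordered)
q3 = statistics.median(upper)
iqr = q3 - q1

inner_low, inner_high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
outer_low, outer_high = q1 - 3.0 * iqr, q3 + 3.0 * iqr

# Whisker ends: the extreme observations that are still inside the inner fences.
whisker_low = min(v for v in ordered if v >= inner_low)
whisker_high = max(v for v in ordered if v <= inner_high)

# Assumption: a value exactly on a fence counts as inside that fence.
mild = [v for v in ordered
        if outer_low <= v < inner_low or inner_high < v <= outer_high]
extreme = [v for v in ordered if v < outer_low or v > outer_high]

print(f"Q1 = {q1}, Q2 = {q2}, Q3 = {q3}, IQR = {iqr}")
print(f"inner fences = ({inner_low}, {inner_high}), outer fences = ({outer_low}, {outer_high})")
print(f"whiskers = ({whisker_low}, {whisker_high}), mild = {mild}, extreme = {extreme}")
```

The box runs from Q1 to Q3 with a line at the median, and the whiskers extend to the two whisker values; any mild or extreme outliers are plotted individually.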
