CH 8 Data Analysis

Chapter Eight discusses data analysis, outlining the importance of understanding data types, which include categorical and numerical data, before applying statistical techniques. It explains various coding methods for data consistency and describes different types of data analysis, including univariate, bivariate, and multivariate analysis, along with specific statistical methods such as regression and correlation. The chapter emphasizes the significance of these analyses in deriving insights and making informed decisions based on the data.

Uploaded by

Abdiman Habibo


CHAPTER EIGHT

DATA ANALYSIS

Data for Analysis

• Data analysis is the process of systematically applying statistical and/or logical techniques to describe and illustrate, condense and recap, and evaluate data.

Before analyzing the data for your research, it is important to know the type of data you have at hand, because the type of data determines the technique you can use.
The following figure summarizes the types of data used in research.
[Figure: types of data]
1 01-09-2024
8.1.1. Quantitative data can be divided into two distinct groups:

A. Categorical and
B. Numerical

A. Categorical data

• These are data whose values cannot be measured numerically as quantities; instead, each observation is placed in a category.
• Categorical data can be further subdivided into nominal and ranked (ordinal) data:
1. Nominal data - values that cannot be measured numerically or ranked. The categories are simply labels, and the analysis counts the number of occurrences in each category of a variable.
Examples of nominal variables:
• Where a person lives (AA, Adama, B/Dar, etc.)
• Gender (male, female)
• Nationality (American, Ethiopian, Chinese)
• Ethnicity (Oromo, Amhara, Tigre, Gurage…)
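Because nominal values are only labels, a frequency count is usually the first summary produced. The sketch below shows one way to do this in Python; the list of responses is made up for illustration and is not taken from the chapter.

```python
from collections import Counter

# Hypothetical responses to "Where do you live?" (illustrative data only)
residence = ["AA", "Adama", "AA", "B/Dar", "Adama", "AA"]

# Count how many times each category occurs
counts = Counter(residence)

# List the categories from most to least frequent
for category, n in counts.most_common():
    print(f"{category}: {n}")
```

The counts say nothing about order or distance between categories, which is why only counting-based statistics apply to nominal data.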
2. Ranked/ordinal data - values that can be placed in rank order.
Examples of ordinal data:
• Education (elementary school, high school, college diploma, college degree, master's)
• Agreement (strongly disagree, disagree, neutral, agree, strongly agree)
• Rating (poor, fair, good, excellent)
• Frequency (never, sometimes, often, always)
• Any other scale (“On a scale of 1 to 5...”)

• Descriptive (categorical) data with only two categories are known as dichotomous data.
• E.g. gender can be divided into female and male,
• or questions with a 'yes' or 'no' response.
B. Numerical data

• Numerical data, sometimes termed 'quantifiable', are those whose values are measured or counted numerically as quantities.
• Numerical data can be analysed using a far wider range of statistics than categorical data.
Coding the Data
• Coding is the process of translating information gathered from questionnaires or other sources into something that can be analyzed.
• It involves assigning a value to the information given; often the value is given a label.
• Coding can make data more consistent.
• Example: Question = Sex
• Answers = Male, Female, M, or F
• Coding each answer as a single value avoids such inconsistencies.
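The Sex example can be sketched in Python as a small code book that maps every accepted spelling to one value. The raw answers, the codes 1 and 2, and the helper name `code_sex` are assumptions made for this illustration.

```python
# Hypothetical raw answers to the question "Sex" (illustrative data only)
raw_answers = ["Male", "F", "female", "M", "Female", " m "]

# Code book: every accepted spelling maps to one numeric code.
# The values 1 and 2 are an arbitrary choice for this sketch.
CODE_BOOK = {"male": 1, "m": 1, "female": 2, "f": 2}
LABELS = {1: "Male", 2: "Female"}


def code_sex(answer):
    """Return the numeric code for an answer, or None if it is not recognised."""
    return CODE_BOOK.get(answer.strip().lower())


coded = [code_sex(a) for a in raw_answers]
print(coded)
print([LABELS[c] for c in coded])
```

Keeping the labels next to the codes records what each value means, so the coded data stay interpretable.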
Coding Systems
• Common coding systems (code and label) for dichotomous variables:

0 = No, 1 = Yes
(1 = value assigned, Yes = label of value)
OR: 1 = No, 2 = Yes

• When you assign a value, you must also make clear what that value means.
• In the first example above 1 = Yes, but in the second example 1 = No.
• As long as it is clear how the data are coded, either is fine.
Coding: Ordinal Variables
• The coding process is similar to that for other categorical variables.
• Example: variable EDUCATION, possible coding:

0 = Did not graduate from high school
1 = High school graduate
2 = Some college or post-high school education
3 = College graduate

• The codes could also run in reverse order (0 = college graduate, 3 = did not graduate from high school), as long as they preserve the ranking.
Coding: Nominal Variables
• For coding nominal variables, order makes no difference.
• Example: variable RESIDENCE

1 = Northeast
2 = South
3 = Northwest
4 = Midwest
5 = Southwest

• Order does not matter: no ordered value is associated with each response.
Coding: Continuous Variables
• Creating categories from a continuous variable (e.g. age) is common.
• A continuous variable may be broken down into chosen categories, creating an ordinal categorical variable.
• Example: variable = AGE

1 = 0–9 years old
2 = 10–19 years old
3 = 20–39 years old
4 = 40–59 years old
5 = 60 years or older
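The AGE coding scheme above can be written as a short Python function. This is a minimal sketch that assumes ages are recorded in whole years; the function name `age_category` and the sample ages are hypothetical.

```python
from bisect import bisect_right

# Lower bounds of categories 2 to 5 in the AGE coding scheme
LOWER_BOUNDS = [10, 20, 40, 60]


def age_category(age):
    """Map an age in years to the ordinal codes 1-5 of the AGE example."""
    if age < 0:
        raise ValueError("age cannot be negative")
    # bisect_right counts how many lower bounds are <= age
    return bisect_right(LOWER_BOUNDS, age) + 1


ages = [4, 10, 19, 20, 39, 40, 59, 60, 85]  # hypothetical ages
print([age_category(a) for a in ages])
```

Grouping discards detail: once ages are coded this way, the original values cannot be recovered from the codes.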
8.2. Types of Data Analysis
• Data analysis is the process of inspecting, cleaning, transforming, and modelling data with the goal of discovering useful information, suggesting conclusions, and supporting decision making.
• Data analysis can be made using:
(i) Descriptive statistics
(ii) Inferential statistics
• Descriptive statistics are used to describe, summarize, or explain a given set of data.
• Inferential statistics are used to infer characteristics of a population from a sample.
8.2.1. Univariate Analysis
• Univariate analysis describes a single variable in terms of the applicable unit of analysis.
• Measures of central tendency and measures of dispersion are the typical categories of univariate analysis.
A. Measures of Central Tendency

• The three most frequently used measures of central tendency are:
  • Mode
  • Median
  • Mean
1. Mode
• The mode is the most frequently occurring value in a group of observations.
• If the scores for a given sample distribution are:
32, 32, 35, 36, 37, 38, 38, 39, 39, 39, 40, 40, 42, 45
• then the mode is 39, because a score of 39 occurs three times, more than any other score.
• The mode is a very good measure of the location of a distribution in the case of nominal data.
2. Median
• The median is the middle value in an ordered arrangement of observations.
• The median is often used to summarize the location of a distribution.
• Further, the median can be used with ordinal, interval, or ratio measurements.
• If the scores for a given sample distribution are:
32, 32, 35, 36, 37, 38, 38, 39, 39, 39, 40, 40, 42, 45
the two middle values of the 14 scores are 38 and 39, so the median is (38 + 39) / 2 = 38.5.
3. Mean
• The arithmetic mean is the most commonly used and accepted measure of central tendency.
• It should be used in the case of interval or ratio data.
• If the scores for a given sample distribution are:
32, 32, 35, 36, 37, 38, 38, 39, 39, 39, 40, 40, 42, 45
the mean of the distribution is:
(32 + 32 + 35 + 36 + 37 + 38 + 38 + 39 + 39 + 39 + 40 + 40 + 42 + 45) / 14 = 532 / 14 = 38
• Mid-mean, geometric mean, and mid-range are other types of means (p. 139 of QRM).
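The three measures can be checked for the sample scores with Python's standard `statistics` module. This is only a sketch of the calculations described above, not a method prescribed by the chapter.

```python
import statistics

# The sample scores used in the mode, median, and mean examples
scores = [32, 32, 35, 36, 37, 38, 38, 39, 39, 39, 40, 40, 42, 45]

mode = statistics.mode(scores)      # most frequently occurring value
median = statistics.median(scores)  # average of the two middle values (n is even)
mean = statistics.mean(scores)      # sum of the scores divided by n

print(mode, median, mean)
```

For these scores the three measures are close to each other (39, 38.5, and 38).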
Bivariate Analysis/Relationships between Variables

• Bivariate analysis helps researchers to know the nature, direction, and significance of the relationship between two variables in the study.
• Often in practical situations, researchers are interested in describing associations between variables.
• They try to ascertain how two variables are related to each other, that is, whether a change in one is associated with a change in the other.
• The measure of association depends on the nature of the data, and the association could be positive, negative, or neutral.
8.2.1.1. Relation between two nominal variables - χ² Test

• This analysis technique is used to find out whether there is a relationship between two nominal variables.
• E.g. Is viewing a television advertisement of a product (yes/no) related to buying that particular product (buy/not buy)?
• E.g. An international business researcher wants to establish whether the performance of a firm (categorized as loss, breakeven, and profit) depends on the country in which it is located (categorized as low, middle, and high income).
There are three different types of chi-square analysis:
1. Chi-square test for goodness of fit
2. Chi-square test for homogeneity
3. Chi-square test of independence

• The first is used to see whether the sample has been drawn from a population with a specified distribution, and the second whether populations are homogeneous with respect to a given characteristic.
• These two are less common, and we will focus on the third type of test.
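A chi-square test of independence compares the observed counts in a cross-tabulation with the counts expected if the two variables were unrelated. The Python sketch below works through a 2 x 2 table for the advertisement example; the counts are invented for illustration, no continuity correction is applied, and the closed-form p-value used here holds only for one degree of freedom.

```python
import math

# Hypothetical 2x2 contingency table (illustrative counts only)
# rows: viewed the advertisement (yes, no); columns: (buy, not buy)
observed = [[30, 20],
            [15, 35]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
n = sum(row_totals)

# Expected count under independence: row total * column total / n
chi_square = 0.0
for i, row in enumerate(observed):
    for j, obs in enumerate(row):
        expected = row_totals[i] * col_totals[j] / n
        chi_square += (obs - expected) ** 2 / expected

# Degrees of freedom: (rows - 1) * (columns - 1)
df = (len(observed) - 1) * (len(observed[0]) - 1)

# For df = 1 only, the upper-tail probability has this closed form
p_value = math.erfc(math.sqrt(chi_square / 2))

print(f"chi-square = {chi_square:.3f}, df = {df}, p = {p_value:.4f}")
```

A small p-value suggests that the two nominal variables are related. For larger tables the p-value has to come from a chi-square table or a statistics package.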
8.2.1.2. Correlation Analysis
• Correlation is a measure of the relationship between two variables. It has wide application in business and statistics.
• The correlation coefficient describes the direction of the correlation, that is, whether it is
  • positive or
  • negative,
• and the strength of the correlation, that is, whether an existing correlation is
  • strong or
  • weak.
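The slides do not name a specific coefficient; the sketch below assumes the Pearson correlation coefficient, which is the usual choice for two numerical variables. The advertising and sales figures are made up for illustration.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # Sums of cross-products and squared deviations from the means
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical data: advertising spend and sales (illustrative only)
advertising = [1, 2, 3, 4, 5]
sales = [2, 4, 5, 4, 5]

r = pearson_r(advertising, sales)
direction = "positive" if r > 0 else "negative" if r < 0 else "none"
print(f"r = {r:.3f} ({direction})")
```

The sign of r gives the direction of the relationship, and its absolute value, which lies between 0 and 1, gives the strength.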
8.2.1.3. Bivariate regression analysis
• Regression is one of the most frequently used techniques in business and social research.
• Regression analysis is used to predict the value of one variable (the dependent variable) on the basis of other variables (the independent variables).
• The most common form of regression is linear regression, where the dependent variable is related to the independent variable in a linear way.
• The linear regression equation takes the following form:

Y = β0 + β1X + ε

Variables:
X = Independent variable (we provide this)
Y = Dependent variable (we observe this)
Parameters:
β0 = Y-intercept
β1 = Slope
ε = Error term
Note: β1 indicates the change in the dependent variable for every unit change in the independent variable.
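The slides do not say how β0 and β1 are estimated; the sketch below assumes ordinary least squares, the standard method for linear regression. The function name `fit_line` and the data are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least squares estimates (b0, b1) for y = b0 + b1*x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    b1 = sxy / sxx             # slope: change in y per unit change in x
    b0 = mean_y - b1 * mean_x  # intercept: predicted y when x = 0
    return b0, b1


# Hypothetical observations (illustrative only)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

b0, b1 = fit_line(x, y)
predicted = [b0 + b1 * xi for xi in x]
print(f"intercept = {b0:.2f}, slope = {b1:.2f}")
```

The differences between the observed y values and `predicted` are the estimated error terms.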
Regression coefficient

• Is the measure of how strongly the predictor (the independent variable, IDV) predicts the dependent variable (DV).
• There are two types of regression coefficients:
1. Unstandardized coefficients
2. Standardized coefficients (Beta values)

• The unstandardized coefficients can be used in the equation as the coefficients of the different independent variables, along with the constant term, to predict the value of the dependent variable.
  o Difference in “Y” per unit change in “X”
• The standardized coefficient (Beta) is measured in standard deviations, i.e. the difference in “Y” in standard deviations per standard deviation difference in “X”.
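The two kinds of coefficient are linked by the standard deviations of the variables: Beta equals the unstandardized coefficient multiplied by the standard deviation of X and divided by the standard deviation of Y. The Python sketch below applies this to made-up data; the helper name `standardized_beta` is hypothetical.

```python
import statistics


def standardized_beta(b1, x, y):
    """Convert an unstandardized slope to a standardized (Beta) coefficient."""
    return b1 * statistics.stdev(x) / statistics.stdev(y)


# Hypothetical observations (illustrative only)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
b1 = 0.6  # unstandardized least squares slope for these data

beta = standardized_beta(b1, x, y)
print(f"unstandardized b1 = {b1:.2f}, standardized Beta = {beta:.3f}")
```

With a single predictor, Beta equals the correlation coefficient between X and Y. Because Beta has no units, it allows predictors measured on different scales to be compared.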
R values
• R represents the correlation between the observed values and the predicted values (based on the regression equation obtained) of the dependent variable.
• It is used to measure the fit of the model used for the research.

• R square is the square of R and gives the proportion of variance in the dependent variable accounted for by the set of independent variables chosen for the model.
• The R square value is influenced by the number of independent variables and the number of cases; adding independent variables never lowers it, even when they explain little.
• Therefore the adjusted R square, which takes these things into account, provides more accurate information about the fit of the model.
• While it is not uncommon to get an R square value as high as 0.99 in natural science, a much lower value (0.10–0.20) of R square is acceptable in social science research.
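The slides do not give the adjustment; the sketch below uses the standard formula, adjusted R square = 1 - (1 - R square)(n - 1) / (n - k - 1), where n is the number of cases and k the number of independent variables. The input values are hypothetical.

```python
def adjusted_r_square(r_square, n_cases, n_predictors):
    """Adjusted R square for a model with an intercept."""
    if n_cases - n_predictors - 1 <= 0:
        raise ValueError("need more cases than predictors plus one")
    return 1 - (1 - r_square) * (n_cases - 1) / (n_cases - n_predictors - 1)


# Same R square and number of predictors, different sample sizes (hypothetical values)
small_sample = adjusted_r_square(0.20, n_cases=30, n_predictors=3)
large_sample = adjusted_r_square(0.20, n_cases=300, n_predictors=3)
print(f"n = 30: {small_sample:.3f}, n = 300: {large_sample:.3f}")
```

The correction is largest when there are many predictors relative to the number of cases, and it shrinks as the sample grows.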
2. Multicollinearity

• Is a situation in which two or more independent variables (IVs) are highly correlated with each other.
• If variables are highly correlated with each other, it is difficult to come up with reliable estimates of their individual regression coefficients.
• In other words, when two variables are highly correlated, they both convey essentially the same information.
How to know the presence of multicollinearity?
1. If the Variance Inflation Factor (VIF) is greater than 5, which means the Tolerance is below 0.2, because tolerance is the inverse of VIF.
2. If any two IDVs have a variance proportion in excess of 0.9 (column value) corresponding to any row in which the condition index is in excess of 30.

• If there is a serious multicollinearity problem, try solutions such as:
  • Removing highly correlated predictors
  • Linearly combining predictors, such as adding them together
  • Running entirely different analyses, such as principal components analysis (to know similarities and differences)
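The VIF of a predictor is conventionally computed as 1 / (1 - R square), where R square comes from regressing that predictor on the other predictors. The Python sketch below applies the VIF > 5 rule to the two-predictor case, where that R square is just the squared correlation between the two predictors; the correlation values are hypothetical.

```python
def vif(r_square_j):
    """VIF for predictor j, given R square from regressing it on the other predictors."""
    if not 0 <= r_square_j < 1:
        raise ValueError("R square must be in [0, 1)")
    return 1 / (1 - r_square_j)


# With only two predictors, R square for each one is their squared correlation
results = {}
for r in (0.50, 0.95):  # hypothetical correlations between two predictors
    v = vif(r ** 2)
    tolerance = 1 / v   # tolerance is the inverse of VIF
    results[r] = (v, tolerance)
    flag = "possible multicollinearity" if v > 5 else "ok"
    print(f"r = {r:.2f}: VIF = {v:.2f}, tolerance = {tolerance:.3f} ({flag})")
```

A correlation of 0.95 between the two predictors gives a VIF above 5 and a tolerance below 0.2, so both versions of the rule flag the same case.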
8.2.2. Multivariate Analysis
• In many real-life situations it becomes necessary to analyse the relationships among three or more variables, which has led to the popularity of multivariate statistics.
• Multivariate statistical techniques look at the pattern of relationships between several variables simultaneously.
• The following section deals with categories of multivariate analysis techniques.
8.2.2.1. Multiple Linear Regression
• In simple regression there is one dependent variable and one independent variable, whereas in multiple regression there is one dependent variable and many independent variables.
• It examines the relationship between a single metric dependent variable and two or more metric independent variables.
• Assumptions of normality and linearity should be checked before using multiple regression.
• The multiple regression equation takes the form:

y = a + b1x1 + b2x2 + … + bkxk

Where: y is the dependent variable, x1, x2, … xk are the independent variables, a is the Y-intercept, and b1, b2, … bk are the regression coefficients.

Note: All the conditions and tests above are common in the case of multivariate analysis too.
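As a rough illustration of how the intercept and the regression coefficients can be estimated, the sketch below fits a model with two independent variables by least squares, solving the normal equations in plain Python. The estimation method, the function names, and the data are assumptions made for this example; in practice a statistics package would be used.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix [a | b]
    for col in range(n):
        # Use the row with the largest pivot to limit rounding error
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_multiple(xs, y):
    """Least squares estimates [a, b1, ..., bk] for y = a + b1*x1 + ... + bk*xk."""
    rows = [[1.0] + list(x) for x in xs]  # leading 1 for the intercept
    k = len(rows[0])
    # Normal equations: (X'X) coef = X'y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve(xtx, xty)


# Hypothetical data generated exactly from y = 1 + 2*x1 + 3*x2
xs = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 6)]
y = [1 + 2 * x1 + 3 * x2 for x1, x2 in xs]

a, b1, b2 = fit_multiple(xs, y)
print(f"a = {a:.2f}, b1 = {b1:.2f}, b2 = {b2:.2f}")
```

Because the data were generated without an error term, the fit recovers the generating coefficients up to rounding. With real data the estimates would also carry sampling error.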
End

Thanks

Questions
