Data Collection
Define the Population
•Clear Definition: Specify who or what constitutes the population of interest. This includes
defining the characteristics that qualify individuals or items for inclusion in the study.
•Scope and Boundaries: Clearly outline the geographical, temporal, and demographic
boundaries of the population.
Determine the Sample Size
•Statistical Power: Larger samples provide more accurate estimates but are costlier and more time-consuming. Use
power analysis to determine the minimum sample size needed to detect an effect or difference.
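A minimal power-analysis sketch using statsmodels' TTestIndPower; the effect size (Cohen's d = 0.5), alpha (0.05), and power (0.8) are conventional illustrative choices, not values taken from any study here.

```python
# Minimal power-analysis sketch (illustrative values).
from statsmodels.stats.power import TTestIndPower

analysis = TTestIndPower()
# Solve for the per-group sample size needed to detect a medium effect
# (Cohen's d = 0.5) at alpha = 0.05 with 80% power -- common conventions.
n_per_group = analysis.solve_power(effect_size=0.5, alpha=0.05, power=0.8)
print(f"Minimum sample size per group: {n_per_group:.0f}")  # ~64
```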
Choose a Sampling Method
•Probability Sampling: Methods where every member of the population has a known, non-zero chance of being
selected (a code sketch follows this list). This includes:
•Simple Random Sampling: Each member has an equal chance of selection.
•Stratified Sampling: Population is divided into strata, and random samples are drawn from each stratum.
•Cluster Sampling: Population is divided into clusters, some of which are randomly selected, and all members of
chosen clusters are sampled.
•Systematic Sampling: Every nth member of the population is selected after a random start.
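The sketch below illustrates all four probability-sampling methods on a toy population; the column names (stratum, cluster), the population size, and the sampling fractions are assumptions chosen for illustration.

```python
# Sketch of the four probability-sampling methods on a toy population.
import random

import pandas as pd

population = pd.DataFrame({
    "id": range(1000),
    "stratum": [random.choice(["A", "B", "C"]) for _ in range(1000)],
    "cluster": [i // 50 for i in range(1000)],  # 20 clusters of 50 members
})

# Simple random sampling: every member has an equal chance.
srs = population.sample(n=100, random_state=42)

# Stratified sampling: draw 10% from each stratum.
stratified = population.groupby("stratum").sample(frac=0.1, random_state=42)

# Cluster sampling: randomly pick 2 clusters, keep all of their members.
chosen = random.sample(sorted(population["cluster"].unique().tolist()), k=2)
cluster_sample = population[population["cluster"].isin(chosen)]

# Systematic sampling: every 10th member after a random start.
start = random.randrange(10)
systematic = population.iloc[start::10]
```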
Choose a Sampling Method
•Non-Probability Sampling: Methods where some members of the population may have no chance of
being selected. This includes:
•Convenience Sampling: Samples are chosen based on ease of access.
•Judgmental or Purposive Sampling: Samples are selected based on the researcher’s judgment
about which members are most useful or representative.
•Quota Sampling: Samples are selected to ensure certain characteristics are represented in specific
proportions.
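Quota sampling is the most mechanical of these, so a short sketch may help; the quota groups, target counts, and the gender attribute below are illustrative assumptions.

```python
# Quota-sampling sketch: fill fixed quotas per group as respondents arrive.
from collections import defaultdict

quotas = {"female": 50, "male": 50}  # target counts per group (illustrative)
counts = defaultdict(int)
sample = []

def try_add(respondent):
    """Accept a respondent only if their group's quota is not yet full."""
    group = respondent["gender"]
    if counts[group] < quotas.get(group, 0):
        counts[group] += 1
        sample.append(respondent)
        return True
    return False  # quota full: respondent is turned away

# Hypothetical stream of respondents:
for person in [{"gender": "female"}, {"gender": "male"}, {"gender": "female"}]:
    try_add(person)
```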
Bias
•Selection Bias
•Measurement Bias
•Response Bias
•Confirmation Bias
•Observer Bias
•Survivorship Bias
•Attrition Bias
•Confounding Bias
Selection Bias
•Selection bias occurs when the sample is not representative of the population due to non-random selection.
This leads to results that cannot be generalized to the entire population.
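A small simulation makes this concrete: drawing only from an "easy to reach" subgroup shifts the estimate no matter how large the sample gets. All numbers below are synthetic.

```python
# Simulating selection bias: sampling only reachable members skews the mean.
import numpy as np

rng = np.random.default_rng(0)
income = rng.lognormal(mean=10, sigma=0.5, size=100_000)  # toy population

random_sample = rng.choice(income, size=500, replace=False)

# Non-random selection: suppose only the top half is reachable (e.g. an
# online survey that misses people without internet access).
reachable = income[income > np.median(income)]
biased_sample = rng.choice(reachable, size=500, replace=False)

print(f"Population mean: {income.mean():,.0f}")
print(f"Random sample:   {random_sample.mean():,.0f}")  # close to population
print(f"Biased sample:   {biased_sample.mean():,.0f}")  # systematically high
```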
Measurement Bias
Measurement bias occurs when the data collection instruments or procedures systematically favor certain
outcomes over others.
•Instrument Bias: Arises from faulty or biased measurement tools. For example, a poorly calibrated scale
that consistently gives incorrect weight readings.
•Interviewer Bias: Occurs when the interviewer's behavior or questioning influences the responses. For example,
leading questions can sway respondents' answers.
•Recall Bias: Happens when participants do not accurately remember past events or experiences. This is common
in retrospective studies.
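A short simulation contrasts random error with instrument bias: random noise averages out as measurements accumulate, but a calibration offset never does. All values below are illustrative.

```python
# Instrument-bias sketch: a miscalibrated scale adds a systematic offset.
import numpy as np

rng = np.random.default_rng(1)
true_weight = 70.0                                  # kg, illustrative
noise = rng.normal(0, 0.5, size=1000)               # random measurement error

good_scale = true_weight + noise                    # unbiased readings
bad_scale = true_weight + 2.0 + noise               # 2 kg calibration offset

print(f"Good scale mean: {good_scale.mean():.2f}")  # ~70.0, error shrinks with n
print(f"Bad scale mean:  {bad_scale.mean():.2f}")   # ~72.0, offset never shrinks
```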
Response Bias
•Response bias occurs when participants do not provide truthful or accurate responses, leading to distorted
data.
Confirmation Bias
Confirmation bias occurs when researchers selectively collect or interpret data in a way that confirms their
preexisting beliefs or hypotheses.
•Data Mining Bias: Occurs when researchers look for patterns in the data that support their hypothesis while
ignoring those that contradict it.
•Publication Bias: Tendency for studies with positive or significant results to be published more often than those
with null or negative results.
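The data-mining variant is easy to demonstrate: test enough unrelated features against a random outcome, and a predictable share will appear "significant" by chance alone. A minimal sketch, assuming SciPy is available; all data are synthetic.

```python
# Data-mining bias sketch: enough tests on random data yield "discoveries".
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
outcome = rng.normal(size=100)

false_positives = 0
for _ in range(100):                      # 100 unrelated candidate features
    feature = rng.normal(size=100)
    r, p = stats.pearsonr(feature, outcome)
    if p < 0.05:
        false_positives += 1

# With alpha = 0.05 we expect ~5 spurious "discoveries" out of 100 tests.
print(f"Spurious significant correlations: {false_positives}")
```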
Observer Bias
Observer bias happens when the researcher's expectations or knowledge influence their observations and
interpretations.
Attrition Bias
Attrition bias occurs when participants drop out of a study over time, and those who remain are systematically
different from those who leave.
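A brief simulation of attrition bias, assuming dropout probability rises as the outcome worsens; the completers' average then overstates the true effect. All data below are synthetic.

```python
# Attrition-bias sketch: dropout correlated with the outcome distorts results.
import numpy as np

rng = np.random.default_rng(3)
improvement = rng.normal(loc=0.0, scale=1.0, size=10_000)  # toy outcome

# Suppose participants who improve less are more likely to drop out.
p_dropout = 1 / (1 + np.exp(improvement))  # lower improvement -> higher dropout
stayed = rng.random(10_000) > p_dropout

print(f"True mean improvement: {improvement.mean():+.2f}")          # ~0.00
print(f"Mean among completers: {improvement[stayed].mean():+.2f}")  # inflated
```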
Case Study: The Impact of Biased Data on Decision-Making
Background
A prominent technology firm, TechSolutions,
implemented a machine learning algorithm to
streamline its hiring process. The goal was to
identify and recruit top talent more efficiently by
analyzing resumes and ranking candidates based
on predicted job performance.
The Problem
After six months of using the algorithm, it was
observed that the number of women and
minority candidates being hired had significantly
decreased. This discrepancy raised concerns
about potential biases in the hiring algorithm.
Data Analysis
•Historical Bias: The training data consisted primarily of resumes from the past
decade, during which the majority of employees hired were white males. This
historical bias was encoded into the algorithm, which learned to favor similar
profiles.
•Feature Selection: Certain features that correlated with higher hiring success,
such as attending specific universities or having particular job titles, were
disproportionately common among white male candidates, leading the
algorithm to favor these candidates.
•Lack of Diversity in Training Data: The dataset lacked sufficient representation
from women and minority groups, preventing the algorithm from accurately
assessing the potential of candidates from these backgrounds.
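A minimal sketch of the mechanism described above: when training labels encode past biased decisions, a model learns to score equally qualified candidates differently by group. The data, features, and model below are synthetic stand-ins, not TechSolutions' actual system.

```python
# Historical-bias sketch: biased past labels teach the model to discriminate.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(4)
n = 5_000
group = rng.integers(0, 2, size=n)   # 0 = majority, 1 = minority (synthetic)
skill = rng.normal(size=n)           # true, group-independent ability

# Historical labels: past hiring rewarded skill but also favored group 0.
hired = (skill + 1.5 * (group == 0) + rng.normal(size=n) > 1.0).astype(int)

model = LogisticRegression().fit(np.column_stack([skill, group]), hired)

# Equally skilled candidates get different scores based on group alone.
print(f"P(hire | majority): {model.predict_proba([[0.5, 0]])[0, 1]:.2f}")
print(f"P(hire | minority): {model.predict_proba([[0.5, 1]])[0, 1]:.2f}")  # lower
```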
Consequences