Assessing Data Quality
Measurement is the process of obtaining data through a measurement tool, often one constructed by the researcher during the course of a study. Researchers do not simply assume that their measures work; instead, they collect evidence to show that the instruments are valid and reliable for quantitative research. If a tool proves to be invalid or unreliable, the researcher stops using it and tries to construct an alternative means of measurement. To demonstrate data quality, researchers have two distinct criteria for evaluating their quantitative measures: the reliability and the validity of the instrument.
Types of reliability
1. External reliability: the extent to which a measure varies from one use to another
a. Test-retest reliability: measures the stability of a test over time
b. Inter-rater reliability: the degree to which different raters give consistent estimates of the same behavior
2. Internal reliability: the extent to which a measure is consistent within itself
a. Split-half test: measures the extent to which all parts of the test contribute equally to what is being measured
Test-retest reliability (stability over time): this test focuses on an instrument's susceptibility to extraneous factors over time, such as subject fatigue or environmental conditions. Stability is assessed by administering an instrument twice to the same sample on two separate occasions (Time 1 and Time 2, usually about 7 days apart). The researcher administers the measure on both occasions and then compares the scores using a correlation coefficient. Theoretically, the reliability coefficient ranges from -1.00 through .00 to +1.00. A perfect coefficient is difficult to achieve in practice, so most researchers accept a test-retest correlation of 0.70 or greater, depending on the type of instrument and the area of research. Good test-retest reliability signifies the internal validity of a test and ensures that the measurements obtained on the two occasions are stable over time.
For example, if a scale weighed a person at 60 kg one minute and 60.01 kg the next, we would consider it a reliable instrument. The less an instrument varies across repeated measures, the higher its reliability. Any good measure should produce roughly the same scores on repeated use; a measure that produces highly inconsistent scores over time cannot capture a construct well and is not reliable.
Test-retest reliability is used when the attribute is fairly stable in nature (e.g., self-esteem, which usually does not fluctuate). The method is a relatively easy approach and can be used with interview schedules, questionnaires, and observational and physiological measures.
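As a minimal sketch of the computation described above, assuming ten subjects measured on two occasions about 7 days apart (the scores are invented for the example, and scipy is assumed to be available):

from scipy.stats import pearsonr

time1 = [34, 41, 29, 38, 45, 32, 40, 36, 28, 43]  # scores at Time 1
time2 = [36, 40, 31, 37, 44, 30, 41, 35, 30, 42]  # scores at Time 2, ~7 days later

r, p = pearsonr(time1, time2)  # correlation coefficient between the two occasions
print(f"Test-retest correlation: r = {r:.2f}")

# By the rule of thumb above, r of 0.70 or greater is usually acceptable
if r >= 0.70:
    print("Acceptable stability over time")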
Inter-rater / inter-observer reliability (for equivalence): is estimated by having two or more trained observers watch a single event simultaneously and independently record the data. It measures reliability by establishing equivalence, or consistency, in the observers' judgments and is used primarily with observational measures.
Inter-rater reliability is used to compute an index of equivalence or agreement between the raters or judges, using a correlation coefficient to demonstrate the strength of the relationship between one observer's ratings and another's.
Another procedure is to compute reliability as a function of the agreements between observers. If the observers always agree, the value is 1 (or 100%); if they always disagree, it is 0 (0%). The formula for measuring reliability is:

Reliability = Number of agreements / (Number of agreements + Number of disagreements)
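A minimal sketch of this agreement formula in Python, assuming two observers coded the same event as present (1) or absent (0) across ten observation intervals (the codings are invented for the example):

# Hypothetical codings: 1 = behavior present, 0 = behavior absent
observer_a = [1, 0, 1, 1, 0, 1, 0, 0, 1, 1]
observer_b = [1, 0, 1, 0, 0, 1, 0, 1, 1, 1]

agreements = sum(a == b for a, b in zip(observer_a, observer_b))
disagreements = len(observer_a) - agreements

# Reliability = agreements / (agreements + disagreements)
reliability = agreements / (agreements + disagreements)
print(f"Inter-observer agreement: {reliability:.2f}")  # 0.80 for these codings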
Drawbacks of inter-rater / inter-observer reliability
a. The observers may tend to overestimate or underestimate what they observe
b. The agreement formula may tend to overestimate observer agreement: when observers code only the absence or presence of a behavior, they will agree about 50% of the time by chance alone
Split-half test/technique (internal consistency, or homogeneity across the items) – Internal consistency is concerned with the consistency, or homogeneity, of results: the extent to which all the test items or subparts contribute equally to what is being measured. In simpler terms, it is the degree to which the subparts of an instrument yield the same results within the same test. One of the oldest, cheapest, and easiest methods for assessing internal consistency is the split-half technique, in which researchers measure reliability by including two versions of the same instrument within the same test.
In split-half reliability, the items of the instrument are split into two parts or groups, and both parts are given to one group of subjects at the same time. The scores from the two parts of the test are then correlated to test reliability (Cronbach's alpha, a closely related statistic, generalizes this across all possible splits of the items).
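A minimal sketch of the split-half computation in Python, assuming a six-item instrument scored by five subjects (the scores are invented; the Spearman-Brown correction used to adjust the half-test correlation is a standard step, though not named above):

import numpy as np

# Hypothetical scores: rows = subjects, columns = the six items
scores = np.array([
    [4, 5, 4, 4, 5, 4],
    [2, 3, 2, 3, 2, 2],
    [5, 5, 4, 5, 5, 5],
    [3, 2, 3, 3, 3, 2],
    [4, 4, 5, 4, 4, 5],
])

half1 = scores[:, 0::2].sum(axis=1)  # odd-numbered items
half2 = scores[:, 1::2].sum(axis=1)  # even-numbered items

r = np.corrcoef(half1, half2)[0, 1]   # correlation between the two halves
reliability = (2 * r) / (1 + r)       # Spearman-Brown correction for full test length
print(f"Split-half reliability: {reliability:.2f}")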
The split-half technique is an easy, economical, and widely used reliability test, as it requires only a single administration, and it is among the best means of assessing an important source of measurement error in psychological instruments. The technique is most commonly used for multiple-choice tests (although it is also used for other types of tests), which contain distinct but related subtests or subparts. In a split-half test, the internal consistency of the subparts is typically assessed; if subpart scores are summed for an overall score, the internal consistency of the full scale can be assessed as well. One drawback is that the technique works only for a large set of questions that measure the same construct.
The second important criterion for evaluating a quantitative instrument is its validity: the degree to which an instrument measures what it is supposed to measure. In other words, validity is the appropriateness, completeness, and usefulness of an instrument for measuring the attribute of interest. For example, a thermometer is supposed to measure only body temperature; it cannot be considered a valid instrument if it measures any attribute other than temperature. Similarly, if a researcher-constructed instrument is meant to measure pain but includes items on anxiety, it cannot be considered valid. Hence, a valid instrument measures only what it is supposed to measure.
Face validity – is the overall appearance of an instrument with regard to its appropriateness for measuring a specific attribute. Although it is not considered primary evidence, it is helpful for a measure to have face validity when other types of validity have been demonstrated. For example, most people would expect a self-esteem questionnaire to include items about whether they see themselves as a person of worth and whether they think they have good qualities; a questionnaire that included these kinds of items would therefore have good face validity.
Content validity – is the extent to which a measuring instrument provides adequate coverage of the specific content. In other words, it is concerned with the degree to which an instrument contains an appropriate and representative sample of items for the construct being measured. The content validity of an instrument is primarily based on judgment, and there are no completely objective methods to ensure adequate content coverage. In recent years, however, it has become common to use a panel of experts to evaluate a new instrument for adequacy and appropriateness; the panel typically consists of at least three members, excluding a language expert. The content validity of an instrument is relevant for both cognitive measures and affective measures (feelings, emotions, and other psychological traits).
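One way such panel judgments can be quantified is the item-level content validity index (I-CVI), a common statistic not named in the text above, so this sketch is an assumption. Three hypothetical experts rate each item's relevance on a 1-4 scale, and the I-CVI is the proportion of experts rating the item 3 or 4:

# Hypothetical relevance ratings (1 = not relevant ... 4 = highly relevant)
# item -> ratings from each of the three panel experts
ratings = {
    "item_1": [4, 4, 3],
    "item_2": [3, 4, 4],
    "item_3": [2, 3, 1],
}

for item, expert_ratings in ratings.items():
    # I-CVI: proportion of experts who rate the item as relevant (3 or 4)
    i_cvi = sum(r >= 3 for r in expert_ratings) / len(expert_ratings)
    print(f"{item}: I-CVI = {i_cvi:.2f}")  # low values flag items for revision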
Construct validity – is the most complex and abstract type of validity. A measure is said to possess construct validity to the degree that it conforms to predicted correlations with other theoretical propositions.
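As a minimal sketch of one such check (the measures and scores below are invented for the example), a researcher might correlate a new scale with an established measure of a theoretically related construct and look for the predicted relationship:

from scipy.stats import pearsonr

new_scale = [12, 18, 15, 22, 9, 20, 14, 17]          # hypothetical new instrument
related_measure = [30, 42, 35, 50, 25, 46, 33, 40]   # established, theoretically related measure

r, p = pearsonr(new_scale, related_measure)
print(f"r = {r:.2f}, p = {p:.3f}")
# A correlation in the predicted direction supports construct validity;
# its absence suggests the instrument may not tap the intended construct.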