0% found this document useful (0 votes)

7 views

Chapter 5

Chapter 5 discusses measurement, reliability, and validity in research, outlining different levels of measurement such as nominal, ordinal, interval, and ratio. It emphasizes the importance of reliability, which ensures consistency in measurements, and validity, which assesses whether a test measures what it intends to measure. Various types of validity, including content, criterion, and construct validity, are explained, along with methods to enhance reliability and validity in testing.

Uploaded by

Vi Phạm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

Chapter 5

Uploaded by

Vi Phạm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

CHAPTER 5

Measurement, Reliability,
and Validity
Levels of measurement
Measurement
The assignment of reflect the way in which outcomes
values to outcomes are measured or assessed
Levels of measurement
• Nominal variables are categorical in nature.
Gender (male or female), preferences (like or dislike), voting record (for or against),…
• Ordinal variables reflect rankings.
Rank in college, order of finishing a race
• Interval variables have equal intervals between them.
Temperature, intelligence test scores, …
• Ratio variables have equal intervals between them and have an absolute zero.
Age, weight, time

A continuous variable is one that can assume any value along some underlying
continuum (e.g. Height, Age,…)
A discrete or categorical variable is one with values that can be placed only into
categories that have definite boundaries (e.g. Gender, marital status)
Practice
Identify the levels of measurement of the following variables
1. Amount of money in savings account
2. Letter grades (A, B, C,…) on an English essays
3. Time spent commuting to work
4. Classification of exercises (beginning, intermediate, advanced)
5. Levels of agreement (Strongly disagree, Disagree, Neither agree
nor disagree, Agree, Strongly agree)
6. Flavors of ice-cream
7. Types of living accommodation (house, apartment, trailer, other)
8. Weight
9. Time of day (dawn, morning, noon, afternoon, evening, night)
10. IELTS bandscores
Reliability: consistency of the measurement
Validity: accuracy & truthfulness  test what should test

Is it reliable?
Is it valid?

Time 1 Time 2 Time 3 Time 4 Time 5

45 kg 40 kg 50 kg 50 kg 43 kg
• Reliability occurs when a test measures the same thing
more than once and results in the same outcomes.
Reliability • Reliability consists of both an observed score and a true
score component.

Observed score = 7.3

True score = 8.5
Error = Observed – True = -1.2
Identify whether the following factors would contribute to
TRAIT or METHOD sources of errors.

1. Level of ability
2. Bias in grading
3. Test-taking skills
4. Interaction between examiner and test taker
5. Health
6. Fatigue
7. Motivation
8. Emotional strain
9. Testing environment
10.Ability to understand instructions
Increasing Reliability
• Increase the number of items or observations (larger sample means
more representative and reliable)
• Eliminate items that are unclear
• Standardize the conditions under which the test is taken
• Moderate the degree of difficulty of the tests
• Minimize the effects of external events
• Standardize instructions
• Maintain consistent scoring procedures
Measuring • Reliability coefficients (r): range in value from +1.00 to -
1.00. A value of 1.00 would be perfect reliability
Reliability r >= 0.8  the test is reliable

Test–retest reliability examines consistency over time (Time 1 vs. Time 2)

Parallel-forms reliability examines consistency between forms (Form 1 vs. Form 2)
Inter-rater reliability examines consistency across raters (Rater 1 vs. Rater 2)
Internal consistency examines the unidimensional nature of a set of items (Individual vs. Entire)
Practice
Identify the correct type of reliability
1. Two trained teachers observe young learners ‘behavior in a classroom.
Each teacher rates observed behaviors using the same form and the
correlation between the two teachers’ ratings was calculated.

2. An IQ test is given to 70 participants on October 1st and then the same

IQ test is administered to the same group of 70 participants one month
later. The correlation of scores between the two tests is finally calculated.

3. Two versions of an ICT knowledge test with the same level of difficulty
and contents are administered to the same group of participants. The
reliability is then determined by computing the correlation between the
results of the two test versions.
VALIDITY
The test/instrument you are using actually measures what you need
to have measured.
• Validity refers to the results/outcomes of a test, not to the test itself.
• Validity progression occurs in degrees from low validity to high validity.
• The validity of the results of a test must be interpreted within the
context in which the test occurs.

Example:
Which one is a valid question for an English vocabulary test?
• Give two synonyms of the word “enormous”
• How many bones are in the human body?
TYPES OF VALIDITY
Content Validity
• Content validity indicates the extent to which a test represents the universe
of items from which it is drawn.
• Expert opinion is often used to establish the content validity of a test.
TYPES OF VALIDITY
Content Validity
• Content validity indicates the extent to which a test represents the universe
of items from which it is drawn.
• Expert opinion is often used to establish the content validity of a test.

What students learned What should appear in the test

Chapter 1: History 5 Questions about History
Chapter 2: Geography 5 Questions about Geography
Chapter 3: Culture 5 Questions about Culture
TYPES OF VALIDITY
Criterion Validity
• Criterion validity is a measure of the extent to which a test is related to some
criterion.
Concurrent validity: how well a test estimates present performance
Do Section 1 scores correlate with the test scores?
Do IELTS scores correlate with GPA of English-major students?

Predictive validity: how well it predicts (future) performance

Do academic achievements (GPA) correlate with (future) high-paid jobs?
TYPES OF VALIDITY
Construct Validity
• Construct validity: the extent to which the results of a test are
related to an underlying set of related variables.
Example: An English Listening comprehension test should test:
 ability to understand details (bottom-up processing)
 ability to comprehend major points or gist (top-down processing)
 ability to make inferences
 ability to guess the meaning of unknown words from the context
 ability to write responses in paragraphs
(based on the Interactive model of listening comprehension)
TYPES OF VALIDITY
Construct Validity

TO ESTABLISH CONSTRUCT VALIDITY:

• Correlate your test with some already established tests
Correlate your listening test score with IELTS listening test, FCE listening
test,…
• Compare the results of the test between different groups of people
with/without certain characteristics
Administer your listening test to low English proficiency students and to
high proficiency students
• Check whether the test items/components are consistent with the
underlying theory
Does your test include items for checking students’ ability to understand
details?
Reliability and Validity
Reliability and Validity
• A test can be reliable but not valid, but a test cannot be valid without
first being reliable.
Reliability is a necessary, but not sufficient, condition of validity.

Which one is a valid question for an English vocabulary test?

Give two synonyms of the word “enormous”
How many bones are in the human body?
Considering the situation

A mathematics test for Vietnamese 12-graders:

Nine thousands people came to the conference venue on

Friday. Half of them left the conference on Saturday. Then
on this Sunday morning, two thirds of the rest returned
home.

Question: How many people were still at the conference at

7:00 PM on this Sunday?
A mathematics test for
Vietnamese 12-graders:
Suggested answer
The math test has some problems Nine thousands people came to
in establishing its validity: the conference venue on Friday.
• The language of the test is English, Half of them left the conference
not the native language of the test on Saturday. Then on this
takers (construct validity)
Sunday morning, two thirds of
• The level of difficulty of the test is the rest returned home.
not appropriate for 12 grader: it
tests too basic math knowledge
(content validity)
Question: How many people
were still at the conference at
7:00 PM on this Sunday?

Ebook PDF Child Development A Cultural Approach 3rd Edition 2 PDF
98% (56)
Ebook PDF Child Development A Cultural Approach 3rd Edition 2 PDF
41 pages
Graded Motor Imagery
100% (2)
Graded Motor Imagery
82 pages
Characteristics of A Good Test
50% (2)
Characteristics of A Good Test
5 pages
Validity and Reliability
100% (4)
Validity and Reliability
19 pages
Blooms Taxonomy Action Verbs
67% (3)
Blooms Taxonomy Action Verbs
1 page
Consumer Behaviour & Marketing Communication
No ratings yet
Consumer Behaviour & Marketing Communication
69 pages
Basics of Social Research Canadian 3rd Edition Neuman Solutions Manual
0% (1)
Basics of Social Research Canadian 3rd Edition Neuman Solutions Manual
3 pages
Project Report On Quality of Work Life
No ratings yet
Project Report On Quality of Work Life
45 pages
Validity & Realibility
No ratings yet
Validity & Realibility
13 pages
Establishing Validity-and-Reliability-Test
No ratings yet
Establishing Validity-and-Reliability-Test
28 pages
Validity & Reliability
No ratings yet
Validity & Reliability
27 pages
Characteristics of A Good Test
No ratings yet
Characteristics of A Good Test
35 pages
Chapter 4 Assessment & Evaluation
No ratings yet
Chapter 4 Assessment & Evaluation
10 pages
Unit 9
No ratings yet
Unit 9
11 pages
Validity and Reliability Lesson 3.
No ratings yet
Validity and Reliability Lesson 3.
48 pages
MODULE 5.ppt
No ratings yet
MODULE 5.ppt
30 pages
Language - Testing - Characteristics of Good Test
No ratings yet
Language - Testing - Characteristics of Good Test
31 pages
PT Presentaion
No ratings yet
PT Presentaion
25 pages
L9 Qualities of A Good Measuring Instrument
No ratings yet
L9 Qualities of A Good Measuring Instrument
22 pages
What is Reliability
No ratings yet
What is Reliability
2 pages
Qualities of Good Test
No ratings yet
Qualities of Good Test
37 pages
2.measurement of Validity Reliability
No ratings yet
2.measurement of Validity Reliability
31 pages
Topic 3 Characteristics and Principles of Assessment
100% (1)
Topic 3 Characteristics and Principles of Assessment
45 pages
KPD Validity & Realibility
No ratings yet
KPD Validity & Realibility
25 pages
Validity Explains How Well The Collected Data Covers The Actual Area of Investigation
No ratings yet
Validity Explains How Well The Collected Data Covers The Actual Area of Investigation
7 pages
SPL-3 Unit 3
No ratings yet
SPL-3 Unit 3
4 pages
Summary Notes - Qualities of a good test (1)
No ratings yet
Summary Notes - Qualities of a good test (1)
49 pages
TEST VALIDITY
No ratings yet
TEST VALIDITY
5 pages
PSY 323 TOPIC 3
No ratings yet
PSY 323 TOPIC 3
5 pages
Quantitative Analysis - Sir Audrey
No ratings yet
Quantitative Analysis - Sir Audrey
6 pages
Validity and Reliability in Research
0% (1)
Validity and Reliability in Research
13 pages
meai.21 (1)
No ratings yet
meai.21 (1)
11 pages
Measuring Reliability and Validity
No ratings yet
Measuring Reliability and Validity
18 pages
What Is Validit1
No ratings yet
What Is Validit1
5 pages
Unit 6 8602
100% (1)
Unit 6 8602
22 pages
Validity and Reliability
No ratings yet
Validity and Reliability
31 pages
Unit 6 (8602)
No ratings yet
Unit 6 (8602)
14 pages
Lesson 8
No ratings yet
Lesson 8
1 page
BBA-BI-Class 19 Business Research Notes For BHM
No ratings yet
BBA-BI-Class 19 Business Research Notes For BHM
28 pages
Kyu Edu 2301 WK3
No ratings yet
Kyu Edu 2301 WK3
5 pages
LESSON 6 Assessment Reviewer
No ratings yet
LESSON 6 Assessment Reviewer
7 pages
Validity and Reliability
100% (2)
Validity and Reliability
20 pages
Validity TM
No ratings yet
Validity TM
8 pages
Language Testing
No ratings yet
Language Testing
29 pages
PE 7 MODULE 7 Correct
No ratings yet
PE 7 MODULE 7 Correct
8 pages
Reliability & Validity: Dr. Nitu Singh Sisodia
No ratings yet
Reliability & Validity: Dr. Nitu Singh Sisodia
20 pages
Educ Measurement Prelim
No ratings yet
Educ Measurement Prelim
24 pages
Characteristicsofagoodtest3 140227023631 Phpapp02
No ratings yet
Characteristicsofagoodtest3 140227023631 Phpapp02
41 pages
CHAPTER 4
No ratings yet
CHAPTER 4
86 pages
Unit 4: Qualities of A Good Test: Validity, Reliability, and Usability
No ratings yet
Unit 4: Qualities of A Good Test: Validity, Reliability, and Usability
18 pages
Qualities of Test(Validity & Relibility Etc)
No ratings yet
Qualities of Test(Validity & Relibility Etc)
38 pages
Introduction To Validity and Reliability
No ratings yet
Introduction To Validity and Reliability
6 pages
66cee8ee676c720018ba7acb_##_Research Aptitude 02- Daily Classnotes
No ratings yet
66cee8ee676c720018ba7acb_##_Research Aptitude 02- Daily Classnotes
13 pages
Lesson 6.2 Item Analysis and Validation
No ratings yet
Lesson 6.2 Item Analysis and Validation
24 pages
QUALITY OF A TEST
No ratings yet
QUALITY OF A TEST
7 pages
Topic 8F Validity Reliability and Sources of Error
No ratings yet
Topic 8F Validity Reliability and Sources of Error
24 pages
Validity and Reliability of Instruments
No ratings yet
Validity and Reliability of Instruments
26 pages
Educ105 - Coverage Exam
No ratings yet
Educ105 - Coverage Exam
14 pages
Validity and Relability
No ratings yet
Validity and Relability
4 pages
Validity and Reliability
No ratings yet
Validity and Reliability
19 pages
What Is Questionnaire?
No ratings yet
What Is Questionnaire?
4 pages
Validity and Reliability
No ratings yet
Validity and Reliability
3 pages
Week 3 Goodness of Measure
No ratings yet
Week 3 Goodness of Measure
12 pages
Test - Education (1) STANDARDIZED TESTS
No ratings yet
Test - Education (1) STANDARDIZED TESTS
9 pages
Language Classroom Assessment
From Everand
Language Classroom Assessment
Liying Cheng
No ratings yet
Cracking the CSET (California Subject Examinations for Teachers), 2nd Edition: The Strategy & Review You Need for the CSET Score You Want
From Everand
Cracking the CSET (California Subject Examinations for Teachers), 2nd Edition: The Strategy & Review You Need for the CSET Score You Want
The Princeton Review
No ratings yet
Lesson 4 Individual Differences
No ratings yet
Lesson 4 Individual Differences
5 pages
Diary Entry Based Questions
No ratings yet
Diary Entry Based Questions
11 pages
Mybriefcbt Provider Manual
No ratings yet
Mybriefcbt Provider Manual
106 pages
DLC Ilm Course Guide
No ratings yet
DLC Ilm Course Guide
18 pages
Babatugon, Louis Roy Vincent A
No ratings yet
Babatugon, Louis Roy Vincent A
5 pages
Case Study-Transformational Leadership
100% (1)
Case Study-Transformational Leadership
8 pages
Contesting The Nature of Conformity PDF
No ratings yet
Contesting The Nature of Conformity PDF
6 pages
Socio Project
No ratings yet
Socio Project
15 pages
Saturday by Ian McEwan
No ratings yet
Saturday by Ian McEwan
11 pages
Kolehiyo NG Lungsod NG Lipa: College of Teacher Education
No ratings yet
Kolehiyo NG Lungsod NG Lipa: College of Teacher Education
6 pages
Addition Review LP
No ratings yet
Addition Review LP
4 pages
Personal Development - Inside Out
No ratings yet
Personal Development - Inside Out
2 pages
Susan Magsaman Slides Your Brain On Art
No ratings yet
Susan Magsaman Slides Your Brain On Art
35 pages
Spillman, Lyn - Culture As Meaning Making
50% (2)
Spillman, Lyn - Culture As Meaning Making
4 pages
Game Sense 2
No ratings yet
Game Sense 2
9 pages
Family Support in NICU
No ratings yet
Family Support in NICU
23 pages
Prof. Manoj Mishra, Cnlu
No ratings yet
Prof. Manoj Mishra, Cnlu
12 pages
Piagets Theory of Child Development
No ratings yet
Piagets Theory of Child Development
4 pages
Slide - Egg-65648-ADKAR Assessment Template
No ratings yet
Slide - Egg-65648-ADKAR Assessment Template
13 pages
The Nature of Emotional Intelligence
No ratings yet
The Nature of Emotional Intelligence
10 pages
Brian Tracy 18 Pasos para Programar La Mente para El Exito
No ratings yet
Brian Tracy 18 Pasos para Programar La Mente para El Exito
19 pages
On The Zombie Within
No ratings yet
On The Zombie Within
1 page
Basics of Psychotherapy
No ratings yet
Basics of Psychotherapy
5 pages
Marketing Keller's Brand Equity Model
100% (1)
Marketing Keller's Brand Equity Model
3 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Chapter 5

Uploaded by

Chapter 5

Uploaded by

CHAPTER 5

Time 1 Time 2 Time 3 Time 4 Time 5

Observed score = 7.3

Test–retest reliability examines consistency over time (Time 1 vs. Time 2)

2. An IQ test is given to 70 participants on October 1st and then the same

What students learned What should appear in the test

Predictive validity: how well it predicts (future) performance

TO ESTABLISH CONSTRUCT VALIDITY:

Which one is a valid question for an English vocabulary test?

A mathematics test for Vietnamese 12-graders:

Nine thousands people came to the conference venue on

Question: How many people were still at the conference at

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.