Reliability of Measures
The Concept Of Reliability
• The reliability of a measurement procedure is the stability or consistency of the measurement.
• A measurement procedure is said to have reliability if it produces identical (or nearly identical) results when it is used repeatedly to measure the same individual under the same conditions.
• For example, if we use an IQ test to measure a person's intelligence today, then use the same test for the same person under similar conditions next week, we should obtain nearly identical IQ scores.
• The inconsistency in a measurement comes from error.
• Observer error: the individual who makes the measurements can introduce simple human error into the measurement process.
• Environmental changes: small changes in the environment (such as time of day, temperature, weather conditions, and lighting) from one measurement to another can influence the measurements.
• Participant changes: the participant can change between measurements. As noted earlier, a person's degree of focus and attention can change quickly and can have a dramatic effect on measures of reaction time; for example, hunger at the time of testing can temporarily depress a person's score on an IQ test.
• In summary, any measurement procedure involves an element of error, and the amount of error determines the reliability of the measurements. When error is large, reliability is low; when error is small, reliability is high.

Reliability Types

1. Inter-Rater Reliability
• When measurements are obtained by direct observation of behaviors, it is common to use two or more separate observers who simultaneously record measurements. Inter-rater reliability is the degree of agreement or consistency between two or more scorers (or judges or raters) with regard to a particular measure.
• For example, two psychologists may watch a group of preschool children and observe social behaviors. Each individual records (measures) what she observes, and the degree of agreement between the two observers is called inter-rater reliability.
• Inter-rater reliability can be measured by computing the correlation between the scores from the two observers or by computing a percentage of agreement between the two observers.

2. Test-Retest Reliability
• The reliability estimate obtained by comparing the scores from two successive measurements is commonly called test-retest reliability.
• A researcher may use exactly the same measurement procedure for the same group of individuals at two different times; the reliability of the test scores is estimated by repeating the identical test on a second occasion.
• The reliability coefficient in this case is simply the correlation between the scores obtained by the same persons on the two administrations of the test.
• Of course, sometimes poor test-retest correlations do not mean that a test is unreliable; they may instead indicate that the characteristic being measured has genuinely changed between the two testing occasions.

Limitations of Test-Retest Reliability
● Carry-over effect: this effect occurs when the first testing session influences scores from the second session. For example, test takers sometimes remember their answers from the first time they took the test. Carry-over problems are of concern only when the changes over time are random rather than systematic; a systematic change shifts everyone's score by roughly the same amount and leaves the relative ordering of scores, and thus the reliability estimate, largely intact.
● Practice effects: a type of carry-over effect. Some skills improve with practice, so when a test is given a second time, test takers score better because they sharpened their skills by taking the test the first time.
● The time interval between testing sessions must be selected and evaluated carefully. If the two administrations of the test are close in time, there is a relatively great risk of carry-over and practice effects.
● Changes in motivation level between the two sessions can also distort the estimate.
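Both estimates described so far reduce to simple computations. The sketch below is a minimal Python illustration, using made-up scores (the data, variable names, and helper functions are hypothetical, not from the original notes): percentage agreement for inter-rater reliability and a Pearson correlation as a test-retest coefficient.

```python
import numpy as np

def percent_agreement(rater_a, rater_b):
    """Proportion of observations on which two raters recorded the same category."""
    a, b = np.asarray(rater_a), np.asarray(rater_b)
    return float(np.mean(a == b))

def pearson_r(x, y):
    """Pearson correlation coefficient between two sets of scores."""
    x, y = np.asarray(x, dtype=float), np.asarray(y, dtype=float)
    x, y = x - x.mean(), y - y.mean()
    return float((x @ y) / np.sqrt((x @ x) * (y @ y)))

# Hypothetical data: two observers coding the same 10 behaviors (1 = occurred, 0 = did not)
rater_a = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1]
rater_b = [1, 0, 1, 0, 0, 0, 1, 0, 1, 1]
print("Inter-rater agreement:", percent_agreement(rater_a, rater_b))   # 0.9

# Hypothetical data: IQ scores for the same six people tested one week apart
test   = [102, 115, 98, 124, 109, 91]
retest = [100, 117, 99, 121, 110, 94]
print("Test-retest reliability (r):", round(pearson_r(test, retest), 3))
```

When the raters produce interval-level scores rather than categories, the inter-rater estimate can also be computed as the correlation between the two raters' scores using the same pearson_r helper.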
3. Parallel-Forms Reliability
• A researcher may use modified versions of the measurement instrument (such as alternative versions of an IQ test) to obtain two different measurements for the same group of participants.
• The same persons can be tested with one form on the first occasion and with a second, comparable form on the second occasion. The correlation between the scores obtained on the two forms represents the reliability coefficient of the test.
• When different versions of the instrument are used for the test and the retest, the reliability measure is often called parallel-forms reliability.
• In a counterbalanced design, both groups take both forms: group A takes form A first, and group B takes form B first. The results from the two forms are compared; if the scores are nearly identical, parallel-forms reliability is high.

Just think! You missed the midterm examination and have to take a makeup exam. Your classmates tell you that they found the midterm impossibly difficult. Your instructor tells you that you will be taking an alternate form, not a parallel form, of the original test. How do you feel about that?

Limitations
● Test scores may be affected by factors such as motivation, fatigue, or intervening events such as practice, learning, or therapy.
● The order of administration is usually counterbalanced to avoid practice effects.
● Developing alternate forms of tests can be time-consuming and expensive.

4. Internal Consistency Estimates of Reliability
Two different methods for internal consistency estimates of reliability:
1. Split-half reliability
2. KR-20 formula and Cronbach's alpha

1. Split-Half Reliability
• To measure the degree of consistency, researchers commonly split the set of items in half and compute a separate score for each half. The degree of agreement between the two half-scores is then evaluated, usually with a correlation. This general process results in a measure of split-half reliability.
• Only a single administration of a single form is required.
Steps:
1. Divide the test into two equivalent halves, using one of these approaches: random assignment of items, an odd-even split, or matching of items on content and difficulty.
2. Compute the Pearson r between the scores on the two halves of the test.

2. KR-20 Formula and Cronbach's Alpha
• A split-half estimate depends on how the items happen to be divided; Cronbach's alpha and the Kuder-Richardson formula are two statistical techniques for dealing with this problem.
• KR-20, first published in 1937, is a measure of internal consistency reliability for measures with dichotomous choices, e.g., right/wrong, true/false, correct/incorrect.
• It should not be used for questions where partial credit is possible or for scales such as the Likert scale. If you have a test with more than two answer possibilities (or opportunities for partial credit), use Cronbach's alpha instead.
• Cronbach's alpha refers to the degree of correlation among all the items on a scale. It is a measure of inter-item consistency, is calculated from a single administration of a single form of a test, and can be used to estimate the internal consistency of heterogeneous as well as homogeneous tests, which makes it the most widely preferred estimate.
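All three internal-consistency estimates can be computed from a single persons-by-items score matrix. The following is a minimal sketch in Python under stated assumptions: the data are made up, rows are respondents and columns are items, and the split-half value applies the standard Spearman-Brown correction for full test length (a common step that the notes above imply but do not name).

```python
import numpy as np

def split_half(scores):
    """Odd-even split-half reliability, stepped up with the Spearman-Brown
    correction (assumed here) to estimate full-length reliability."""
    X = np.asarray(scores, dtype=float)
    odd, even = X[:, 0::2].sum(axis=1), X[:, 1::2].sum(axis=1)
    r_half = np.corrcoef(odd, even)[0, 1]
    return 2 * r_half / (1 + r_half)

def cronbach_alpha(scores):
    """alpha = k/(k-1) * (1 - sum of item variances / variance of total scores)."""
    X = np.asarray(scores, dtype=float)
    k = X.shape[1]
    item_vars = X.var(axis=0, ddof=1)
    total_var = X.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

def kr20(scores):
    """KR-20 for dichotomous (0/1) items: k/(k-1) * (1 - sum(p*q) / total variance)."""
    X = np.asarray(scores, dtype=float)
    k = X.shape[1]
    p = X.mean(axis=0)          # proportion of respondents passing each item
    q = 1 - p
    total_var = X.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - (p * q).sum() / total_var)

# Hypothetical data: 6 respondents x 4 right/wrong items
X = np.array([[1, 1, 1, 0],
              [1, 0, 1, 1],
              [0, 0, 1, 0],
              [1, 1, 1, 1],
              [0, 0, 0, 0],
              [1, 1, 0, 1]])
print("Split-half (Spearman-Brown):", round(split_half(X), 3))
print("Cronbach's alpha:           ", round(cronbach_alpha(X), 3))
print("KR-20:                      ", round(kr20(X), 3))
```

Because the items here are dichotomous, KR-20 and alpha give very similar values; with Likert-type or partial-credit items, only Cronbach's alpha applies, as noted above.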
How Reliability Can Be Improved
• A sufficient alpha coefficient is roughly .70 to .90.
• Increase the number of items.
• Reliability estimates are also affected by sample size.
• Factor and item analysis: examine the correlation between each single item score and the total scale score; a low correlation indicates that the item is too hard, irrelevant, or measuring something different, and such items can be omitted to improve overall reliability.
• Correction for attenuation: low reliability reduces the chances of finding significant correlations between measures. If a test is unreliable, information obtained with it is of little or no value. Thus, we say that potential correlations are attenuated, or diminished, by measurement error; the standard correction formula is sketched below.
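As referenced in the correction-for-attenuation point above, the standard disattenuation formula divides the observed correlation by the square root of the product of the two measures' reliabilities. A minimal sketch with made-up numbers (the function name and values are illustrative only):

```python
from math import sqrt

def disattenuate(r_xy, rel_x, rel_y):
    """Correct an observed correlation for attenuation due to unreliability:
    r_corrected = r_observed / sqrt(rel_x * rel_y)."""
    return r_xy / sqrt(rel_x * rel_y)

# Hypothetical example: observed r = .30 between two tests with reliabilities .70 and .60
print(round(disattenuate(0.30, 0.70, 0.60), 3))  # about 0.46, the correlation the two
                                                 # measures could show if error-free
```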