Reliability

The document discusses the concept of reliability in psychometrics, emphasizing the importance of consistency in measurement and the various sources of error variance that can affect test scores. It outlines different methods for estimating reliability, including test-retest, parallel forms, and split-half reliability, as well as the implications of measurement error. Additionally, it introduces the standard error of measurement and the standard error of difference as tools for assessing the precision and significance of observed test scores.

PSYCHOMETRIC PROPERTIES: RELIABILITY

THE CONCEPT OF RELIABILITY

 Reliability: Consistency in measurement.
 Reliability coefficient: An index of reliability; a proportion that indicates the ratio between the true score variance on a test and the total variance.
 Observed score = true score plus error (X = T + E).
 Error refers to the component of the observed score that does not have to do with a test taker’s true ability or the trait being measured.

VARIANCE AND MEASUREMENT ERROR

 Variance = standard deviation squared.
 Total variance equals true variance plus error variance (σ²X = σ²T + σ²E).
 Reliability is the proportion of the total variance attributed to true variance.
 Measurement error: All of the factors associated with the process of measuring some variable, other than the variable being measured.

THE CONCEPT OF RELIABILITY: MEASUREMENT ERROR

Measurement error takes two forms:

1. Random error: A source of error in measuring a targeted variable caused by unpredictable fluctuations and inconsistencies of other variables in the measurement process (i.e., noise).
2. Systematic error: A source of error in measuring a variable that is typically constant or proportionate to what is presumed to be the true value of the variable being measured.

SOURCES OF ERROR VARIANCE

 Test construction - Variation may exist within items in a test or between tests (i.e., item sampling or content sampling).
 Test administration - Sources of error may stem from the testing environment; from test-taker variables such as pressing emotional problems, physical discomfort, lack of sleep, and the effects of drugs or medication; and from examiner-related variables such as physical appearance and demeanor.
 Test scoring and interpretation - Computer testing reduces error in test scoring, but many tests still require expert interpretation (e.g., projective tests), and subjectivity in scoring can enter into behavioral assessment.
 Surveys and polls usually contain a disclaimer as to the margin of error associated with their findings.
 Sampling error - The extent to which the sample (e.g., the voters polled in a study) actually was representative of the population (e.g., the voters in the election).
 Methodological error - Interviewers may not have been trained properly, the wording in the questionnaire may have been ambiguous, or the items may have somehow been biased to favor one or another of the candidates.

RELIABILITY ESTIMATES

Test-retest reliability: An estimate of reliability obtained by correlating pairs of scores from the same people on two different administrations of the same test.

 Most appropriate for variables that should be stable over time (e.g., personality) and not appropriate for variables expected to change over time (e.g., mood).
 As time passes, the correlation between the scores obtained on each testing decreases.
 With intervals greater than 6 months, the estimate of test-retest reliability is called the coefficient of stability.
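As a rough illustration, a test-retest estimate is just a Pearson correlation between scores from two administrations. A minimal Python sketch with hypothetical scores (the `pearson_r` helper is written here for illustration, not taken from any testing package):

```python
from statistics import mean, pstdev

def pearson_r(x, y):
    """Pearson correlation coefficient between two paired lists of scores."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (pstdev(x) * pstdev(y))

# Hypothetical scores for five test takers on two administrations of the same test
time1 = [10, 12, 9, 15, 11]
time2 = [11, 13, 9, 14, 12]

# If the interval between administrations exceeded 6 months,
# this value would be called the coefficient of stability.
r_test_retest = pearson_r(time1, time2)
print(round(r_test_retest, 3))
```

The closer the value is to 1, the more consistent the two administrations are.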
 Average proportional distance (APD): Focuses on the degree of difference between scores on test items; it involves averaging the differences between scores on all of the items and dividing by the number of response options on the test minus 1.

Parallel-forms and alternate-forms reliability:

 Coefficient of equivalence: The degree of the relationship between various forms of a test.
 Parallel forms: For each form of the test, the means and the variances of observed test scores are equal.
 Alternate forms: Different versions of a test that have been constructed so as to be parallel; they do not meet the strict requirements of parallel forms, but item content and difficulty are similar between tests.
 Reliability is checked by administering two forms of a test to the same group; scores may be affected by error related to the state of the test takers (e.g., practice, fatigue) or to item sampling.

MEASURES OF INTER-SCORER RELIABILITY

 Inter-scorer reliability: The degree of agreement or consistency between two or more scorers (or judges or raters) with regard to a particular measure.
 It is often used with behavioral measures.
 It guards against biases in scoring.
 Coefficient of inter-scorer reliability: The scores from different raters are correlated with one another.
Split-half reliability:

 Obtained by correlating two pairs of scores obtained from equivalent halves of a single test administered once; it entails three steps:
Step 1 - Divide the test into equivalent halves.
Step 2 - Calculate a Pearson r between scores on the two halves of the test.
Step 3 - Adjust the half-test reliability using the Spearman-Brown formula.
 The Spearman-Brown formula allows a test developer or user to estimate internal consistency reliability from a correlation of two halves of a test.

OTHER METHODS OF ESTIMATING INTERNAL CONSISTENCY

 Inter-item consistency: The degree of relatedness of items on a scale; this helps gauge the homogeneity of a test.
 Kuder-Richardson formula 20: The statistic of choice for determining the inter-item consistency of dichotomous items.
 Coefficient alpha: The mean of all possible split-half correlations, corrected by the Spearman-Brown formula; it is the most popular approach to estimating internal consistency, and its values range from 0 to 1.

RELIABILITY INTERPRETATION

 Approaching 1 - higher reliability
 High standard = 0.90-0.95
 Acceptable = 0.80-0.89
 Barely acceptable = 0.60-0.70

TRUE SCORE MODEL VS. ALTERNATIVES

 The true-score model is often referred to as classical test theory (CTT); it is the most widely used model due to its simplicity.
 True score: A value that, according to classical test theory, genuinely reflects an individual’s ability (or trait) level as measured by a particular test.
 CTT assumptions are more readily met than those of item response theory (IRT).
 A problematic assumption of CTT has to do with the equivalence of items on a test.
 Domain sampling theory: Estimates the extent to which specific sources of variation under defined conditions are contributing to the test score.
 Generalizability theory: Based on the idea that a person’s test scores vary from testing to testing because of variables in the testing situation.
 Instead of conceiving of variability in a person’s scores as error, Cronbach encouraged test developers and researchers to describe the details of the particular test situation or universe leading to a specific test score.
 This universe is described in terms of its facets, including the number of items in the test, the amount of training the test scorers have had, and the purpose of the test administration.
 Item response theory (IRT): Provides a way to model the probability that a person with X ability will be able to perform at a level of Y.
 IRT refers to a family of methods and techniques used to distinguish specific approaches.
 IRT incorporates considerations of an item’s level of difficulty and discrimination.
 Difficulty relates to an item not being easily accomplished, solved, or comprehended.
 Discrimination refers to the degree to which an item differentiates among people with higher or lower levels of the trait, ability, or other variable being measured.

THE STANDARD ERROR OF MEASUREMENT

 The standard error of measurement, often abbreviated as SEM, provides a measure of the precision of an observed test score; it is an estimate of the amount of error inherent in an observed score or measurement.
 The higher the reliability of the test, the lower the standard error.
 The standard error can be used to estimate the extent to which an observed score deviates from a true score.
 Confidence interval: A range or band of test scores that is likely to contain the true score.

THE STANDARD ERROR OF THE DIFFERENCE

 The standard error of the difference: A measure that can aid a test user in determining how large a difference in test scores should be before it is considered statistically significant. It can be used to address three types of questions:

1. How did this individual’s performance on test 1 compare with his or her performance on test 2?
2. How did this individual’s performance on test 1 compare with someone else’s performance on test 1?
3. How did this individual’s performance on test 1 compare with someone else’s performance on test 2?
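The standard error of measurement, the confidence interval around an observed score, and the standard error of the difference can all be computed from a test's standard deviation and reliability. A minimal sketch, using the standard formulas SEM = SD·√(1 − r) and SED = √(SEM₁² + SEM₂²); the scale parameters below are hypothetical:

```python
import math

def sem(sd, reliability):
    """Standard error of measurement: SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1 - reliability)

def confidence_interval(observed, sd, reliability, z=1.96):
    """Band around an observed score likely (95% by default) to contain the true score."""
    half_width = z * sem(sd, reliability)
    return observed - half_width, observed + half_width

def sed(sd, r1, r2):
    """Standard error of the difference between scores on two tests that
    share a standard deviation: sqrt(SEM1**2 + SEM2**2) = SD * sqrt(2 - r1 - r2)."""
    return sd * math.sqrt(2 - r1 - r2)

# Hypothetical scale: SD = 15, reliability .91 (test 1) and .84 (test 2)
s = sem(15, 0.91)                        # higher reliability -> lower standard error
low, high = confidence_interval(100, 15, 0.91)
d = sed(15, 0.91, 0.84)
print(round(s, 2), round(low, 2), round(high, 2), round(d, 2))
```

Under a two-tailed .05 criterion, an observed difference larger than about 1.96 × SED would be considered statistically significant.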

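Returning to the internal-consistency estimates above, the three split-half steps, the Spearman-Brown adjustment, and coefficient alpha can be sketched as follows. The item scores are hypothetical, and the helper functions are illustrative rather than drawn from any testing library:

```python
from statistics import mean, pstdev

def pearson_r(x, y):
    """Pearson correlation coefficient between two paired lists of scores."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (pstdev(x) * pstdev(y))

def spearman_brown(r_half, n=2):
    """Estimate the reliability of a test n times as long as the one yielding r_half."""
    return n * r_half / (1 + (n - 1) * r_half)

def cronbach_alpha(scores):
    """Coefficient alpha from a score matrix (rows = test takers, columns = items)."""
    k = len(scores[0])
    item_vars = [pstdev([row[i] for row in scores]) ** 2 for i in range(k)]
    total_var = pstdev([sum(row) for row in scores]) ** 2
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical 4-item test answered by five test takers (0-3 response scale)
scores = [
    [2, 3, 2, 3],
    [1, 1, 0, 1],
    [3, 3, 3, 2],
    [0, 1, 1, 0],
    [2, 2, 3, 3],
]

# Step 1: divide the test into equivalent halves (here, odd vs. even items)
odd = [sum(row[0::2]) for row in scores]
even = [sum(row[1::2]) for row in scores]
# Step 2: calculate a Pearson r between scores on the two halves
r_half = pearson_r(odd, even)
# Step 3: adjust the half-test reliability with the Spearman-Brown formula
r_full = spearman_brown(r_half)
alpha = cronbach_alpha(scores)
print(round(r_full, 3), round(alpha, 3))
```

Note that the Spearman-Brown adjustment raises the half-test correlation, reflecting the fact that a longer test tends to be more reliable.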