EDUC3 Module 3
EDUC3 Module 3
UNDERSTANDING
Psychological Educational
performance Standard
Some will really pass All or none may pass
There is competition There is no
for a limited competition for a
percentage of high limited percentage of
scores high score
Describes pupil’s Describes pupil’s
performance mastery of course
compared to others objectives
Interpretation
Verbal Non-verbal
Words are used by Students do not use
students in attaching words in attaching
meaning to or meaning to or in
responding to test responding to test
items items (e.g. graphs,
Language Mode numbers, 3-D
subjects)
Standardized Informal
Constructed by a Constructed by a
professional item classroom teacher
writer
Covers a broad range Covers a narrow
of content covered in range of content
a subject area
Uses mainly multiple Various types of
choice items are used
Items written are Teacher picks or
Construction screened and the best writes items as
items were chosen for needed for the test
the final instrument
Can be scored by a Scored manually by
machine the teacher
Interpretation of Interpretation is
results is usually usually criterion-
norm-referenced referenced
Individual Group
Effect of Biases
Power Speed
These were developed in an attempt to eliminate some of the major problems inherent
in the use of self-report measures, such as the tendency of some respondents to give
“socially responsible” responses.
4. Expression Techniques - : In this the people are asked to express the feeling or
attitude of each other people.
6. Make all alternatives plausible and attractive to the less knowledge or skilfull.
What process is most nearly opposite of photosynthesis?
Poor Better
a. Digestion a. Digestion
b. Relaxation b. Assimilation
c. Respiration c. Respiration
d. Exertion d. Catabolism
7. Make the alternative grammatically parallel with each other and consistent with
a. 1-2 glasses
b. 2-3 glasses*
c. 3-4 glasses*
d. At least 4 glasses
Better: What is the daily minimum required amount of milk a 10-year old child
should drink?
a. 1 glass
b. 2 glasses
c. 3 glasses
d. 4 glasses
9. When possible, present alternatives in some logical order (chronological, most to
least, alphabetical).
At 7 a.m. two trucks leave a diner and travel north. One truck averages 42 miles
per hour and the other truck averages 38 miles per hour. At what time will be 24 miles apart?
Undesirable Desirable
a. 6 p.m. a. 1 a.m.
b. 9 a.m. b. 6 a.m.
c. 1 a.m. c. 9 a.m.
d. 1 p.m. d. 1 p.m.
e. 6 a.m. e. 6 p.m.
10. Be sure there is only one correct or best response to the item.
Poor: The two most desired characteristics in a classroom test are validity and
a. Precision
b .Reliability*
c. Objectivity
d. Consistency*
Best: The two most desired characteristics in a classroom test are validity and
a. Precision
b. Reliability*
c. Objectivity
d. Standardization
11. Make alternative approximately equal in length.
Poor: The most general cause of low individual incomes in the US is
a. Lack of valuable productive services to sell*
b. Unwillingness to work
c. Automation
d. Inflation
Better: What is the most general cause of low individual incomes in the US?
a. A lack of valuable productive services to sell*
b. The population’s overall unwillingness to work
c. The nation’s increase reliance on automation
d. An increasing national level of inflation.
12. Avoid irrelevant clues, such as grammatical structure, well-known verbal
associations or connections between stem and answer.
Poor: (grammatical clue) A chain of islands is called an
a. Archipelago
b. Peninsula
c. Continent
d. Isthmus
Poor: (verbal association) The reliability of a test can be estimated by a
coefficient of
a. Measurement
b. Correlation*
c. Testing
d. Error
Poor: (connections between stem and answer) The height to which a water
dam is build depends on
a. The length of the reserve behind the dam.
b. The volume of the water behind the dam.
c. The height of water behind the dam.*
d. The strength of the reinforcing the wall.
13. Use at least four alternatives for each item to lower the probability of getting the
item correctly by guessing.
14. Randomly distribute the correct responses among the alternative positions
throughout the test having approximately the same proportion of the alternatives a, b, c, d and e
as correct response.
15. Use the alternative NONE OF THE ABOVE and ALL OF THE ABOVE
sparingly. When used, such alternatives should occasionally be used as the correct response.
True-False Test Items
True-false test items are typically used to measure the ability to identify whether or not the
statements of facts are correct. The basic format is simply a declarative statement that the student
must judge as true or false. No modification of the basic form in which the student must respond
“yes” or “no”, “agree” or “disagree.”
Three Forms:
1. Simple – consists of only two choices
2. Complex – consists of more than two choices
3. Compound – two choices plus a conditional completion response
Examples:
Simple: The acquisition of morality is a developmental process. True False
Complex: The acquisition of morality is a developmental process. True Fals Opinio
e n
Compound: An acquisition of morality is a developmental process. True Fals
e
If the statement is false, what makes it false?
Advantages of True-False Items
True-false items can provide:
1. The widest sampling of content or objectives per unit of testing time
2. Scoring efficiency and accuracy
3. Versatility in measuring all levels of cognitive ability
4. Highly reliable test scores; and
5. An objective measurement of student achievement or ability.
Limitations of True-False Items
1. Incorporate an extremely high guessing factor
2. It can often lead the teacher to write ambiguous statements due to the difficulty of
writing statements which are unequivocally true or false.
3. Do not discriminate between students varying ability as well as other item types.
4. It can often include more irrelevant clues than do other item types.
5. It can often lead a teacher to favour testing of trivial challenge.
Suggestions for Writing True-False Items (Payne, 1984)
1. Base true-false items upon statements that are absolutely true or false, without
qualifications or exceptions.
Poor: Nearsightedness is hereditary in origin.
Better: Geneticists and eye specialists believe that the predisposition to
nearsightedness is hereditary.
9. False items tend to discriminate more highly than true items. Therefore, use
more false items than true items (but not more than 15% additional false items).
Matching Test Items
In general matching items consists of a column of stimuli presented on the left side
of the exam page and a column of responses placed on the right side of the page. Students are
required to match the response associated with a given stimulus.
Advantages of Using Matching Test Items
1. Require short period of reading and response time allowing the teacher to cover
more content.
2. Provide objective measurement of student achievement or ability.
3. Provide highly reliable test scores.
4. Provide scoring efficiency and accuracy.
Disadvantages of Using Matching Test
Items
1. Have difficulty measuring learning objectives requiring more than simple recall
or information.
2. Are difficult to construct due to the problem of selecting a common set of stimuli
and responses.
Suggestions for Writing Matching Test items
1. Include directions which clearly state the basis for matching the stimuli with the
responses. Explain whether or not the response can be used more than once and indicate where
to write the answer.
Poor: Directions: Match the following.
Better: Directions: On the line to the left of each identifying location and
characteristics in Column 1, write the letter on the country in column III that is best defined.
Each country in Column may be used more than once.
Note:
1. Keep matching items brief, limiting the list of stimuli to under 10
2. Include more responses than stimuli to help prevent answering through the
process of elimination.
3. When possible, reduce the amount of reading time by including only short
phrases or single word in the response list.
Completion Test Items
The completion items require the student to answer a question or to finish an
incomplete statement by filling in a blank with correct word or phrase.
Example:
According to Freud, personality is made up of three major systems, the ,
the , and the .
Advantages of Using Completion Items
Completion items can:
1. Provide a wide sampling of content;
2. Efficiency measure lower levels of cognitive ability;
3. Minimize guessing as compared to multiple choice or true-false items; and
4. Usually provide an objective measure of student achievement or ability
Limitations of Using Completion Items
Completion items:
1. Are difficult to construct so that the desired response is clearly indicated;
2. Have difficulty in measuring learning objectives requiring more than simple recall
of information;
3. Can often include irrelevant clues than do other item types;
4. Are more time consuming to score when compared to multiple choice or true-
false items; and
5. Are more difficult to score since more than one answer may have to be
considered correct if the item was not properly prepared.
Suggestions for Writing Completion Test Items
1. Omit only significant words from the statement. Poor:
Every atom has a central (core) called nucleus.
Better: Every atom has a central core called a (an) (nucleus)
2. Do not omit so many words from the statement that the intended meaning is
lost.
Poor: The were to Egypt was the were to Persia as were to the clearly
tribes of Israel.
Better: The Pharaohs were to Egypt as the were to Persia as were to the
early tribes of Israel.
3. Indicate for each item appoint or weight and an estimated the limit for
answering.
Poor: Compare the writing of Bret Harte and Mark Twain in terms of setting,
depth of characterization, and dialogue styles of their main characters.
Better: Compare the writings of Bret and Mark Twain in terms of setting, depth
of characterization, and dialogue styles of their main characters. (10 points 20 points)
4. Ask questions that will elicit responses on which experts could agree that one
answer is better than another.
5. Avoid giving a student a choice among optional items as this greatly reduces the
reliability of the test.
6. It is generally recommended for classroom examinations to administer several
short-answer items rather than only on or two extended-response items.
instrument 2. Content Validity- is done through a careful and critical examination of the
.
objectives of assessment so that it reflects the curricular objectives.
3. Criterion-related Validity- is established statistically such that a
set of scores revealed by the measuring instrument IS CORRELATED with the
scores obtained in another EXTERNAL PREDICTOR OR MEASURE.
It has two purposes:
a. Concurrent Validity- describe the present status of the individual by correlating
the sets of scores obtained FROM TWO MEASUREs GIVEN
CONCURRENTLY.
Example: Relate the reading test result with pupils’ average grades in reading given by
the teacher.
Table of specification is a device for describing test items in terms of the content and
the process dimensions. That is, what a student is expected to know and what he or she
is expected to do with that knowledge. It is described by combination of content and
process in the table of specification.
Sessions Distribution
1. Definition of linear function 2 4 1-4
TOTAL 20 40 40
Total number of
class sessions
Example :
Number of class
session= 2 Desired
number of items= 40
Total number of
class sessions=20
Number of items=
Number of class
=2x40 20
Number of items= 4