
BRIEFING SHEET

www.ukalta.org info@ukalta.org @UKALTA2

No. 1/23, July 2023

WHAT IS LANGUAGE TESTING?
This briefing sheet answers the following questions:

▪ What is language testing?
▪ What are some key concepts in language testing?
▪ What are some new trends in language testing?

AUTHOR
Nathaniel Owen, Oxford University Press, Oxford.


WHAT IS LANGUAGE TESTING?


Language testing is the practice of evaluating language for the purposes of
certification or decision-making.

Language assessment refers to any activity in which information is collected from language learners from which we make judgements about their language proficiency or progress.

Assessment can therefore encompass both informal, classroom-based quizzes, self-assessment and peer assessment, and more formal, high-stakes summative assessment. Testing is typically the more specific term for formal, standardised assessment. Language tests are usually constructed and provided by examination agencies which are granted the authority to issue certificates recognised for specific purposes, such as entry to higher education, or by employers who wish to see evidence of proficiency as part of decision-making for prospective employment.

WHAT ARE SOME KEY CONCEPTS IN LANGUAGE TESTING?


Some key concepts in language testing include (but are not limited to) test
purpose, stakes, validity, reliability and fairness.

▪ Test Purpose. Any individual test should always have a purpose which can be clearly explained by the test creators. For example, a low-stakes classroom test could be diagnostic; that is, created and implemented to identify areas requiring further learning or remedial action. Alternatively, classroom-based tests might be evaluative, used to make claims about proficiency at the end of a course. Such a test might also be used to make decisions based on a cut score, such as for admission to higher education (see the sketch below). Other purposes include placing learners in suitable classes (a placement test) or checking progress against a subset of course learning objectives (a progress test). Each test is made up of prompts, which elicit a response from a test taker. This response is the evidence that we use to inform decision-making. This evidence may be used formatively, to enable learners to judge their progress. In proficiency tests, the evidence may be used summatively, which means decisions are made about an individual on the basis of performance at the end of a course of instruction.
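As a minimal sketch of how a cut score turns a continuous score into a decision, the snippet below applies a single pass mark. The scores, the cut score of 65 and the function name are all hypothetical, chosen purely for illustration.

```python
# A minimal, hypothetical sketch: turning a continuous test score
# into a categorical decision using a single cut score.
def admit(score: float, cut_score: float = 65.0) -> str:
    """Return an admissions decision based on a cut score."""
    return "admit" if score >= cut_score else "do not admit"

for score in (72.5, 64.9, 65.0):
    print(score, "->", admit(score))  # 65.0 sits on the boundary and passes
```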

▪ Stakes. Stakes refers to the consequences of test outcomes. Stakes may vary depending on test purpose. For example, classroom-based tests may be low-stakes, as the decisions made on the basis of the scores are unlikely to have long-term ramifications for individual test takers. High-stakes tests are those used for significant decision-making, such as university admissions, graduation, moving to another country or applying for a job. The stakes of a test will have a significant impact on test takers' motivation, on what and how they study, and even on their mental health. Tests should therefore be designed with these effects in mind: to maximise the quality of the information provided to decision-makers and to avert potential negative effects on test takers.

▪ Validity. In testing, validity refers to the extent to which inferences drawn from test scores are justified. A validity argument sets out the evidence and theory to support claims about test score meaning. The kinds of evidence might include a comparison of the test content with the kinds of content we would find in the 'real world'. Evidence might also include studies relating test scores to external criteria, for example academic performance in the first year of university study; this is known as predictive validity (see the sketch below). Studies may also draw on conversation analysis to show that a certain test reflects a predicted range of language functions or conversational features. The type of evidence required depends upon the claims made for the scores. The theory in the validity argument provides the rationale for claiming that the evidence presented supports the claims made about the test scores. In low-stakes tests, particularly classroom-based assessments, validity evidence may be collected informally. Validity questions we might ask include: Has the feedback resulted in learner improvement? Are the tasks engaging and challenging for learners at this level? Is learner motivation improving?
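One strand of predictive-validity evidence can be sketched as a simple correlation between test scores and an external criterion. All figures below are invented for illustration; a real study would use far larger samples and more careful modelling. The sketch assumes Python 3.10+ for statistics.correlation.

```python
# A minimal sketch of predictive-validity evidence: correlating
# hypothetical test scores with an invented external criterion
# (first-year university grades).
from statistics import correlation  # Pearson's r; Python 3.10+

test_scores = [54, 61, 67, 70, 75, 82, 88]            # hypothetical scores
first_year_gpa = [2.1, 2.4, 2.9, 3.0, 3.2, 3.4, 3.7]  # hypothetical criterion

r = correlation(test_scores, first_year_gpa)
print(f"Predictive validity coefficient: r = {r:.2f}")
```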

▪ Reliability. Reliability refers to the consistency of measurement across facets of the testing context. These facets might include time, place, interlocutor, and rater/marker. Test scores should not vary depending upon where the test is taken, who the interlocutor may be, or who rates the performance, as these facets are irrelevant to what we are trying to measure. We can investigate the impact of these facets on test scores by changing one while holding others constant and evaluating the impact on test scores (see the sketch below). If a test successfully assesses language proficiency, we would not expect test scores to change if the test is taken two or three times over one week by the same test taker (excepting any normal random error variance due to chance factors). However, over longer periods of time, scores may change due to study or attrition. Reliability is therefore related to validity. A test must be reliable in order to be valid. However, reliability is not sufficient evidence on its own to claim validity (after all, it is possible to reliably measure the wrong thing). Validity and reliability are important because they speak to our concern for test fairness in all aspects of assessment practice.
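A minimal sketch of varying one facet (the rater) while holding the others constant: correlate two raters' marks for the same set of performances. The scores are invented, and Pearson's r is only one of several reliability indices used in practice (again assuming Python 3.10+).

```python
# A minimal sketch of an inter-rater reliability check: two raters
# mark the same performances, and we look at how consistently their
# scores rank the test takers. All marks are invented.
from statistics import correlation  # Pearson's r; Python 3.10+

rater_a = [5, 6, 4, 7, 8, 6, 5]  # hypothetical marks from rater A
rater_b = [5, 6, 5, 7, 7, 6, 4]  # hypothetical marks from rater B

print(f"Inter-rater consistency: r = {correlation(rater_a, rater_b):.2f}")
```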

▪ Fairness. Fairness in testing means that all learners should have an equal opportunity at the point of assessment to show their proficiency. This means that there should be no bias towards or against any subgroups of the population, and that the scores should be independent of test method facets that are irrelevant to what we are assessing. Evidence of differential performance among subgroups may lead to charges of discrimination. Consistent with the distinction between reliability and validity, test bias is something that we can investigate statistically (see the sketch below). However, demonstrating an absence of test bias is insufficient to claim that the test is fair.

Test fairness is an argument made about the defensibility of test use for a specific purpose by stakeholders.
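As a first-pass illustration of investigating test bias statistically, the sketch below compares how often two subgroups answer the same item correctly. The data and group labels are invented; real differential item functioning (DIF) analyses also match test takers on overall ability before comparing groups.

```python
# A minimal, hypothetical sketch of a first look at item-level bias:
# compare item facility (proportion correct) across two subgroups.
group_a = [1, 1, 0, 1, 1, 0, 1, 1]  # 1 = correct response to the item
group_b = [1, 0, 0, 1, 0, 0, 1, 0]

facility_a = sum(group_a) / len(group_a)
facility_b = sum(group_b) / len(group_b)
print(f"Item facility: group A = {facility_a:.2f}, group B = {facility_b:.2f}")
# A large, consistent gap flags the item for review, but a gap alone
# does not prove bias, and its absence does not prove fairness.
```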

WHAT ARE SOME NEW TRENDS IN LANGUAGE TESTING?


Trends in language testing often mirror those in Applied Linguistics, or in education more generally. Therefore, some recent concerns in language testing include (but are not limited to) multilingualism, on-demand testing and artificial intelligence. (See other UKALTA briefing sheets on some of these topics.)

▪ Multilingualism. Increasing attention is being paid to educational contexts in which learners speak more than one language and draw upon these languages for their learning experiences. There is concern that the linguistic diversity of these contexts is not well reflected in existing tasks or test content. How should stakeholders and language test developers respond? What kinds of 'Englishes' should be included in high-stakes tests, and how well are students served by English language tests in these contexts? Additionally, there are questions about the appropriacy of English language tests designed for use in contexts such as the UK, USA or Australia being used in English-medium contexts such as Sweden, Nepal or India. These are countries in which English serves as the language of education but is not the first language of the students.

▪ On-demand testing. The Covid-19 pandemic resulted in an increase in 'on-demand' or 'at-home' testing. At-home tests are much more flexible and convenient for test takers, who can take a language test at a time and place of their choosing. However, there are legitimate security concerns associated with at-home testing compared with traditional testing overseen by invigilators. At-home tests typically employ online invigilators, known as 'remote proctors', to oversee high-stakes language tests. Even so, there are understandable concerns that remote proctoring may be an excessive infringement on personal privacy if security includes 'room sweeps' by remote proctors using the test taker's own webcam. Remote proctoring is usually supplemented by artificial intelligence solutions, such as automated monitoring of test takers' eye movements.

▪ Artificial intelligence. Artificial intelligence is swiftly becoming the major focus of research in language testing. Whether for remote proctoring, adaptive testing, or automated scoring, AI-related research is rapidly influencing how we assess language. Discussions around automated scoring have been ongoing for more than a decade, with incremental improvements in score reliability evident. However, there are continuing discussions as to whether the level of agreement between human assessors and machine scoring is sufficient to claim that automated scoring models are reliable (a simple agreement check is sketched below). Additionally, traditional models of validity may be too impoverished to encompass the use of sophisticated engines such as Google's BERT or OpenAI's GPT-3. The field is actively looking towards how validity in language testing can inform and be informed by approaches to machine learning. In early 2023, we saw significant discussion about the impact of ChatGPT on educational assessment and on education more generally. In future, it is likely that models such as OpenAI's GPT-4 or Google's BERT will significantly influence the direction of language testing and assessment.
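A minimal sketch of one common agreement check between human and machine scores: Cohen's kappa, which corrects raw agreement for chance. The band scores below are invented; operational studies typically use much larger datasets and weighted variants of kappa.

```python
# A minimal sketch: chance-corrected agreement (Cohen's kappa)
# between invented human and machine band scores for ten scripts.
from collections import Counter

human   = [3, 4, 4, 5, 3, 2, 4, 5, 3, 4]
machine = [3, 4, 5, 5, 3, 3, 4, 4, 3, 4]

n = len(human)
p_obs = sum(h == m for h, m in zip(human, machine)) / n   # raw agreement
h_marg, m_marg = Counter(human), Counter(machine)
labels = set(human) | set(machine)
p_exp = sum((h_marg[c] / n) * (m_marg[c] / n) for c in labels)  # chance level
kappa = (p_obs - p_exp) / (1 - p_exp)
print(f"Agreement = {p_obs:.2f}, Cohen's kappa = {kappa:.2f}")
```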

Recommended reading:

Douglas, D. (2010). Understanding Language Testing. Hodder Education/Routledge.

An introductory text that explains basic terminology and key concepts in language testing, outlines the skills required to design and use language tests, and introduces simple statistical tools for test analysis. No prior knowledge of language testing is assumed.

Fulcher, G. (2010). Practical Language Testing. Hodder Education/Routledge.

An intermediate text dealing with the purpose of testing in context and an analysis of test use in society. The text then follows the 'test development cycle' to explain in detail the process of test design, implementation, and interpretation.

Carr, N. (2011). Designing and Analyzing Language Tests. Oxford University Press.

Provides a comprehensive overview of concepts, principles, and methods for designing, developing, and evaluating language tests. The book aims to equip readers with the knowledge and skills to critically analyse existing tests and thoughtfully develop new language tests that are valid, reliable, practical, authentic and beneficial for test takers.

Green, A. (2020). Exploring Language Assessment and Testing (2nd ed.). Oxford University Press.

Covers a wide range of topics related to language assessment, including theoretical foundations, assessment design and development, test administration and scoring, and the use of assessment results.

Hughes, A., & Hughes, J. (2020). Testing for Language Teachers. Cambridge University Press.

Covers topics such as the purposes of testing, test content, test methods, scoring, analysing test performance, developing tests, ethics, and monitoring standards. Guidance is provided on developing tests, evaluating existing tests, giving feedback on tests, and using tests as part of the teaching process.
