18081
18081
Root Cause
Investigations for CAPA:
Clear and Simple
James L. Vesper
PDA
Bethesda, MD, USA
ISBN: 978-1-942911-28-9
Copyright © 2018 by James L. Vesper and Tim Sandle
All rights reserved.
10 9 8 7 6 5 4 3 2 1
All rights reserved. This book is protected by copyright. No part of it may
be reproduced,ISBN:
stored in a retrieval system, or transmitted in any form or by
978-1-942911-50-0
any means, electronic,
Copyright mechanical,
© 2020 by James photocopying,
L. Vesper recording, or otherwise,
without written Allpermission
rights reserved. from the publisher. Printed in the United States
10 9 8 7 6 5 4 3 2 1
of America.
Where a This book is
ISBN:
product protected byregistration
978-1-942911-42-5
trademark,
Copyright © 2020 Tim Sandle
copyright. Nomark, part of itormay be reproduced,
other protectedstored
in
mark is made in the a retrieval
All text,
system,
rights ownership
reserved.
or transmitted in any form or by
of the mark remains with the lawful any means, electronic,
mechanical, photocopying, recording, or otherwise, without written
owner of the mark.All No claim,
rights
intentional
reserved.
orprotected
otherwise,
This book isPrinted
is made by reference
permission from the publisher. inby copyright.
the United No partof
States ofAmerica.
it may
to any such marksWhere inreproduced,
be theabook. stored in a retrieval system or transmitted in any means,
electronic, product
mechanical,trademark, registration
photocopying, mark,
recording, or other protected
or otherwise, without mark
At the time of
is made printing,
in the
written text, all
permission web
ownership
from the site
of thelinks
publisher. mark referenced
remains
Printed in the with functioned,
United the lawful
States of owner
however PDAof and DHI
mark.cannot
theAmerica. No claim, guarantee
intentionalthe or accuracy
otherwise, of the information
is made by reference to any
Where a product trademark, registration mark, or other protected
or that the listed web
such marks
mark sitesinwill
is made not
theinbook. move
the text,Websites or cited
ownership delete areinformation.
of the current
mark at the
remains time
with theof publication.
lawful
While every effort has been made by the publishers, editor, Ifand
The author
owner has
of themade
mark. every
No effort
claim, to provide
intentional or accurate
otherwise, citations.
is made bythere are
reference to any such marks in the book. Websites cited are current at the
any
authors to ensure timeomissions,
the of accuracy please
publication.
contact
ofThetheauthor the publisher.
information
has made every contained in this
effort to provide book,
accurate
While
citations.every
If thereeffort
are any has been made
omissions, pleaseby the the
contact publishers
publisher.and the author
this organization accepts While
no responsibility forbyerrors or omissions. The
to ensure theevery effort has
accuracy of been
the made
information the publisher
containedand the
in author to
this book, this
views expressed inensure this book are those
the accuracy of theofinformation
the editorexpressed
and authorsin this and
book, may
the
organization
organization accepts
acceptsno no responsibility
responsibility forfor errors
errors or omissions.
or omissions. The views
The views
not represent those of either
expressed
expressedin this Davis
book
in this book Healthcare
are
arethose
those of theInternational
of the authorand
author andmaymay ornot
not therepresent
PDA,
represent its
those those of
officers, or directors.of either Davis Healthcare International or the PDA, its officers, or directors.
either Davis Healthcare International or the PDA, its officers, or directors.
This book is printed on sustainable resource paper approved by the Forest Stewardship
PDA This book is The
Council. printed onGasch
printer, sustainable
Printing, isresource Davis Healthcare International
a member paper approved
of the Green by theand
Press Initiative Forest Stewardship
all paper
Council. The printer, Gasch Printing, is a member of thePublishing, LLC and all
used is from SFI (Sustainable Forest Initiative) certified mills.
Green Press Initiative
paper
Bethesda Metro used is from SFI (Sustainable Forest Initiative) certified
Center 2636mills.West Street
PDA Global Headquarters Davis Healthcare International Publishing, LLC
Suite 1500 Bethesda Towers, Suite 150 River Grove 2636 West Street
PDA 4350
Global Headquarters
East-West Highway Davis Healthcare InternationalRiver Publishing,
Grove LLC
Baltimore, MDBethesda
20814 Towers,
Bethesda, Suite 600
MD 20814 IL 60171 2636 West Street
IL 60171
United States 4350 East-West
United States
Highway
www.pda.org/bookstore
United States United States
River Grove, IL 60171
www.DHIBooks.com
301-986-0293 Baltimore, MD 20814
001-301-986-0293 United States
www.DHIBooks.com
United States www.dhibooks.com
www.pda.org/bookstore
CONTENTS
FOREWORD ix
ACKNOWLEDGMENTS xiii
INTRODUCTION xv
ABOUT THE AUTHOR xxi
iii
LICENSED TO JOSE CASTELLA
iv
LICENSED TO JOSE CASTELLA
10 INTERVIEWS 165
Interviews compared to interrogations 165
Fear166
An interesting case study of how our memories
can warp 166
How memories are created—and recreated 167
Ways to obtain the most accurate recounting of an
incident168
The cognitive interview process 169
Conclusion173
References174
v
LICENSED TO JOSE CASTELLA
18 COMMUNICATION267
Who sees what? 268
Methods for incident communication 269
vii
LICENSED TO JOSE CASTELLA
INDEX 303
viii
LICENSED TO JOSE CASTELLA
FOREWORD
ix
LICENSED TO JOSE CASTELLA
x
LICENSED TO JOSE CASTELLA
Books such as Root Cause Investigations for CAPA: Clear and Simple
provide a path for reaching that level of awareness and thus help
our industry achieve our healthcare product and patient welfare
xi
LICENSED TO JOSE CASTELLA
Hal Baseman
June 2020
xii
LICENSED TO JOSE CASTELLA
ACKNOWLEDGMENTS
And a special thank you (again) to Gray Brown for being patient
and very supportive as I worked on yet another writing project.
xiii
LICENSED TO JOSE CASTELLA
xiv
LICENSED TO JOSE CASTELLA
INTRODUCTION
One of the most common questions a writer has is, “How do I start?”
In writing this book my question was, “how do I stop?” Every time
I went online to find an article or check a reference, I would see
something else and think, “I need to add that!” If I kept that up, I
would still be writing and discovering and adding and on and on.
(As it is, I am way overdue in getting this to the publisher.)
xv
LICENSED TO JOSE CASTELLA
xvi
LICENSED TO JOSE CASTELLA
xvii
LICENSED TO JOSE CASTELLA
xviii
LICENSED TO JOSE CASTELLA
James Vesper
Provincetown, MA
June 2020
xix
LICENSED TO JOSE CASTELLA
xx
LICENSED TO JOSE CASTELLA
xxi
LICENSED TO JOSE CASTELLA
xxii
LICENSED TO JOSE CASTELLA
Things happen:
1
LICENSED TO JOSE CASTELLA
Anyone who has worked in our industry has their own mental register
of favorite horror stories or nightmares they have experienced.
A DIFFERENT INDUSTRY
The pharmaceutical industry of today is significantly different
than the one of the 1970s–80s. That period was prior to the current
significant economic pressures placed on name brand manufacturers,
in part by competition from generic firms. Additionally, the supply
chain structure was very different from what we see today. Then,
if a batch of product failed due to a manufacturing breakdown, it
resulted in some discomfort but not too much agony. Yes, the failure
might have cost many thousands of dollars, but another lot of the
active pharmaceutical ingredient (API) could be made (or found)
and a replacement batch squeezed into the production schedule.
Batch rejection rates of 10–25 percent were not uncommon.
Pre-crash
Crash
Post-crash
After the crash of Asiana Airlines Flight 214 while landing in San
Francisco on 6 July 2013, the word “miracle” was used by reporters;
307 passengers and crew were on board; there were three fatalities.
Other commentators with an aviation background challenged that
characterization saying that air travel is safer because of the work
by professionals designing better planes, writing more effective
procedures, improving the use of simulator training, flight crews
communicating and working together more skillfully, and a host of
other factors (Brown, 2013).
Healthcare
In 1999, the US Institute of Medicine (IOM) released its report
To Err is Human: Building a Safer Health System, that said between
44,000 and 98,000 people die each year in the US due to preventable
medical mistakes. Total annual financial costs were estimated to
range from $17 to $29 billion, with added healthcare costs being half
of that amount, not to mention the physical pain and psychological
CONCLUSION
We have seen that there are reasons for investigating problems
and implementing corrective actions beyond simply wanting to
achieve regulatory compliance. Positive health outcomes, patient/
user safety, customer (stakeholder) satisfaction, and public health
are some examples; long-term cost reduction and other economic
benefits are others. Accomplishing these desirable outcomes
requires efficient, knowledge-generating investigations that point
to effective corrective actions, topics that will be covered in the
remaining chapters.
REFERENCES
AHA (2018) Trendwatch – Aligning efforts to improve quality.
American Hospital Association. https://www.aha.org/system/
files/2018-10/AHA_TrendWatch_Report_Quality_Healthcare_v31_
pages.pdf. Accessed 2 Mar 2020.
Brown, D.P. (2013) A few thoughts on Asiana Airlines flight 214 crash
at SFO. AirlineReporter, 8 Jul 2013. http://www.airlinereporter.
com/2013/07/a-few-thoughts-on-asiana-airlines-flight-214-crash-at-
sfo/. Accessed 19 Oct 2015.
REGULATORY REQUIREMENTS
AND EXPECTATIONS
(Portions of this chapter are from Chapter 6, “Discrepancy
Observations and Investigations,” in GMP in Practice, 5th Edition, 2018,
cowritten by James Vesper and Tim Sandle.)
• The root causes recorded were not always those identified in the
procedure.
• Two overdue CAPA were open at the time of the inspection (186
days and 60 days overdue).
GMP EXPECTATIONS
If one were to ask what inspectors in the US, Europe, Canada, and
the World Health Organization expect regarding investigations and
corrective actions, the answer may include the following summary
of expectations found in the GMP regulations from those countries
or regions (FDA, 2019; EC, 2020; Health Canada, 2018; WHO,
2011). Guidelines published by the International Conference on
Harmonization (ICH, 2008) are also included in this review.
CA
EU
WHO
ICH
There shall be a quality control unit that shall have the responsibility
and authority to approve or reject all components, drug product
containers, closures, in-process materials, packaging material,
labeling, and drug products, and the authority to review production
records to assure that no errors have occurred or, if errors have
occurred, that they have been fully investigated [§ 211.22(a)].
CA
EU
WHO
ICH
There shall be a quality control unit that shall have the responsibility
and authority . . . to review production records to assure that no
errors have occurred or, if errors have occurred, that they have been
fully investigated [§ 211.22(a)].
CA
Any deviation from instructions or procedures is avoided. If
deviations occur, qualified personnel investigate, and write a report
that describes the deviation, the investigation, the rationale for
disposition, and any follow-up activities required. The report is
approved by the quality control department and records maintained
[C.02.011 #5].
EU
Quality control personnel should have access to production areas
for sampling and investigation as appropriate [EU 6.4].
All incidents, not only system failures and data errors, should
be reported and assessed. The root cause of a critical incident should
be identified and should form the basis of corrective and preventive
actions [EU Annex 11, 13].
WHO
Any deviation from instructions or procedures should be avoided
as far as possible. If deviations occur, they should be done in
accordance with an approved procedure. The authorization of the
deviation should be approved in writing by a designated person,
with the involvement of the quality control department, when
appropriate [WHO Annex 3–GMP, 16.3].
ICH
A structured approach to the investigation process should be
used with the objective of determining the root cause. The level of
effort, formality, and documentation of the investigation should be
commensurate with the level of risk, in line with ICH Q9. CAPA
methodology should result in product and process improvements
and enhanced product and process understanding [Q10, 3.2.2].
First page:
Incident title, number, date of report, and summary (what was the
problem, what was the cause, what will be done).
Following pages:
k. Recommended disposition.
l. Background information (optional).
m. Documents attached (list).
US
CA
EU
WHO
ICH
The level of effort, formality, and documentation of the investigation
should be commensurate with the level of risk, in line with ICH Q9
[Q10, 3.2.2].
CA
EU
WHO
ICH
There shall be a quality control unit that shall have the responsibility
and authority . . . to review production records to assure that no
errors have occurred or, if errors have occurred, that they have been
fully investigated [§ 211.22(a)].
CA
EU
WHO
ICH
REFERENCES
EC (2020) EudraLex - Volume 4 - Good Manufacturing Practice
(GMP) guidelines. European Commission. https://ec.europa.eu/
health/documents/eudralex/vol-4_en. Accessed 2 Mar 2020.
31
LICENSED TO JOSE CASTELLA
– Looks for and shares ideas that could improve tasks, quality,
and compliance.
• Report writer
• Management
DEVELOPING COMPETENCIES
The combination of formal training, learning by doing, and being
coached are very effective ways of becoming competent. It is a
mistake, however, to assume that everyone needs the same type and
level of training. In most cases, it is more effective and efficient to
have a smaller number of lead investigators and report writers who
do many investigations than many people who do only one or so a
year. The more you do, the better you get.
Often the owner is the manager of the area where the problem
was found. This should be defined in a policy or procedure.
• Meaning: The work itself and the product generated gave each
team member a sense of pride and accomplishment.
• Impact: Team members felt that what they did was important,
contributing to something of value (re:Work, 2020).
a set of special skills and abilities to keep the group on task, follow
acceptable norms and practices, and help them reach their goal. In
other words, facilitators help establish and strengthen relationships
among the group’s members so they can effectively and efficiently
accomplish what they need to.
So, how do you work with the group to make the investigation
process as easy as possible ?
• People are more committed to ideas and plans that they have
helped to create.
• Help establish the goals and objectives for the project and the
agenda for each meeting.
• Keep the group on track with the agenda and process by asking
questions and minding the clock.
• Create and ensure a safe place where individuals and what they
are contributing are respected.
At the start of the session, go over the agenda with the participants
and obtain their agreement. Digressions from the agenda that might
occur during the session should be placed on the “parking lot” flip
chart. Consider having a team member be the timekeeper, perhaps
using the alarm on their smartphone to signal it is time to move
ahead.
Ground rules
Most people who have worked in team/group settings are
familiar with having ground rules that set the expectations for the
participants. These are primarily based on common sense and the
need for participants to respect each other and their ideas. Together,
these create a safe working environment to discuss, explore, and
decide.
Facilitation techniques
A good facilitator uses a number of techniques. These include:
• Using flip charts: Making flip chart notes not only records
decisions, priorities, and key points of discussion, but also
Notes/documentation
The facilitator is usually not the person who takes notes during the
session. Ask a team member to help with this, allowing you as the
facilitator to focus on the process and the people. The notetaker works
closely with the facilitator, listening particularly to the summaries
the facilitator gives with actions, timing, and assignments. (In some
situations, such as when the investigation is rather simple, it may be
more expeditious for the facilitator to provide a written summary of
what happened.)
REPORT WRITERS
Preparing the written report—whether it is in a more traditional
technical report format or in sections that are part of an online
tracking application—requires skills in organizing information,
clearly telling the story, and critical thinking. Some firms employ
technical writers for this task while others provide some level of
training for investigators/writers who have other responsibilities as
well. It is not unusual to find firms that approach this by having new
writers use existing reports as templates or models that are to be
followed. As will be discussed in later chapters, grammar, spelling,
and appearance are important to readers, particularly health
authority inspectors and auditors.
CONCLUSION
The size and scope of the investigation affects the decision of who
to include on the investigation team. Having curiosity, wanting to
learn, and critical thinking skills are important characteristics for
team members to have. Training in the investigation process, root
cause investigation tools, and facilitation skills can make team
members more effective and efficient.
REFERENCES
Duhigg, A. (2016) What Google learned from its question to build
the perfect team. NY Times Magazine, Feb 25, 2016. https://www.
nytimes.com/2016/02/28/magazine/what-google-learned-from-its-
quest-to-build-the-perfect-team.html. Accessed 5 Feb 2020.
47
LICENSED TO JOSE CASTELLA
The time the incident occurs is the beginning of the “golden hours,”
which are usually considered the first 24 hours after an accident or
event. Investigators (and detectives) view this period as the most
favorable and productive span of time for gathering information
(e.g., measurements, photographs, witness statements) and artifacts
(e.g., mixed-up tablets, incorrectly labeled bottles, broken gaskets)
before memories fade and items decompose, are lost, or mishandled.
For example:
1. How will you approach the investigation based on the information obtained up to
this point?
• What are the risks that could occur if we implement this change?
Step 12. Write the report; send for review and approval.
This task may have started when the unwanted event was first
recognized; it often spans the entire investigation and CAPA process.
As lessons learned become evident and solutions are put in place,
information should be shared with those who may be affected by the
event or could benefit in some way. This should not be considered
the same as training, but rather disseminating relevant information.
(See Chapter 18.)
CONCLUSION
The process described in this chapter is meant to be a general guide
for those performing, managing, or supporting an investigation.
Steps may be performed in a different sequence; in some cases a
step may be added or deleted. In any case, these eight key questions
need to be answered:
• What happened?
REFERENCES
ICH (2008) Q10 – Pharmaceutical quality system. Geneva:
International Conference for Harmonisation. https://database.ich.
org/sites/default/files/Q10_Guideline.pdf. Accessed 12 Mar 2020.
PSYCHOLOGICAL SAFETY
Before describing ways that unwanted events can be discovered,
an important question to ask is, “How willing is someone to tell
others, including management, the news of a problem or failure,
particularly if that person was involved or responsible?” While
there are often human factors and error traps that are the underlying
causes of what we call “human error,” it still takes a large amount of
courage for someone to self-report a mistake or problem.
DIRECT OBSERVATION
The simplest way to discover an event that needs some level of
investigation is when one senses a problem. The problem could be
of a type that anyone would conclude that “we’ve got a problem
here”—a solvent leak, a packaging line that unexpectedly stops, or
a person who has called to say she found a red tablet in a bottle
of white tablets. In each of these examples, the person that raises
the issue has determined that what they are experiencing does
not match the mental model of what they were expecting or the
“normal,” desired situation.
Mental models
A mental model is a picture that a person has in their mind of how
something works, how the pieces fit together, and the expectations
of what should happen. Examples of using a mental model could
be an employee who notices something that seems out of place or a
customer who calls in with a complaint that there is a blue tablet in
what should be a bottle of all white tablets.
• Structure: The parts and how they fit together to form a system
(the push button, the wire or wireless connection, and the device
that creates the audible sound).
Situational awareness
A key difference that distinguishes a novice airplane pilot from those
who are the best in their profession (and have the most flying time)
is situational awareness (Endsley, 2006), a characteristic that world-
class athletes have as well (Epstein, 2013). Hockey great Wayne
Gretzky put it simply: “A good hockey player plays where the puck
is. A great hockey player plays where the puck is going to be.”
From the above definition and example, one can see that
situational awareness has three levels (Endsley, 2006). First,
there is the perception: acquiring important information about the
situation. The operator heard something unusual. Perception has
two important requirements. First, the person must be able to sense
and acquire the information. An interesting study of hundreds
of baseball players showed that professional players’ vision was
superior to the vision of college athletes and far better than the
general public. Visual acuity that allows players to sense information
earlier (literally milliseconds) and from farther away is a predictor
of success (Epstein, 2013). The second important requirement of
perception is paying attention to the important information and
ignoring nonvalue-adding information. Having expertise in a field
(meaning that the person has had many years of being exposed
to a variety of experiences and challenges) is critical in discerning
what is and is not important. And, even with that expertise, in many
complex, dynamic situations, this determination still can be difficult.
In the CIP example, the operator was able to distinguish the normal
sounds of the equipment and ambient room noise from the unusual
one.
not when they have completed their assigned task but now—
immediately.
But what about those cases when someone without any specific
experience in field walks into a laboratory or looks at a graph with
“new eyes” and asks a question that initiates an investigation? They
may not have extra sensory perception (ESP), but they might have
another mental model (e.g., data recording and integrity) that they
are applying in a different situation. Or, perhaps those who have
spent many hours in one particular lab or reviewing data have failed
to keep their mental models current. Or there may be a perceptual
bias often described as “we see what we want to see.”
Signal detection
Another avenue that can identify an event is signal detection. While
there are many definitions of signal, a simple, useful one is that
a signal is “a clue or a sign or a piece of evidence” that gives an
Control charts show the results of the process, not the cause.
They can be used to help identify special cause variation where a
change has occurred resulting in variation that is outside the typical
or historical patterns seen in the process. Control charts can also
show common cause variation, such as natural, typical patterns or
when the process devolves or deteriorates. To understand these
types of problems, one needs to have a thorough understanding of
the process and factors that can affect it. Figure 1 shows a simplified
example of a control chart.
CONCLUSION
Discovering that an event is happening or has occurred can be
done through direct observation of an event or a symptom that is in
evidence. Events can also be discovered by looking at data that has
been collected using a range of tools—some simple and others very
complex.
REFERENCES
Craik, K.J.W. (1943) The Nature of Explanation. Cambridge, UK:
Cambridge University Press.
Lazer, D., Kennedy, R., King, G., Vespignani, A. (2014) The parable
of Google flu: Traps in big data analysis. Science, 343:1203–1205.
Silver, N. (2012) The Signal and the Noise: Why So Many Predictions
Fail – but Some Don’t. New York, NY: Penguin Press.
Tague, N.R. (2005) The Quality Tool Box, 2nd ed. Milwaukee, WI:
Quality Press.
77
LICENSED TO JOSE CASTELLA
Risk analysis. For each hazard, the team determines its cause,
severity, current control/detection strategy, and likelihood of
occurrence. Tools used for risk analysis include risk rating check
sheets, preliminary risk assessment (PRA), (the over-used [Vesper
et al., 2016]) failure mode and effects analysis (FMEA), and fault
tree analysis (FTA). Rating scales are used to assign categories of
severity, likelihood, and sometimes, if FMEA is used, detection.
Severity
Risk-based triage
The most frequently seen application of RBT and QRM related to
quality events and deviations is having an established set of rules
to determine the extent and thoroughness of the investigation
that is warranted based on what is initially known. The concept
of triage is thought to have originated in the Napoleonic wars,
where responders to an event with mass casualties would quickly
categorize those who would be expected to succumb to their injuries
regardless of further treatment; those who would be expected to
survive without additional treatment because their injuries were
minor; and those who, only if they received medical attention,
would survive. Obviously, the evaluation criteria for quality events
is different, but the concept of categorizing the events to decide
where to spend one’s limited resources is similar. Using this type of
risk-based approach is aligned with the Pareto Principle, or the 80/20
rule, that quality guru Joseph Juran said applied to quality problems
in manufacturing where a small number of causes produced most
of the defects, or as he said, “the vital few and the trivial many”
(Hindle, 2008, p. 55).
Some organizations also stipulate that this type of event (one that
has low criticality) has happened fewer than x times in a 12-month
period.
Patient impact
Two questions that a health authority inspector, when reviewing a
deviation investigation, will want to see answered are, “What is the
scope of the problem?” i.e., “How big is the problem?” and “What
is the effect or impact of this problem on patients?” One way to
answer the effect question is through an impact assessment. There
are different ways this can be considered. One qualitative approach
comes from the “Swiss cheese” model discussed in Chapter 7.
Using this concept, one can describe the quality system elements or
practices that are multiple effective barriers to the defect harming
the patient.
CONCLUSION
There are many important decisions that need to be made related
to a quality event or deviation. Quality risk management and risk-
based thinking can be the source of information that will contribute
to consistent, data-driven decisions. It is important that the decision
is as free from bias as possible and not simply a justification of a
poor practice.
REFERENCES
Hindle, T. (2008) Guide to Management Ideas and Gurus. London:
Profile Books.
All models are wrong, but some are useful. (George E. P. Box)
SINGLE-EVENT MODEL
Perhaps the oldest—and simplest—paradigm explaining how
incidents happen is the single-event model. Its simplicity, however,
is also its weakness. According to the model, an incident is caused by
a single event that is necessary (i.e., the situation would not happen
if the event didn’t take place) and sufficient (i.e., the event was all
that was needed to cause the incident). In other words, the model
simply shows cause and direct effect.
For example, I’m walking and fall because I slip on an icy patch.
Or a shark bites a swimmer’s leg because she was in the “wrong
place at the wrong time.” Or the wrong ingredient is added to a
tank because the operator misread the ingredient’s label. In each of
these examples there is a cause (ice, shark, operator mistake) and
an effect (fall, injury, contaminated batch). The single-event model
merely presents the occurrence of an event, without asking why or
how the triggering (proximal) event occurred.
why or how events unfolded. The news media often present events
in this way: A reporter may report that an alligator has gotten into
a family’s swimming pool in Florida (cause) and scared the family
(effect) but feels no need to explain how the alligator got there in the
first place—the houses encroaching on and destroying the natural
habitat of the alligator, etc. This approach is also evident in accident
statistics, in which events are regarded as having only one cause,
and in insurance policies, which refer to “acts of God.”
4. Accident
5. Damage or injury
HIERARCHICAL MODELS
A third type of model used to describe accidents is the “hierarchical
model.” Unlike the domino model, in which one thing causes another
to occur, the hierarchical model identifies factors that contributed to
or set the stage for an accident but did not cause it directly.
FACTORIAL MODEL
Systems Accident
Almost all the tools used for risk estimation can consider the
contribution of human factors to accidents or incidents. Checklists
of points to consider when using a preliminary risk analysis or
guide words when examining processes and procedures have been
identified (Bridges et al., 1994), but a simpler—and potentially
more powerful—approach when identifying a potential human
contributor to a possible incident is to ask repeatedly, “How could
that happen?” or “What is the true cause of that?”
CONCLUSION
We’ve looked at some models used to explain how accidents occur;
the most useful ones are those that take into consideration indirect
or contributing factors as well as direct causes.
REFERENCES
Benner, L. (1975) Accident investigation: Multilinear events
sequencing methods. Journal of Safety Research, 7:2.
Gupta, J.P. (2002) The Bhopal gas tragedy: could it have happened
in a developed country? Journal of Loss Prevention, 15:1–4.
Or consider this: The same situation, but in this case, the message on
your screen says:
This computer is not properly connecting with the projector. Check the
connection or replace the cable.
Now consider this situation: You are sent a quality event notice
that there was a mix-up in the clinical supplies that were sent to a
clinical study site. The root cause of this was identified as:
Human error.
Or, the same the situation, but in this case more details are provided:
There was not a packaging list or checklist used to confirm contents of package
being sent to clinical study site PQR.
107
LICENSED TO JOSE CASTELLA
The term human factors is used two different ways. The first is
in the context of ergonomics, specifically:
Human factors use knowledge of human abilities and limitations to
design systems, organizations, jobs, machines, tools, and consumer
products for safe, efficient, and comfortable human use (HFES, 2019).
The second usage, the one that is more suited to our discussion, can
be defined as:
Human factors are the faulty systems, processes, circumstances,
conditions, and causes that lead people to make mistakes or fail to
prevent them.
CLASSIFICATIONS
You can categorize the things that cause or contribute to human
error in different ways. The simplest has three groupings:
Category Example
Deviance Individual chooses to violate a
Blameworthy prescribed process, procedure, or
practice.
Inattention Individual inadvertently deviates from
procedure or specification
Lack of ability Individual does not have knowledge,
skills, capability, or training to perform
task.
Process inadequacy Competent individual follows
procedures or instructions for an
inherently faulty or incomplete
process.
Task challenge Competent individual attempts a task
too difficult to be executed reliably
every time.
Process complexity A process composed of many
elements fails when it encounters
novel inter-actions.
Uncertainty Lack of clarity about future events
causes people to take reasonable
actions with undesired results.
Hypothesis testing An experiment conducted to prove
that an idea or design will succeed
fails.
Exploratory testing An experiment conducted to expand
Praiseworthy knowledge and investigate a possibility
leads to an undesired result.
An example
– Tools and technology: The tools and supports that are part
of the task. It could be that the procedure (a simple, often
paper-based technology) was confusing or missing a step.
Another example could be poorly designed batch records
(whether on paper or online) that vary widely from product
to product.
The next level up in the HFACS model considers the role that
supervision has. Generally these are all latent errors or underlying
conditions that are often uncovered by repeatedly asking “why?”
Four categories related to supervision are:
An example
It was observed that operators were signing the “verified by” sections
of production records in a way inconsistent with industry practice.
Specifically, the personnel sat at a table at the end of their shift and
filled in the “verified by” entries. The consultant asked the operators
what “verification” meant and received a variety of answers, none of
them being “having real-time witness of a critical activity,” which is
a definition often used in the pharma industry. There was no formal
training provided on verification nor were there definitions in any
policy or procedure. The event was considered to be a decision
error—that is, “conscious, goal-intended behavior that proceeds as
designed, yet the plan proves inadequate or inappropriate for the
situation” (HFACS, 2014). (Had there been proper definitions and
if people had been trained on these, the event would have been
classified as a violation.)
The list of TWIN error precursors can be used in at least two different
ways. First, when investigating an event that is categorized as human
error, the lists can be checklists of points to consider. Second, these
lists can also be used proactively—for example, a smaller subset of
the items under each TWIN category could be selected, in part based
on trends within a department or organization. Prior to beginning
a task, those involved can see if any of these precursors might exist
and take appropriate precautions.
An example
No flight crewmember may engage in, nor may any pilot in command
permit, any activity during a critical phase of flight which could
distract any flight crewmember from the performance of his or her
duties or which could interfere in any way with the proper conduct of
those duties (US FAA 14 CFR § 121.542 (2)).
Checklist questions
One other tool that can be useful during an investigation of human
error is a set of questions to ask. This list (Table 4) was developed by
Kevin O’Donnell, PhD, a highly regarded inspector with the Irish
Health Products Regulatory Agency (HPRA) (Poska, 2010).
Other tools
If you do a web search for tools related to investigating human error,
you will find a large number of them. Each has its own advantages
and disadvantages. Some tools are very detailed and can be used in
cataloging and creating metrics about specific human factors that
cause or contribute to unwanted events. This can be very helpful
but also requires significantly more effort as one learns to effectively
use the tool. There are other tools that use a defined set of categories
and failure causes. These are helpful to new investigators but can be
limiting as they do not encourage one to think outside of the lines or
boundaries of the tool.
CONCLUSION
One of the most important improvements you can make in your
organization’s root cause investigation program would be to not
allow human error to be named as the root cause of an unwanted
event. When you see that there is a person at the “sharp end” of a
failure sequence, there are other factors that cause or contributed to
that event. Your challenge will be find those causes by using some of
the tools discussed in this chapter.
VALUABLE RESOURCES
Two very useful manuals are available (free!) on the web and are
among the best detailed guides for understanding human factors
and performance. Both were written for the US Department of
Energy and have been used in preparing this chapter:
REFERENCES
DOE (2009) Human Performance Improvement Handbook, Volume
2: Human Performance Tools for Individuals, Work Teams, and
Management. DOE-HDBK-1028-2009.
Juran, J. (1992) Juran on Quality by Design: The New Steps for Planning
Quality into Goods and Services. New York, NY: The Free Press
Kahneman, D. (2011) Thinking, Fast and Slow. New York, NY: Farrar,
Straus and Giroux
Moura, R., Beer, M. Patelli, E., Lewis, J., Knoll, F. (2016) Learning from
major accidents to improve system design. Safety Science, Vol.
84, April 2016, pp. 37–45. https://strathprints.strath.ac.uk/70459/1/
Moura_etal_SS2016_Learning_past_accidents_improve_system_
design.pdf. Accessed 3 Mar 2020.
137
LICENSED TO JOSE CASTELLA
The symbols used in flow charting have long been used in data
processing and information technology (IT) applications. Figure 4
shows some of the more commonly used symbols.
• Methods
• Materials
• People
• Equipment
• Facilities
Brainstorming
The term “brainstorming” was used first by advertising executive
Alex Faickney Osborne in 1948 and then later in his 1953 book, Applied
Imagination, to describe a method of generating novel ideas—the
wilder and crazier the better. Since then it has been a method used
to solicit ideas or thoughts for most anything including potential
hazards and, in our case, reasons for an unwanted event.
2. Practice no criticism.
Timelines
A relatively simple tool that has special utility when you are trying to
understand an event where the sequence or chronology is important
is a timeline. A simple timeline (Figure 6) could be used in presenting
Five whys
If you have spent much time with a six- or seven-year-old child, you
probably have experienced being asked multiple “why” questions––
Why is the sky blue? Why can you not see the blue at night? Why do
stars only come out at night? And on and on. That same tenacity at
asking questions is what drives the five whys approach to problem
solving. In talking with those involved in pharma/biopharma
investigations, the five whys method is one of the most commonly
used. The intent of the five whys is to look for the series of cause
and effect relationships that form the failure sequence. By asking
these repeated why questions, you are in fact creating a fault tree
(discussed later in this chapter).
The five whys method can be used in two different ways. First, it
can be used to generate hypotheses—what are the possible reasons
that the unwanted event could happen? Think of this as looking at
the problem broadly. Second, the method can help go more deeply
into a particular set of reasons that are evidence-supported.
One caution in using the five whys is that once you start in a
category (for instance, “equipment”), you are confined in that
category. For example, if you are asking why your car will not start
and you are not looking for evidence as you move down, you become
locked into a particular pathway (e.g., people or machines). To avoid
this, consider asking why about a particular fishbone category—for
example, methods or materials. If evidence cannot be found for that
category, you have at least tried to broadly look at potential causes.
Change analysis
Things are working well, specifications are being met, and then all
of a sudden your results are not as expected. You have a problem.
One way to determine the cause of the problem is to perform a
change analysis and look at the differences between the acceptable
and nonacceptable product, process, event, or decision. To isolate
one or more factors responsible for difference, change analysis uses
a set of categories including what, where, when, who, and extent
(how much or how many). Once the factors that have changed are
identified by this method, the five whys could be used to determine
the root cause of the change.
Problem statement: The number of line clearance failures is higher during some
months than at other times during the year.
IS IS NOT
What There are a higher number of There is not a problem with
quality incidents due to presence other operational metrics
of unwanted (“rogue”) tablets
Where The higher failure rate is at the The higher rate is not seen at
Tech Ops facility other facilities
When The higher rate occurs in Feb, The significantly higher rate does
Aug, and Nov not occur at other times
Extent The higher rate is seen across all There is not one line higher than
solid labeling/packaging lines another at the TO facility
Who Most operational associates have Most operational associates at
>25 years of experience other facilities have 8–12 years of
experience
As the investigator did the analysis, he was able to see that the
site where the problems were occurring (and recurring) was where
there were long-service employees who had considerable amounts
of vacation. They would take their vacations in the winter (often
leaving the cold Midwest for warmer climes), at the end of summer
with their children or grandchildren (before school started), and
before the Christmas holidays. This insight brought attention to the
inadequate training of replacement and temporary personnel used
to fill in for the vacationing personnel.
Fault trees
A fault tree (FT) is graphical tool that can be used to identify the
actual or potential root causes of an event; the tool can be used
prospectively (in a risk assessment) or reactively (in an incident
investigation). It can give details about the failure pathway or failure
mechanism and the various factors that are needed for the unwanted
event to occur. FTs are described as “top-down” tools because they
begin with an unwanted event at the top of the diagram and then
work down to identify the root causes. FTs use deductive reasoning
by asking the question, “What caused . . . ?” or more simply, “Why?”
(As mentioned earlier, the five whys method can be used to create
FT diagrams.) When used in a root cause investigation, FTs allow the
team to consider multiple paths to the top event (the failure or fault);
as evidence is collected, this supports particular failure pathways.
A fault tree begins with the “top event.” Usually this is the
symptom that was observed. The diagram builds incrementally,
moving deeper and deeper into the reasons why this event occurred.
It is important that you move through these systematically; you are
wanting to understand each different level of the events.
2. Identify what could cause this top event. These are known as
fault events. Use rectangles to name these; if you do not intend
to go any further into the cause of a fault event, put the name in
a diamond shape. A diamond acts as a placeholder to show that
you considered this but, for whatever reason, are not going into
more detail.
3. Connect this layer of fault events to the top event by using a gate
(a logical connector). Usually at this first level, each of the fault
events you identify is necessary and sufficient to cause the top
event, so you use an OR gate. This would indicate that any of
the events below the gate are potential causes for the top event.
If two or more events are needed, meaning that they are both/
all necessary but in and of themselves not sufficient, you use an
AND gate.
4. Consider the evidence or data that you have. What events are
supported by the evidence? Go deeper into these by asking
what caused each of these events based on the data you have.
If you are making an informed guess, this would be where you
would search out data to support or reject your idea. Again,
think incrementally. Add AND or OR gates as appropriate.
5. Continue until you get to the basic (or root) causes, where you
can take action so these things do not happen again.
Observation
Looking and seeing employ the sense of sight, but observation takes
seeing to another level. Sherlock Holmes, the detective created
by the author Sir Conan Doyle (1930) has an exchange with his
colleague, Dr. Watson, in the story “A Scandal in Bohemia,” and
makes the distinction between seeing and observing. At issue was
whether Watson, who had been up and down a set of stairs on many
occasions, knew how many stairs there were. Watson claimed his
eyes are just as good as Holmes, to which Holmes replied:
“Quite so,” [Holmes] answered, lighting a cigarette, and throwing
himself down into an armchair. “You see, but you do not observe. The
distinction is clear. For example, you have frequently seen the steps
which lead up from the hall to this room.”
“Frequently.”
“How often?”
“Well, some hundreds of times.”
“Then how many are there?”
“How many? I don’t know.”
“Quite so! You have not observed. And yet you have seen. That is just
my point. Now, I know that there are seventeen steps, because I have
both seen and observed.” (Emphasis added.)
Within the Assess step, she uses another acronym, COBRA, to guide
the observer through the process:
Ask: Ask someone else to observe with you. They may have a
very different set of experiences and expertise that allows them to
pick up something you didn’t see or didn’t think was important. Try
to find someone who is neutral and not as invested in the situation
as you are. They may also be able to more precisely name something
that can help as you communicate or articulate your findings.
semi had tears and rips on the lowest two layers; white crystals that
appear to be sodium chloride are seen on the floor of the truck.” Also,
it is not yet time to conclude that the ripped bags were made by the
poor handling or truck loading practices. That can be determined
with more investigation.
A3
The A3 approach to problem solving is really more than a simple
tool—it is a formalized method that takes one through the problem-
solving and improvement process. The result is a record of the
investigation and its results that are used for documentation and
communication. The approach comes from the Toyota Quality
System (Shook, 2009) and is centered on a piece of ISO A3 paper,
11.8 x 16.5 inches (29.7 x 42.0 cm), which has formally designated
places for presenting different types of information. And yes, the
method provides a defined way to fold the paper so it fits neatly in
a report or folder.
Connector
between events;
Connector from
a condition
Assumed event Data is not available to confirm event; no
solid data available.
(Dotted lines can make any symbol into
one that is assumed.)
Information that is added to the ECF chart can come from any of
the other tools we have discussed; the chart is a tool used to organize
and display the information so that it can be more easily analyzed to
discover the root, proximal, and contributing causes.
• What was the action, event, or decision that caused the unwanted
event? This is the “proximal event.”
• What were the events that began the path toward the ultimate
unwanted event (e.g., failure)? (Note: you may find more than
one of these!) These would be the “root cause(s).”
• What are the controls that did not work or did not exist? These
are candidates for corrective actions.
As you move through the process, you might find the tool being
used is not helping you achieve your objective. In that case, try
another one.
CONCLUSION
As with other activities—preparing lab samples, doing home
repairs, performing surgery—having the right tools will contribute
to better and more efficient outcomes. The same holds true when
investigating an unwanted event; there are a number of tools beyond
the commonly used fishbone and five whys. Taking some time to
learn and apply them can greatly improve your investigational
skills.
REFERENCES
Doyle, A.C. (1930) The complete Sherlock Holmes. Garden City, NY:
Doubleday.
ISPE and PDA (2019) ISPE – PDA guide to improving quality culture
in pharmaceutical manufacturing facilities. https://ispe.org/sites/
default/files/regulatory/ispe-pda-guide-to-improving-quality-culture.
pdf. Accessed 5 Mar 2020.
PDA (2008) Technical Report #54: Quality Risk Management for Aseptic
Processes. Bethesda, MD: PDA.
10
INTERVIEWS
165
LICENSED TO JOSE CASTELLA
FEAR
As you are getting ready to meet with those involved in the event,
think of their predicament. Because you want to talk with them,
they may fear that their job is on the line. As was discussed in
Chapter 5, the concept of psychological safety is critically important.
If personnel perceive that there is a safe space where they can be
honest and vulnerable without fear of retribution, they will be much
more willing to be candid (Edmondson, 2018).
antiaircraft fire, when, in actuality, it was the lead helicopter that was
struck; his aircraft was not affected. In April 2015, it was reported
that there were other exaggerations attributed to him (Roig-Franzia,
et al., 2015; Farhi, 2015).
But the retrieval and re-storage of the information may cause the
memory to be altered in some ways (Bridge et al., 2012). For example,
if you are telling a colleague the story of an event you witnessed,
you may see their eyes widen at a certain point or that they appear
to be bored at another point. Your brain is registering this, saving
the additional new “meta” information as it re-stores and forms a
new, “richer” memory of the story you are telling. The next time
you tell the story, you might give a subtle punch to the part that
excited the previous listener and quickly gloss over the part that was
a bit duller. Doing this several times can wipe out the initial version
of the story, replacing it with a new, slightly different account.
Conceivably, this could have been what affected Mr. Williams’s
memory if he did multiple retellings of his story.
have some utility, getting the person to talk about what happened
by using a structured interview process is preferred.
Tell interviewee to
be as specific as
possible and add as
many details as they
remember.
When appropriate,
ask interviewee to
make a sketch or
drawing.
Principle of Focus on each scene You’re talking about
momentum or section of the the sound you
story; do not jump heard when the unit
around. malfunctioned. Was
there anything else
that you noticed at
the time?
Multiple and Enhance recall by Let’s start at the end Do the reverse order
varied recall asking the interviewee of the event when only when the story
to tell about the event you realized the has been completely
in reverse order or malfunction... What told and other follow-
from a different point were you doing right up questions have
of view. before that? been asked; otherwise
it could distort the
You said your chronology in the
supervisor was on witness’s mind.
the other side of the
room. What do you
think she would have
seen or heard?
reestablish the context. When ready, the person begins to tell the
story. It’s critical to avoid interrupting the interviewee—a story
that is free-flowing is what is desired here. Nonverbal cues like
nodding one’s head can provide useful feedback to the person
giving the information without breaking up the narrative. One
of the most difficult aspects of this technique for the interviewer
is not interrupting the person and asking for more detail; the
opportunity of drilling down occurs after the first complete
telling of the story. While the interviewee is giving the story, the
interviewer should take only minimal notes and focus on what
is being said.
CONCLUSION
Interviews are an important part of investigating an incident as
one tries to determine the root, contributing, and proximal causes
REFERENCES
Bjork, R. (2012) Storage strength vs. retrieval strength. https://www.
youtube.com/watch?v=1FQoGUCgb5w. Accessed 8 Feb 2020.
Chabris, C.F. and Simons, D. (2014) Why our memory fails us. New
York Times. https://www.nytimes.com/2014/12/02/opinion/why-
our-memory-fails-us.html. Accessed 3 Mar 2020.
Deming, W.E. (1982) Out of the Crisis. Cambridge, MA: MIT Press.
11
IMMEDIATE ACTIONS
Earlier, we defined immediate action as: the action(s) taken to stop the
event or nonconformance from getting worse. In other words, you are
trying to figuratively “stop the bleeding.” Initiating an investigation
because of a deviation would not be considered an immediate action.
177
LICENSED TO JOSE CASTELLA
CORRECTIONS
The International Standards Organization (ISO) defined corrections
to be: “Action to eliminate a detected nonconformity” (ISO 9000:2005
– 3.6.2). Associated with this definition are two notes:
CONCLUSION
When an incident occurs, the immediate actions can make a
significant difference in the scope and impact of the situation. In
some cases the knowledge of a system and common sense may be
enough; in other cases having some level of preparation (“just in
case”) or even a checklist or procedure may be warranted.
12
181
LICENSED TO JOSE CASTELLA
Since the unwanted happened and you are wanting to not have it
happen again, you are taking a corrective action.
You also should consider new or residual risks that might occur
as a result of the change—this is part of the risk management process
discussed earlier (Chapter 6). A residual risk is defined as risk that
remains after the corrections, corrective actions, and preventive
actions have been put into place. The risk assessment does not
necessarily have to be complicated. For most simple proposed
corrective actions, five questions can get you through the process:
Elimination
If your cause indicated that something specific was the cause of the
unwanted event, could it be eliminated? For example, if cardboard
cartons that caused nonviable particulate contamination were used
to transport items into a classified environment, would it be possible
to get rid of the cardboard? Elimination also applies to processing
steps or activities associated with a process. Having a detailed flow
diagram or process map is very helpful here.
Substitution
If elimination of a hazard is not possible, consider substituting a
less hazardous action or material. For example, in an equipment
preparation area (grade C/D or ISO-8), washed/cleaned parts were
placed in plastic bags and heat sealed. Nonviable air samples
showed that the heating element was sometimes generating low
amounts of fume when sealing the bags. The firm eliminated the
bagging/sealing process step and substituted it with putting the
parts into clean plastic totes that had a lid which could be tightly
sealed. Another example is that instead of using general-purpose
copy paper for batch records in an aseptic processing area, the
Engineering controls/automation
One way to eliminate “human error” is to not have people involved
in the activity, something often called “engineering the person out.”
Automating and then validating the process can provide a high
degree of assurance that it will be performed the same way every
time. For example, some firms use robots to load and unload freeze
dryers or they have highly automated aseptic filling lines placed in
isolators.
Isolation
Isolating an activity can remove the hazard from what it could
adversely affect. Using an isolator in which aseptic filling occurs
is an obvious example, as is keeping wood pallets out of drug
manufacturing areas. Isolation may be physical (like placing
unapproved or rejected materials in a quarantine lockup) or
temporal (relying on timing and scheduling).
Alerts or notifications
Alarms can be useful in certain environments to notify personnel,
for example, if an air handling system is not functioning correctly.
The type of alarm—visual or auditory—needs to be considered in
terms of where it will be used and the personnel working in that
area. A major concern with alarms, however, is alarm fatigue—
“sensory overload when clinicians (in a hospital setting, for
example) are exposed to an excessive number of alarms, which can
result in desensitization to alarms and missed alarms” (Sendelbach
et al., 2013). Research has shown that 70–99% of all alarms are false
(Gaines, 2019).
if something broke down, the problem would not affect the entire
system (e.g., walkway, process, etc.) but only part of it.
Training
Training or “re-training” is the other most frequently seen corrective
action, often because the investigation was not or very poorly
performed. Health authority inspectors know this, and effectiveness
checks—the corrective action did not prevent a recurrence—show
this as well.
to the lab manager, the inspector said this looked like there was a
significant problem with the lab’s training program; the manager
reluctantly agreed. The inspector then used some logic Jiu Jitsu
on the manager: “You’ve just admitted your training program is
inadequate. So, how do you know that a particular result that was
within limits that this lab analyst generated was truly within the
specification? How do you know that it was not just luck that caused
a bad product to be tested with the results of a good product because,
as you just said, you have an inadequate training program?” The
inspector had backed the manager into a corner that he could not
get out of.
“Nudges”
Nobel prize winner Richard Thaler and legal scholar Cass Sunstein
define nudges in their book of the same name (Thaler et al., 2009) as
positive reinforcements and indirect suggestions as ways to influence
the behavior and decision making of groups or individuals. Nudging
is a form of choice architecture that attempts to guide people to an
appropriate decision or action through the use of psychological-
based cues. One of the most notable nudges was implemented at
Amsterdam’s Schiphol airport. Not long after the airport was built,
facility management was reviewing the operational costs and noticed
that they were over budget in housekeeping and cleaning. In asking
“why?” multiple times, they found that cleaners were spending
more time cleaning the men’s washrooms, particularly around the
urinals. The issue, to put it most delicately, was the poor aim that
some men had when using the appliance. To reduce the problem
of poor aim, they had urinals made with the image of a black fly
etched into the porcelain. (If you are not a male, ask a man in your
Swiss cheese
When we looked at accident models (Chapter 7), we were considering
how unwanted events occurred because of failures due to barriers
that were either not present or that had inadequacies (conceptually
represented by the holes in the Swiss cheese). This model, however,
can be viewed another way, with the cheese slices representing
controls that are put in place to prevent a recurrence.
As one moves down the list of practices (Table 3) from the device
or process controlled confirmation to review, the rigor decreases and
there is more time between the event that is being confirmed (and
in a sense controlled) and the confirmation performed. There is also
an increased risk to operators (e.g., safety and health), products,
and patients if the event is not executed fully and properly.
I knew that farther ahead was another street that could get me
to the airport. I continued ahead, made a right turn, and was on this
parallel road without any problem. Error recovery had just occurred.
thieves triggered the alarm three times during a storm with the
alarm company responding each time by visiting. On the fourth
triggering, the alarm company did not respond—they figured
that the storm was causing false alarms; the thieves made their
move, stealing nearly $80 million of brand-name drug products
(NPR, 2010; Mahoney, 2012).
CONCLUSION
To be effective, corrective actions need to be aligned with the root
and proximal causes. Fortunately there are a number of different
approaches that can be taken depending on what was discovered.
If the causes are not found, efforts should emphasize how to detect
the event should it happen again and to protect the things of value.
REFERENCES
ASQ (Undated) What are the five S’s (5S) of lean? https://asq.org/
quality-resources/lean/five-s-tutorial. Accessed 26 Mar 2020.
Carr, N. (2014) The Glass Cage: Automation and Us. New York, NY: W.
W. Norton and Company.
FDA (2011) Guidance for industry process validation: General
principles and practices. https://www.fda.gov/files/drugs/published/
Process-Validation--General-Principles-and-Practices.pdf. Accessed
27 Apr 2020.
Gaines, K. (2019) Alarm fatigue is way too real (and scary) for nurses.
Aug 19, 2019. https://nurse.org/articles/alarm-fatigue-statistics-
patient-safety/. Accessed 8 Feb 2020.
GHTF (2010) Quality management system – Medical Devices –
Guidance on corrective action and preventive action and related
QMS processes. Global Harmonisation Task Force. http://www.
imdrf.org/docs/ghtf/final/sg3/technical-docs/ghtf-sg3-n18-2010-qms-
guidance-on-corrective-preventative-action-101104.pdf. Accessed
27 Apr 2020. NOTE: The work of the now-defunct Global
Harmonisation Task Force became part of the International
Medical Device Regulators Forum (IMDRF) in 2011.
Krantz, J. (2009) Recipe for disaster: The formula that killed Wall
Street. https://www.wired.com/2009/02/wp-quant/. Accessed 3 Mar
2020.
Mahoney, E. (2012) Feds say they crack $80 million drug heist from
pharmaceutical warehouse in Enfield. https://www.courant.com/
news/connecticut/hc-enfield-eli-lilly-drugs-0504-20120503-story.
html. Accessed 4 Mar 2020.
NPR (2010) Thieves grab $75 million worth of Eli Lilly pills. Mar
17, 2010. https://www.npr.org/templates/story/story.php?storyId=
124758613. Accessed 3 Mar 2020.
13
PROCEDURES: CAUSES OF
PROBLEMS AND POTENTIAL
CORRECTIVE ACTIONS
Each of the sets of reasons have different causes, but all can
result in similar compliance, performance, quality, safety, and
business impacts.
inserting a sterile cotton ball into the top of the tube” to mean for
her to physically leave the laboratory, instead of what was intended:
leaving a space between the media and cotton. As workforces get
more diverse and technology transfers extend to different countries,
having a consistent understanding of instructions is critical.
but the second person was not available when the step had to be
completed? Or a supervisor sharing their password with someone
else to enter the data because the supervisor could not keep up with
the workload assigned? Most people do not want to simply violate
rules or procedures; there is often an underlying reason for it.
If you had done this trip many times, you might only need the
simple memory jog that the low level of detail provides. If you had
never been there and were unfamiliar with the highways, bridges,
and how they change names, a higher level of detail may be best.
The Commissioner agrees that the phrase “in detail” could be construed
to include description of insignificant portions of the procedure, which
is not the intent. Therefore, he is inserting the word “sufficient” before
the word “detail” (FDA, 1978, comment #202, p. 45043).
High Connect
Enter
Adjust
Set
Record
Confirm
and those performing a task. Table 3 shows how these are typically
arranged.
1. Setup of equipment
1.1. Step 1
1.2. Step 2
1.2.1. Substep 1
1.2.2. Substep 2
1.2.2.1. Sub-substep 1
1.2.2.2. Sub-substep 2
1.3. Step 3
1.4. Step 4
If you are using this format, a caution is not to have more than
four levels of indentation—more than that can cause the reader to
become disoriented.
CHECKLISTS
In his book The Checklist Manifesto: How to Get Things Right, Atul
Gawande (2009) describes the very interesting history behind
checklists and what makes for a good one. He states, “Under
conditions of complexity, not only are checklists a help, they are
required for success” (p. 79). And, “Checklists can provide protection
against . . . elementary (simple) errors” (p. 50). Checklists are meant
to be tools for performance, not procedures or training guides.
Before induction of anaesthesia Before skin incision Before patient leaves operating room
(with at least nurse and anaesthetist) (with nurse, anaesthetist and surgeon) (with nurse, anaesthetist and surgeon)
Has the patient confirmed his/her identity, Confirm all team members have Nurse Verbally Confirms:
site, procedure, and consent? introduced themselves by name and role. The name of the procedure
Yes Confirm the patient’s name, procedure, Completion of instrument, sponge and needle
Is the site marked? and where the incision will be made. counts
Yes Has antibiotic prophylaxis been given within Specimen labelling (read specimen labels aloud,
the last 60 minutes? including patient name)
Not applicable Whether there are any equipment problems to be
Yes addressed
Is the anaesthesia machine and medication Not applicable
check complete? To Surgeon, Anaesthetist and Nurse:
Yes Anticipated Critical Events What are the key concerns for recovery and
Is the pulse oximeter on the patient and To Surgeon: management of this patient?
functioning? What are the critical or non-routine steps?
Yes How long will the case take?
Does the patient have a: What is the anticipated blood loss?
Known allergy? To Anaesthetist:
No Are there any patient-specific concerns?
Yes To Nursing Team:
Difficult airway or aspiration risk? Has sterility (including indicator results)
No been confirmed?
Yes, and equipment/assistance available Are there equipment issues or any concerns?
This checklist is not intended to be comprehensive. Additions and modifications to fit local practice are encouraged. Revised 1 / 2009 © WHO, 2009
CONCLUSION
Procedures are an important tool to document and communicate
information and requirements. The challenge is making them useful
tools that promote consistent, correct performance of the tasks.
Before changing procedures after an unwanted event, be sure that
the procedures caused or contributed to the deviation or that they
did not provide a mitigation strategy to protect the things of value.
Well-prepared checklists can help trained personnel perform their
tasks consistently, resulting in effective, efficient, and compliant
results.
REFERENCES
Canada Transportation Safety Board (1998) http://www.tsb.gc.ca/eng/
rapports-reports/aviation/1998/a98h0003/a98h0003.html. Accessed
3 Mar 2020.
14
TRAINING AS A CORRECTIVE
ACTION
221
LICENSED TO JOSE CASTELLA
• Keeps an open mind while doing an investigation and avoids biases that could
interfere with it.
• Identifies corrective actions that are aligned with the root, proximal, and
contributing causes.
• Writes clear investigation reports that are technically and grammatically correct.
• Can assess the criticality of an event and the need to inform leadership and
request needed resources.
Social learning
One of the most powerful ways that we learn is through social
learning—we learn by watching and listening to others. In the 1930s,
the Russian researcher Lev Vygotsky examined how infants learn
• S-OJT trainers who are selected in part because of the skills they
have in performing the task and in providing feedback and
coaching to the learner.
CONCLUSION
More often than not, training is given as a corrective action to a
quality event or deviation, particularly if a true or probable root
cause has not been found. Training is a powerful tool to improve
knowledge and skills if the investigation shows that they are lacking.
Otherwise, you will be wasting time and resources. If the problem is
performance related, solving it needs to consider human factors as
well as other contributors discussed in other chapters.
REFERENCES
Brown, J.S. and Duguid, P. (2000) The Social Life of Iinformation.
Boston, MA: Harvard Business School Press.
Deeks, T., ed. (2018) Phase Appropriate GMP for Biological Processes.
Bethesda, MD: PDA/DHI.
Marcus, G. (2012) Guitar Zero: The New Musician and the Science of
Learning. New York, NY: Penguin Press.
15
CORRECTIVE ACTION
EVALUATION AND EFFECTIVENESS
CHECKS
If you have identified the root cause(s) and put in place the
appropriate corrective actions, the next step is to have confidence
that your actions have been successful. Evaluation and effectiveness
checks are a way to do that. And, even if you show that the efforts
did not have the intended positive impact, you can use this as an
opportunity to gain information and knowledge.
235
LICENSED TO JOSE CASTELLA
Timing
As you consider your corrective action solution(s) to the problem,
you may decide that a one-time effectiveness check is sufficient or
instructor can ask questions that are listed on the S-OJT worksheet
such as:
• What are the things that can go wrong when performing this
task?
A CAUTION
As you think of metrics, you need to consider what inappropriate
behaviors they can potentially produce. For example, the head of a
manufacturing site had “decreased deviations” as one of his major
goals. Can you think of several ways this could be accomplished?
One might be fewer people reporting deviations. A metric of
closing out investigations within 30 days can result in inadequate
investigations that do not find the true root causes and that often
have “retrain personnel” as the principle corrective action.
CONCLUSION
Effectiveness checks provide confidence that the actions taken were
effective in correcting the underlying problem and preventing
a recurrence. There are a number of ways that this can be done
depending on the significance of the unwanted event and the
potential impact that the event could have on the product and
patients. A failed effectiveness check often means that the actual
root causes were not identified or that the actions taken were not
adequate. When several different methods are used at different time
points following the corrective actions, an organization can have
high level of confidence in the fixes that were applied.
REFERENCES
Kirkpatrick, D.L. (1994) Evaluating Training Programs: The Four Levels.
San Francisco, CA: Berrett-Koehler.
16
243
LICENSED TO JOSE CASTELLA
Audience Question
Area/department • What happened?
involved in incident • Why and how did it happen?
• If something was affected, can it be saved (reworked,
reprocessed)?
• Is there a disciplinary action that should be taken?
• What is the impact to the schedule/other products?
Management • What are the business implications of the incident?
• What does this mean for product availability?
• What are the compliance or regulatory implications?
• How bad was it?
• How will it be fixed?
Audience Question
Quality • Can products involved be released?
• What are the implications to other products and processes?
• What are the compliance or regulatory implications?
• What does this mean for product availability?
• Was the investigation adequate?
• Was the root cause found?
• Were scope and impact properly assessed?
Health authority • What does this mean for patient safety?
(inspectors) • Was the root cause found?
• If the cause wasn’t found, how will patients be protected if it
happens again?
• How might this affect drug availability?
• Is the process/facility (still) in a state of control?
• Were scope and impact properly assessed?
• Is this incident a recurrence or part of a trend?
• Are other products or lots in the market potentially affected
by this same issue?
Formats
There is no singular standard for what a report looks like. They can
and do vary in a number of ways, including the complexity of the
investigation and the type of system (paper-based or electronic)
used for documenting. For example, some firms (often smaller,
early start-ups) use paper-based forms for simple investigation
and narrative reports for more complex ones. Other firms use an
electronic system (TrackWise® is frequently seen; other database
options also exist) that has designated fields one fills in. Other
organizations use a combination—putting summaries into the fields
that come from the written report; a PDF version of the report is
uploaded into the application.
• No names are used: Only the roles are included (e.g., production
supervisor, laboratory analyst, materials mover). In some cases,
firms will have a confidential annotated copy that does have
names that can be used to link the incident with reviews of
training records, qualifications, and corrective actions.
• Unique identifiers are used: This can range from unique initials
of the person (e.g., GAB) or the whole or partial employee
number (e.g., X-1140).
• Full names: Firms that use this approach say that this is meant
to hold people responsible for their actions. This, however, does
seem to be placing blame on those at the “sharp end” of the
incident. (The use of identifiers like full name may be subject to
national labor laws or contracts.)
Word choice
When writing the report, one needs to be careful about what
words to use. Words that are imprecise or vague (“probably” or
“typically”) should not be used, or if they are, should have evidence
to support them. For example, saying “most probable root cause”
As you are writing, you are not just telling the story and providing
facts; you also need to manage the anxiety or fear of the reader. For
example, instead of saying “massive contamination,” provide a
quantitation of contamination: “30 to 40 percent of the batch showed
visible particulate matter.” Be aware that some reviewers have their
own list of “trigger words” that are to be avoided.
Spelling counts
Small details such as correct spelling, proper grammar, good
sentence structure and organization are important as they give
the reader a sense of the firm’s attention to detail. (If the report is
written in English by people for whom English is not the primary
Attachments
ǡʹͲͳͷ
LICENSED TO JOSE CASTELLA
inspectors can be satisfied with the summary and avoid the time of
reading through the whole report—a benefit to all involved.
What went wrong? Roof leak with water dripping into hallway.
Where? Building 23 hallway CA, leading to solid dosage
manufacturing offices.
When did it occur? Overnight (after 11 pm) 10 July 2019.
When was it discovered? 6 AM 11 July 2019.
Who discovered the event? Stan Smith (extension #7345).
What immediate action was S. Smith called plant services; barrier placed around
taken? water; personnel detoured from hallway.
Etc. . . .
CONCLUSION
Investigation reports, whether they are brief notes in a batch record
or longer, formal documents, need to provide enough details so one
can reconstruct the event and investigation, and see the alignment
between the event and the correction and the actions taken.
The report is not just a document to file; rather, it helps provide
knowledge to others. A good test of a report is asking if it will be
self-explanatory to a reader three or four years from now without
the assistance of the writer.
REFERENCE
ICH (2008) Pharmaceutical quality system – Q10. International
Conference on Harmonisation. https://database.ich.org/sites/
default/files/Q10_Guideline.pdf. Accessed 3 Mar 2020.
17
255
LICENSED TO JOSE CASTELLA
STATED REQUIREMENTS
Whether the documentation of the incident is performed using
an online tool or in a paper format, if is helpful if templates are
provided that guide the writers, reviewers, and approvers through
• What was the problem that the writer was intending to solve in
the report?
• What are the arguments the writer was making (e.g., root cause,
proximal cause)?
• What problem was the writer trying to solve? Did the author
solve it (e.g., corrections and corrective actions aligned with the
causes)? Are there problems that were not solved?
– Correctly informed?
– Uninformed?
– Misinformed?
– Illogical?
There are two particular practices that have been seen to make
reviews more effective and efficient. First, in can be helpful to have
a matrix or table of the roles of different reviewers and the points on
which they are to focus. This could also be in the form of a checklist
(see Table 1 and Table 2). Second, having a training session where
all reviewers and approvers together can look at sample reports
in order to “calibrate their eyeballs” can be effective in developing
consistency in reviews. To be most effective, these calibration
sessions need to be repeated periodically as new reviewers and
approvers join in the process.
GIVING FEEDBACK
If you are providing feedback to a writer, you need to remember that
most investigation report authors are not trained technical writers
and that they are learning their craft as they are writing the reports.
As a reviewer, you can provide comments that can strengthen both
the writer and the specific report or, if done incorrectly, can be
frustrating and disheartening.
• Look for the positive things in the report; provide at least one at
the start and at the conclusion of your feedback session.
RECEIVING FEEDBACK
Getting feedback on an investigation report (or other piece of writing)
is not always that pleasant, even if the reviewer is professional and
courteous. The most important thing is realizing that their feedback
is not about you, it is the report that they are commenting upon.
In actuality, their goal is similar to yours—to have a report that is
complete, clear, correct, and that can successfully withstand a critical
review by an auditor or inspector.
• Ask for the rationale—be sure you understand the reason for
their change. Is this something that may be new to you or comes
from the reviewer’s experience?
CHURNING METRICS
If you are managing the investigation/CAPA process, it can be
useful to track the number of cycles (or churns) that it takes to go
from a draft report to the final, approvable one. For a short report,
one or sometimes two cycles is typical; for a much longer, more
extensive investigation report, one can expect to see another one
or two cycles. If you see that there are multiple cycles, you need
to understand why. Not only does this consume the time of the
reviewers and writer, but it can also delay decisions about the lot or
lots in question. While these lots are on hold, work-in-process (WIP)
inventory charges are being assessed. The sooner a decision can be
made about the affected lots, the lower the WIP charges.
CONCLUSION
Reviews of a draft report by subject matter experts and leadership
help ensure the report is on target and answers questions that might
be asked about the situation in the future by auditors and inspectors.
Reviews are meant to improve the report. Clarity, accuracy, and
completeness are important to consider, as well as that the report is
as long as needed but no longer. Approval, including a final sign-
off by the Quality Unit, means that the report has been accepted by
the organization and the organization will implement the report’s
recommendations.
REFERENCES
FDA (2019) Part 211 Current good manufacturing practice for
finished pharmaceuticals. Code of Federal Regulations Title 21.
https://www.accessdata.fda.gov/scripts/cdrh/cfdocs/cfcfr/CFRSearch.
cfm?CFRPart=211. Accessed 4 Mar 2020.
18
COMMUNICATION
267
LICENSED TO JOSE CASTELLA
CONCLUSION
Communicating information about a quality event and the related
impacts and risks is part of knowledge management. It not only builds
awareness of such events among stakeholders, it also contributes
to transparency within an organization. Beyond simple awareness,
communicating problems can also provide direction to personnel
on how they can prevent and respond should circumstances arise.
REFERENCES
EMA (undated) Quality defects and recalls. https://www.ema.europa.
eu/en/human-regulatory/post-authorisation/compliance/quality-
defects-recalls#reporting-obligations-section. Accessed 3 Mar 2020.
19
273
LICENSED TO JOSE CASTELLA
How then can we learn from our errors and failures if we’ve
grown up being told mistakes are bad? How do our organizations
extract some sort of value from deviations and quality events? This
chapter presents several different approaches to learning from
mistakes that are used in a variety of organizations.
AFTER-ACTION REVIEWS
When analyzing failures as part of the learning from mistakes,
the organization is trying to make sense or find meaning based on
the quality event or deviation that has occurred. This can be done
during reflection when people carefully consider an event, action, or
decision and gain personal insights.
3. Was there a difference between what was planned and the actual
results? Why? Why not?
2. Share the results with others in the organization. What were the
essential elements of value that have broad applicability?
CONCLUSION
To look at life, one can say that everything we do is some sort of an
experiment. We often get the intended results, but now and then,
particularly when there may be information or experience lacking,
the outcome is less than what we were hoping for. To continually
improve—a characteristic of every quality program—it is essential
to continually learn. Having leadership that sees the value of this and
encourages this as a daily practice can find value even in failures.
REFERENCES
Birkinshaw, J. and Haas, M. (2016) Increase your return on failure.
Harvard Business Review, May 2016, pp. 88–93.
20
MANAGEMENT RESPONSIBILITIES
283
LICENSED TO JOSE CASTELLA
2. MANAGEMENT RESPONSIBILITY
Leadership is essential to establish and maintain a company-wide
commitment to quality and for the performance of the pharmaceutical
quality system.
Root Cause
A basic repeat deviation
monitoring program
Deviation investigations Ineffective human error Effective human error Human factors and HE HE & HP champions/
frequently attribute Human prevention program, program and improvement prevention is mandated practitioners across
Error (HE) as root cause metrics/monitoring in place initiatives in place and when designing a process organization
Lacking basic knowledge of but show little or no metrics/ monitoring show Evidence of Associate‐ Associates proactively use
HE prevention improvements limited improvement initiated HE reduction HE prevention tools
CAPA frequently involves Formal training in human initiatives Demonstrate sustained
just re‐training error and human factor All relevant Associates reduction of HE deviations
concepts (e.g. Kaizen, 5S, 6 trained on HE/Human over a number of years
Human Error
blocks, etc.) Performance (HP)
Prioritization of error
reduction activities based on
Table 1. Range of root cause and human error that can be observed as
Using a tool like this can help leadership determine where they
5/29/2020 10:56:24 AM
Licensed to Enger, Tehya/PDA: Copying and Distribution Prohibited.
Management Responsibilities 289
CONCLUSION
Having a successful, robust program for investigations and
corrective actions requires that leadership have a comprehensive
understanding on how and why failures can occur and continually
look for ways to improve. Leaders need to not only set and
communicate the vision to others in the organization, but help
eliminate barriers and provide a variety of resources to get there.
REFERENCES
Economist (2014) Why Sweden has so few road deaths. The Economist,
26 Feb 2014.
Appendix 1
DEFINITIONS
Accident An adverse outcome that was not caused by chance or fate. Most
accidents and their contributing factors are predictable and the
probability of their occurrence may be reduced through system
improvement (Kartoglu, 2015).
CAPA A system for implementing corrective actions and preventive
(corrective actions resulting from the investigation of complaints, product
action/preventive rejections, nonconformances, recalls, deviations, audits, regulatory
action) system inspections and findings, and trends from process performance and
product quality monitoring (ICH Q10).
Common cause Fluctuation caused by unknown factors resulting in a steady but
variation random distribution of output around the mean or average of the
attribute being measured. It is considered the “noise” in a system,
process, activity, or situation; it is the natural pattern of data.
Contributing A factor, situation, or agent that accelerates or intensifies the
cause occurrence of the unwanted event. If the contributing cause is
removed, it does not prevent the unwanted event from occurring.
Control Actions taken to reduce risk by reducing the chances of the
unwanted event occurring; sometimes called prevention.
Control strategy A planned set of controls, derived from current product and
process understanding, that assures process performance
and product quality. The controls can include parameters and
attributes related to drug substance and drug product materials
and components, facility and equipment operating conditions, in-
process controls, finished product specifications, and the associated
methods and frequency of monitoring and control (ICH Q10).
291
LICENSED TO JOSE CASTELLA
Proximal cause The event that causes or sets off a series of events that results in
the symptom. “The straw that broke the camel’s back.”
Qualification Action of proving and documenting that equipment or ancillary
systems are properly installed, work correctly, and actually lead
to the expected results. Qualification is part of validation, but
the individual qualification steps alone do not constitute process
validation (ICH Q7).
Quality event An unwanted result or situation that requires some sort of
response, usually defined by a procedure. It is the umbrella category
that would include excursions, nonconformances, deviations, out-of-
specification results, and accidents.
Residual risk Risk remaining after risk treatment measures (risk control and/or
mitigation) have been taken.
Risk The combination of the probability (likelihood) of occurrence of
harm and the severity (impact) of that harm (ICH Q9).
Risk treatment Activities taken to reduce risk. Typically includes control, mitigation,
and preparation.
Root cause Causal factor that, if corrected, would prevent recurrence of the
same or similar accidents. Root causes are the specific underlying
causes, can be reasonably identified, are under the control of
management to fix, and effective recommendations can be
developed to correct/prevent them (adapted from Rooney and
Vanden Heuvel, 2004).
Severity A relative measure of the impact of the unwanted event’s
consequence(s) on the thing of value.
Special cause A factor that has changed the system, process, activity, or situation.
variation It can be due to a new or neglected factor that has, for some
reason, appeared or emerged. Special cause variation is the “signal”
that is expressed.
Specification A list of tests, references to analytical procedures, and appropriate
acceptance criteria that are numerical limits, ranges, or other
criteria for the test described. It establishes the set of criteria to
which a material should conform to be considered acceptable for
its intended use. “Conformance to specification” means that the
material, when tested according to the listed analytical procedures,
will meet the listed acceptance criteria (ICH Q7).
State of control A condition in which the set of controls consistently provides
assurance of continued process performance and product quality
(ICH Q10).
Symptom Circumstances, events, or conditions that indicate a problem
situation exists or has occurred.
Validation A documented program that provides a high degree of assurance
that a specific process, method, or system will consistently produce
a result meeting predetermined acceptance criteria (ICH Q7).
REFERENCES
GHTF (2010) Quality management system – medical devices –
guidance on corrective action and preventive action and related
QMS processes. Global Harmonisation Task Force. http://www.
imdrf.org/docs/ghtf/final/sg3/technical-docs/ghtf-sg3-n18-2010-qms-
guidance-on-corrective-preventative-action-101104.pdf. Accessed
27 Apr 2020. NOTE: The work of the now-defunct Global
Harmonisation Task Force became part of the International
Medical Device Regulators Forum (IMDRF) in 2011.
ICH Q7 (2000) Good manufacturing practice guide for active
pharmaceutical ingredients. https://database.ich.org/sites/default/
files/Q7%20Guideline.pdf.
ICH Q8R2 (2009) Pharmaceutical development (revision 2). https://
database.ich.org/sites/default/files/Q8%28R2%29%20Guideline.pdf.
ICH Q9 (2005) Quality risk management. https://database.ich.org/sites/
default/files/Q9%20Guideline.pdf.
ICH Q10 (2008) Pharmaceutical Quality System. https://database.ich.
org/sites/default/files/Q10_Guideline.pdf.
Kartoglu, U. (2015) Pharmaceutical and vaccine quality illustrated.
Freely available at www.kartoglu.ch. Accessed 26 Mar 2020.
Rooney, J. and VandenHeuvel, L. (2004) Root cause analysis for
beginners. Quality Progress, 37(7).
Appendix 2
INCIDENT INVESTIGATOR’S
WORKSHEET
295
LICENSED TO JOSE CASTELLA
Where did it happen? Be as specific and descriptive as possible (e.g., room name,
location within room, interior or exterior of equipment, step of the
process).
When was it observed? The observation of the incident could be sometime after it
happened.
How was it observed This would explain how the incident was discovered (e.g., by
or discovered? chance, as part of a standard inspection practice).
Who observed or Name, phone (important for interviewing and follow-up questions),
discovered it? and title.
What was the first Identify the immediate actions taken (if any) to stop the problem
thing done when from getting worse or from affecting more product.
the incident was
discovered?
Who “owns” the Name, phone (important for interviewing and follow-up questions),
process? and title of the process owner.
What were the Identify the relevant environmental conditions (e.g., temperature,
conditions at the time humidity) or data from environmental monitoring (e.g., viable,
of the event? nonviable particulates).
What procedure was If a procedure was used, what was its name/number and revision
involved? number? What step or substep in the procedure was used?
Was anything different Identify what may have been different—first batch made after a
with this event shutdown, changes in a batch record, etc.
compared to similar or
“normal” situations?
What is your plan State the documents that will be examined, the people to be
in conducting the interviewed, the records to be reviewed, etc.This becomes a
investigation? “checklist”; it will be expected that each item listed here is
commented on in the report. Usually, the following items (2A–2D)
will be included in your review.
Has this—or something This is used to determine if this event is an isolated event or one
very similar—happened that has occurred before. If it is recurring event, you will need to
before? explain why.
What are the review Use a relevant time period based on the number of events, cycle
parameters used in (e.g., calibration period). Consider other places that the material,
your search? What equipment, or test may have been used. Look-backs should be at
was your rationale for least one year (or more if the opportunity for the unwanted event
the selecting the time rarely occurs).
period you examined?
If you didn’t find an Sometimes, for a one-time event, you will not be able identify the
assignable or most reason.This may be because of a lack of information.You need to be
probable cause, do you able to give evidence that you reasonably tried to discover the cause.
have a reason why? Maybe you will have an idea that will make it easier next time to solve
the problem if it happens again. If so, you can include it here.
What were the Identify the process you used to come to these conclusions—for
methods used in the example, by using a fishbone diagram or cause mapping process.
investigation? In some simple, obvious cases, the method may have been
through direct observation.
Part 5. Corrections
What must be done to Corrections are actions taken to fix the materials, equipment, or
the items affected by the process or potentially “rework” the material. If a satisfactory “fix”
incident to allow their cannot be made, an intermediate or product may be rejected and
release or use? destroyed.
How do you know Provide your justification for the correction you are recommending.
the corrections will be Your justification needs to logically and scientifically relate to the
adequate? impact.
How do you know the Provide your justification for the preventive action(s) you are
preventive action(s) will recommending.Your justification needs to logically and scientifically
be adequate? relate to the impact and risk.
When will these Identify the time frame and who (by name, phone number, title) is
preventive actions overseeing the activities.
be completed? Who
is responsible for
overseeing these
actions?
Part 9. Communication
Who should know Consider if there needs to be a report of this incident to
about this incident regulatory authorities. Also, should it be escalated to senior
and its outcome? management? Should other people or sites be notified so they can
Those involved? Other take preventive actions?
sites? Management?
Regulatory officials?
INDEX
303
LICENSED TO JOSE CASTELLA
risk 3, 7, 13, 14, 17, 19, 23, 26, 27, 29, single-event model 94, 95
34–36, 38, 50, 59, 63, 64, 70, 73, situational awareness 7, 67, 68, 69
77, 78, 80–86, 88–90, 104, 105, situational factors 122
114, 150, 178, 183, 188, 194, 195, skill-based errors 121
197, 200, 201, 211, 215, 243, 249, Skinner, B.F. 115
250, 270, 271, 274, 275, 287 slips 109, 121
risk analysis 80, 104 social learning 224, 227, 228
risk-based thinking 34, 35, 77, 83, 88, 90, special cause variation 70, 71
178, 194, 195, 215 spelling 45, 61, 208, 247
risk evaluation 70, 81, 82 sterile cockpit rule 129
risk matrix 81 stories 2, 227, 273
risk reduction 70, 81, 82 structural stage 257
risk review 83 structured on-the-job training (S-OJT)
risk treatment 81 32, 225, 228–230, 240, 241
role of leadership 280 structured on-the-job training (S-OJT)
root cause 8, 15, 16, 20, 21, 23, 45, 57, learning guide 32, 225, 228, 240
107, 108, 111, 119, 133, 137–139, subject matter expert (SME) 33, 36, 38,
146, 147, 149, 150, 157, 162, 182, 138
199, 232, 235, 236, 238, 239, 246, substitution 58, 184
248, 257, 259, 283, 285–287 summative evaluation 235, 236
root cause analysis 21, 137, 149, 157 supervisory violations 124
sabotage 114, 122 sustainability 202, 240
Sandle, Tim 13 Sweden 285
scope 21, 37, 38, 41, 45, 51, 52, 54, 57, Swiss cheese model 193
58, 61, 80, 88, 179, 236, 238, 239, symptom 63, 64, 73, 146, 150, 153, 160,
246, 251, 267–269 162, 236, 239, 248
selecting the team 41 systems accident 100, 102
selection bias 138, 155 tacit knowledge 118, 222, 223, 274
sharp end 109, 111, 133, 246 timelines 56, 144, 162
shortcuts 122, 124, 191, 241, 287 time pressure 3
should be 18–23, 25–27, 31, 37, 41, 42, to err is human 5, 115
49, 51, 54, 58, 61, 64, 65, 70, 78, tool selection 162
81–83, 88, 109, 110, 113, 157, Toyota Motor Company 186
160, 162, 178, 184, 191, 200, 208, training 5, 31, 32, 36, 37, 44, 45, 58, 61,
212, 214, 216, 221, 230, 236, 238, 80, 83, 96, 110, 111, 116, 118, 119,
241, 245, 247–249, 251, 256–268, 123–126, 133, 148, 190, 191, 199,
271, 287 202, 205, 207, 208, 211, 212, 216,
signal 42, 68, 69, 70, 72, 270 221, 222, 224–230, 232, 239–241,
signal detection 69, 70 246, 248, 258, 267, 275, 287