Agile AI Full
Agile AI Full
Agile AI Full
m
pl
im
en
ts
of
Agile AI
A Practical Guide to Building AI
Applications and Teams
REPORT
Table of Contents
2. Understanding AI Tools. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Contrasting Machine Learning and AI 13
The Role of Open Source in Innovation 15
Tooling 17
The Fundamentals of Machine Learning Projects 19
The Machine Learning Life Cycle 21
Distributed Workloads and Hybrid Environments 24
Summary 26
3. AI Skills. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
Understanding the Skills and Culture 28
Team Skills Assessment 29
Core Skills 31
Building Teams 37
4. Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Use Cases 41
Data 42
Tools and Process 43
iii
Mindset 43
Integration and Trust 44
Conclusion 45
iv | Table of Contents
CHAPTER 1
Introduction: Agile AI
Processes and Outcomes
1 McKinsey & Company, “AI Adoption Advances, but Foundational Barriers Remain,”
November 2018; T. Fountaine, B. McCarthy, T. Saleh, “Building the AI Powered Orga‐
nization”, Harvard Business Review (2019); S. Ransbotham, D. Kiron, P. Gerbert, M.
Reeves, “Reshaping Business with Artificial Intelligence”, MIT Sloan Management
Review (2017).
1
Why are project failure rates so high? There are three areas in which
things can go wrong (see Figure 1-1):
Skills
First, high skill levels are needed to harness AI.
Process
Every use case can be developed in a different way. There is no
blueprint for developing AI applications and integrating them
into your current business workflow.
Tools
There are hundreds of open source and proprietary AI tools.
Each can have hidden limitations and costs. There is no guaran‐
tee of success.
Skills, processes, and tools are like a three-legged stool: if just one of
the three legs fails, the stool falls over. With this in mind, this book
offers an agile AI approach for business that will allow you to inno‐
vate quickly and reduce your risk of failure.
Beyond R&D
The most effective campaigns to achieve AI at enterprise scale tend
to focus related projects on business objectives. Let’s compare the
different ways companies are approaching AI to help us understand
data science compared to R&D.
If machine learning is the language behind AI, statistics are the
grammar of that language. Enterprises have long had analysts
reviewing historical data in search of trends using statistical princi‐
ples that are useful for reports and experimentation. Those reports
and experiments are one function of advising the business, and busi‐
ness analysts have achieved great success utilizing them, but often
the report is the final—or only—outcome of that wider analysis.
Predictive capabilities for AI, on the other hand, must be wrapped
into broader application deployments and made available.
By comparison, when they participate in conversations about AI,
many enterprises think only of their research and development
function. For line-of-business leaders, AI can feel unachievable, so
they might believe that whatever comes out of the research organi‐
zation will be the closest thing to AI that their enterprise will ach‐
ieve. Predictive algorithms are improving all the time, thanks to
R&D, and many organizations, from IBM to McKinsey, to Google,
have excellent research functions to do just that. AI, however, is not
the improvement of a function; rather, it’s the broad and strategic
application of a set of predictive capabilities designed up front to solve
difficult business problems.
As we look to drive business outcomes, IBM data science teams
embed with client teams working to solve the business problems at
hand, not their R&D functions. As we build out their AI capabilities,
we design our applications to operationalize the models from the
very beginning. It is common for enterprises, and our clients, to
struggle to make substantive use of their predictive solutions
because they don’t begin their data science exercise with the inten‐
tion of consuming the models they’ve spent many hours training. If
enterprises are to achieve AI or the ability to apply predictive
Organizing for AI
We frequently hear from technical leaders when they struggle with
how to best organize the talent within their teams to enable AI
development to thrive. This is always a revelatory conversation.
First of all, when the IBM DSE team begins any engagement, we
mandate that each client bring data scientists from within their
organization to the table. We strive to help grow skilled data science
practitioners; if we didn’t, each engagement would fail as soon as
IBM left. Those data scientists can then serve as Centers of Excellence
for other groups or lines of business needing to build AI solutions. It
was never intentionally decided that the DSE team would help lead
enterprises in this direction, but ultimately, we help clients grow
their data science practice. Clients ask all the time about how best to
scale data science in their enterprise. Our answer is to build the
appropriate organizational structure. Companies can innovate with
separate pockets of data scientists serving different purposes in a
decentralized model, but the best model is a hub-and-spoke model.
The hub serves as a Center of Excellence focused on supporting all
parts of your company, whereas the spokes are located in each line
of business focused on business problems.
Organizing for AI | 7
Data Scientists
For each use case you will implement, each project team needs a
variety of skills. First, the data scientists tend to be generalists with a
toolbox, which allows them to serve as service centers. The princi‐
ples of their practice endow them with comfort in handling many
different sets and types of data and applying best practices when it
comes to cleaning, preparing, and understanding that data. They are
mandated to maintain an understanding of the state of the art in the
field, but as it applies broadly to data science.
Data Engineers
Data engineers are masters of the data pipeline, and they’re essential
for owning the data cleaning and architecture that data scientists
depend upon. They also don’t pray to any singular type of data, so
their skill sets generalize well to teams in a broad sense.
Business Analysts
By contrast, business analysts are often tied to business units and are
experts at the intricacies and practices within the field of concern of
that unit. They know data within their field well, and when a data
scientist doesn’t know what a specific feature means, they turn to the
business analyst. When a data scientist doesn’t understand a prob‐
lem in any specific sense, it would likely be the business analyst to
whom they turn.
There is a naturally cohesive partnership between the data engineers
and data scientists, with transferable skills across many teams,
whereas business analysts tend to remain embedded and act as a sta‐
tionary Center of Excellence. Teams that are set up to build applica‐
tions with full modeling and data science pipelines might think each
business unit requires its own set of data engineers and scientists,
but in the enterprise that tends to be overkill, especially when all
enterprises are competing for relatively scarce data scientists and
data engineers.
In summary, there are a few essential conditions for the hub-and-
spoke model to work. The business analysts should know the busi‐
ness problem as well as the data, in detail. The data scientists should
have the know-how to dissect that problem and data, and model an
approach. The data engineers can then help with the data pipeline,
Summary
The IBM DSE team built a unique Agile process for our clients’
projects, and in this book, we draw from those experiences to show
you how your teams can succeed with an Agile approach to AI.
Legendary science-fiction author Arthur C. Clarke famously wrote
that “any sufficiently advanced technology is indistinguishable from
magic.” Despite what you might have read, AI is not magic. You
really need to understand this technology in order for your teams to
harness it. This book is a chance for us to share the stories and key
practices of our successful client engagements to help you succeed in
your Agile AI practices.
Summary | 11
CHAPTER 2
Understanding AI Tools
13
Wissner-Gross showed that the mean time between a new machine
learning algorithm being published and its use in an AI break‐
through was 18 years; however, the mean time between the required
datasets becoming available and those AI breakthroughs was 3 years.
Machine learning without the necessary data and use cases is merely
a pile of nuts and bolts waiting to be built into something useful.
Nonetheless, machine learning is about learning from data, not
about writing code, and that represents a fundamental difference
from previous software engineering practices.
Artificial intelligence is where the uses of machine learning begin to
affect social systems. That can be where machine learning models
perform tasks with human-like levels of proficiency, such as con‐
verting speech to text or recognizing tumors from radiology images.
Even more likely, it can be where machine learning models augment
teams of people, such as flagging potential credit card fraud that
customer service agents need to explore further. Or it can be where
machine learning is used for wildly nonhuman tasks such as making
a computer science student appear on video to be able to dance as
well as a professional ballerina—see “Everybody Dance Now” by
Caroline Chan et al.—also known as deep fakes. In each case, the AI
applications have effects on groups of people, and that’s how we
measure their impact.
Note that we also talk about data science here. In general, we see
data science teams, which tend to use tools for machine learning, and
might develop artificial intelligence applications. When we’re talking
about expert teams, we describe those as data science teams.
To use a Star Wars analogy, R2-D2 is a robot that takes in data,
learns, and makes decisions, but he was built to interface with an X-
wing star fighter, aiding as a copilot. Think of R2-D2 as machine
learning. C-3P0 is also a robot that takes in data, learns, and make
decisions, but he can speak and is programmed to understand mil‐
lions of languages. He aids humans, but in more general ways than
R2-D2. C-3P0 can replicate human intelligence but in many cases
exceeds human performance. Think of C-3P0 as artificial intelli‐
gence. What about deep learning? Think of Yoda, who operates at a
deeper level using the force (neural networks). This is mostly a joke,
but it’s an easy way to describe the relationship between machine
learning and AI.
This is not an exhaustive list of tooling; it’s a list of tools rather than
specific data science packages. In other words, we’ve identified that
Jupyter notebooks are important as a tool without qualifying that, to
work efficiently, a data science team will need to depend on libraries
and frameworks like TensorFlow or scikit-learn. Additionally,
Python and R are commonly used languages, and each has a variety
of usable workbenches. Besides Jupyter Notebook, graphical user
interfaces (GUIs) are commonly used, such as PyCharm for Python
and R-Studio for R. A notebook, however, is entirely sufficient for
executing exploratory data analysis (EDA), model training, and
model evaluation.
Data science in large part deals with cleaning and evaluating large
stores of data. Data science as a discipline took off in popularity
concurrent with advances in capabilities to store and process data—
namely, Hadoop Distributed File System (HDFS) and map-reduce
as the storage and processing mechanisms, respectively. As it
became more economical to store and evaluate data, the practice of
Tooling | 17
utilizing it matured. In light of that, the open source solutions for
terabyte-scale data persistence will likely revolve around HDFS and
what many term Hadoop’s successor: Spark. Spark differs from
Hadoop in that it has figured out how to load data, at scale, into
memory for processing. By comparison, Hadoop depends on data
read from storage. The difference in the upper bounds of processing
speed, in comparing data read storage as compared to data
read from memory, are drastic. Spark’s gains in computational speed
have buoyed its recent popularity and made it an essential part of
the open source data science stack.
Although Hadoop and Spark are used extensively in AI projects, tra‐
ditional relational database management systems (RDBMSs) are still
relevant today for data science. Many organizations keep their trus‐
ted data in these managed systems, and data scientists perform
extensive structured query language (SQL) commands to extract the
information they need.
After we’ve built a machine learning model, it must be deployed for
the intended use case. Sometimes, that involves deploying a model
as a microservice and scaling to meet customer demand. That might
involve Kubernetes, autoscaling, load balancing, and other network
engineering work. Other times, we must run a machine learning
model directly on a smartphone and other mobile devices. For that,
we need to use model compression before loading the model into the
embedded hardware. In either case, there are tools for managing the
model deployment and then monitoring model performance in
production.
A new field is emerging, called MLOps, for the operations work
required to manage the full end-to-end machine learning life cycle.
In other words, the definition of operations has had to change to
meet the needs of machine learning. Also note that the AI space is
replete with vendors attempting to own differing segments or steps
in the tooling. A lot of companies have perspectives on the full pipe‐
line, and solutions to accommodate storage, model training, and
model evaluation. An evaluation of vendors is beyond the scope of
this ebook; here, we simply offer our perspective on the minimum
required to staff and build in an AI pipeline.
Supervised Learning
Supervised learning is a subset of machine learning in which your
training data has a target feature (also called a label) that you’re try‐
ing to predict. As an example, if we were trying to create a model to
predict the balance in a client’s bank account, the dollar balance
would be the target variable. Models match input values to output
values based on historical information, and learn patterns within the
dataset to most closely approximate the labeled value when making
a prediction. There are many applications of supervised learning as
well as many models within supervised learning to help achieve
your modeling goals. Supervised learning models include linear
regression, logistic regression, decision trees, random forests, and boos‐
ted trees, to name just a few. Whenever your data has a target value,
you are going to use supervised learning to predict your target
outcome.
Unsupervised Learning
Unsupervised learning is the subset of machine learning in which
your training data does not have a label for you to predict, and so
your goal is to model likeness between the different training exam‐
ples. Unsupervised models find patterns of similarity across the
training samples, and attempt to cluster them together. In grouping
data points together, we can learn a lot about their likeness and nat‐
ural belonging. We can use unsupervised learning in applications
that need capabilities like document/record classification, market
segmentation, and data compression. Consider the example of clas‐
sifying handwritten digits. Each training row would represent a vec‐
Deep Learning
Figure 2-2 depicts how deep learning, based on artificial neural net‐
works, is part of the broader scope of machine learning methods.
Deep learning can be used as supervised learning or unsupervised
learning—or even a newly emerging category of self-supervised
learning that’s beginning to be used to describe generative adversarial
networks (GANs).
The main point is that deep learning architectures have several lay‐
ers of neural networks, which in practice causes certain properties
and behaviors. For example, deep learning models can train without
“saturating” such that millions of data examples can be used—unlike
earlier forms of machine learning. This, in turn, allows deep learn‐
ing models greatly increased accuracy, as observed in use cases for
computer vision, speech recognition, natural language, and so on.
A good way to understand approximately how deep learning works
is to consider the layers in an analogy for human vision. People have
retinas on the back of their eyes. The retina has rods and cones that
detect contrast and colors. At this lowest layer, human neural net‐
works are taking in raw data. Then as the neurons connect further
and further into the brain, processing progresses from raw input to
higher-level concepts and actions: person, clown, danger, run! The
Training
In the training part of the process, a data scientist takes their dataset
and evaluates multiple models to identify which is the most effective
for their predictive needs. Some models are easier to evaluate and
understand, and offer greater parsimony: consider (regularized) lin‐
ear regression. Some models are more effective at gathering predic‐
tive signal, and more complex to explain or implement: consider
boosted trees.
Deployment
The second part of the data science life cycle is deployment, wherein
the trained model must be made consumable. Data science and
building AI solutions is a powerful practice, but of little to no value
Running
The final step in the data science life cycle is running your model to
evaluate results, and that’s where your application is made consuma‐
ble. Traditionally in this step, a DevOps team takes the software
application that the data engineers and data scientists have built and
creates a standard for deploying it and making it consumable in
applications and in production. Software applications require main‐
tainability and scalability. Applications are only as successful as they
are easy to deploy and maintain.
Summary
This chapter covered the fundamentals of data science and AI tools
as well as provided an outline of possible trajectories to expect as
these projects become more and more commoditized. You, as a part
of a leadership team, have been armed with the knowledge of the
terms and practices your teams will need as your enterprise aims to
incorporate AI into its decision-making.
How can you assess the skills you need for a data science project? In
some regard, no matter how agile your team is during your science
project’s duration, it won’t matter if you don’t have a minimal apti‐
tude skill set within your team. In this chapter, we look at the core
skills in data science and effective practices for building a data sci‐
ence team and nurturing a supportive culture.
There are a few roles that are almost always found on data science
teams in industry. (Of course, some of the following descriptions
might change, depending on the needs of a business vertical or spe‐
cific focus.) We talk about these in general terms as follows:
Domain expertise
Understanding the business needs and nuances within it; for
example, regulatory compliance about data privacy if you work
in health care.
Data science
This is where the science and math come in—can you prove
insights about the business based on advanced analysis of its
data?
Coding
Machine learning models need to be integrated into application
software, and that requires programming.
27
Systems
Data science tends to rely on lots of compute resources and
requires people who are proficient with data engineering, dis‐
tributed systems, and high-performance computing.
These typical roles indicate what kinds of skills are needed on a data
science team. Before we dive into specifics, let’s first take a look at
some of the history that led to data science.
28 | Chapter 3: AI Skills
Today we take those definitions as givens. Successful data science
teams are diverse by nature. Because no single person is likely to be
an expert in several fields—computation, data management, applied
math, business use cases, decision science, and so on—we build
teams in which people with different skill sets and perspectives
collaborate.
30 | Chapter 3: AI Skills
components into production systems. Perhaps your organization
has a separate data engineering team, but requires that you embed
application developers to work side by side with data scientists. For
another example, in some verticals, such as finance, compliance
requirements and other domain expertise can become quite com‐
plex, so you might need Domain Expertise as a column. Adjust the
matrix to suit your project needs.
Core Skills
Let’s look through the core skills for data science, considering each
potential stage of a project.
Core Skills | 31
Data Preparation
Much of the work performed by data science teams focuses on
cleaning and preparing data. When an organization uses machine
learning, core business value creation comes through a well-known
set of activities: continually improving data sources; resolving data
quality issues; finding better means for feature engineering; digging
into the edge cases with data that might introduce risks through
bias, privacy, security, or ethics issues; and so on.
By analogy, if you worked at an investment fund where most busi‐
ness value creation came from carefully managing the portfolio of
investments, most of the time spent by your organization would
probably be focused on curating that portfolio. Data is a similar
game. It would be impossible for a business to get all of those data
sources 100% correct on the first try. Instead, much of that work
depends on interacting with customers and vendors and arriving at
a better understanding of the data and use cases over time—similar
to managing a portfolio of investments. It makes sense that so much
data science activity is focused on data preparation.
Important characteristics for data preparation, which should be
essential for almost everyone on a data science team, include the
following:
32 | Chapter 3: AI Skills
Coding Chops
Overall, data preparation is a process that needs to be repeatable, so
programming skills are vital. Programming languages such as
Python and R have become hugely popular for data science work.
Some of the big data distributed frameworks can be implemented in
Java or Scala (such as Apache Spark) or in C++ (such as Tensor‐
Flow), so having familiarity with a range of programming languages
can be important. Not everyone on a data science team needs to be
fluent with every programming language needed; this reinforces the
need for skill diversity.
Along with the programming languages, adjacent skills in version‐
ing (Git), debugging and issue tracking, performance analysis, and
other troubleshooting are important too. Probably the best skill is to
know how to find answers when you become stuck on a program‐
ming problem by pairing with other people on your team, searching
online, or researching previous work.
Most important, developing processes so that data preparation
workflows can become reproducible is critical for the “science”
aspect of data science. Using tools such as Jupyter Notebook is a
great way to help make data preparation workflows reproducible
while also helping make results more understandable by stakehold‐
ers. Collaboration based on Jupyter often implies running atop
Docker containers and Kubernetes clusters as well as other system
level skills such as Bash shell commands and other Linux use.
Core Skills | 33
in the data. The business concerns addressed by data science work
typically involve multidimensional problems. For example, an anti-
fraud model might need to limit credit card charge-backs, while also
considering the customer support costs for false positives, guarding
against bias toward particular subgroups of customers, and
minimizing computational infrastructure costs. Visualization is a
great way to explore multidimensional data, particularly for explor‐
ing the complex “performance” measures of machine learning mod‐
els. Stakeholders might be able to spot effects in the data by looking
at visualizations, over and beyond what the data science team recog‐
nizes. The skill sets involved in data visualization include using
libraries, data storytelling, and frontend programming skills.
Many data science projects don’t need to go beyond this point, after
stakeholders gain the insights needed. It can help to have people
who understand how stakeholders will use the inputs from the data
science team. For example, having familiarity with decision science
is one way to bridge between data science work and executive
decision-making.
Data Storytelling
Data storytelling is a way to communicate effectively with data.
There are many tools to help generate charts from data, although
those tools alone aren’t enough. The trouble is that people struggle
with understanding a collection of facts: clutter, attention span,
high dimensionality, cognitive overload—these aspects interfere.
Instead, people tend to learn through stories. Although it’s relatively
easy to generate a complex chart from data, it’s also easy to generate
charts that people will not readily grasp—or worse, ones that lead
to misconceptions.
In data science, we talk about how much time is dedicated to data
preparation, and, of course, much effort goes into the analysis to
reveal insights. However, the “last mile” of converting insights into
actions is the most crucial step. By creating a compelling story,
stakeholders will recognize what actions are needed. Anticipate
common questions and address those at the outset in your presen‐
tation. Better yet, create elements of narrative from your analysis,
then work side by side with stakeholders to help them craft the
story to be told.
For more about data storytelling, we recommend the following:
34 | Chapter 3: AI Skills
• Storytelling with Data: A Data Visualization Guide for Business
Professionals by Cole Nussbaumer Knaflic (O’Reilly)
• The collected works of Edward Tufte, available through
Graphic Press LLC
Platform Engineering
Whether your team is working on data preparation, discovery, data
visualization, modeling, or other areas, working with data requires
lots of computing resources. Some aspects are compute-intensive,
others demand lots of memory, still others need network band‐
width, and there’s always the issue of data storage. Making efficient
use of computing resources is crucial for the team to be effective.
Some organizations call this kind of work platform engineering. Oth‐
ers categorize it as site reliability engineering (SRE), DevOps, or
data engineering. In any case, you’ll probably be using some dis‐
tributed systems such as Apache Spark or TensorFlow, perhaps some
cloud services, perhaps Docker and Kubernetes. Having people with
engineering skills to use these systems and develop platform capa‐
bilities for the rest of the team is essential.
Feature Engineering
Machine learning models need training data, at least for supervised
learning, which is most commonly used. Data preparation will han‐
dle most of the work to develop training sets, but there are impor‐
tant nuances. Machine learning models build upon features in the
data. Your project might need to iterate on feature engineering to
train effective models, not simply take the raw data. For example,
some columns of a dataset might have imbalanced classes such as
fewer rows about people from a minority group. If not handled
appropriately, that situation can lead to bias in the resulting models.
Understanding feature engineering is an art, one that’s loaded with
math and science. To dive deeper into this, we recommend reading
Feature Engineering for Machine Learning, by Amanda Casari and
Alice Zhang (O’Reilly). Also check out the open source AIF360 tool‐
kit for a sophisticated set of tools that can be integrated into your
feature engineering workflows. AIF360 and tools like it help detect
potential bias and prepare data in ways that lead toward better AI
trust overall.
Core Skills | 35
Modeling
Finally! By many media accounts, data scientists spend most of their
time being concerned with exotic algorithms. Realistically, that’s
only a tiny part of the job. It’s important nonetheless, and you’ll need
people who understand the algorithms used to train machine
learning models, especially when it comes to evaluating results of
machine learning models, given that each family of algorithms tends
to use different metrics and evaluation criteria.
In the era of deep learning, modeling has become substantially more
costly to compute. Relatively few organizations train deep learning
models entirely from scratch; instead, they reuse models that some
other organization had previously trained. In the latter case, we use
transfer learning to adapt a previously trained model, “fine-tuning”
the model with data specific to a new use case. Consequently, it
becomes important to manage the hyperparameters—configurations
that control and adjust models—which might number in the thou‐
sands, with recent research efforts trending toward billions. Hyper‐
parameter optimization can lead to large expenses if not handled
well. Using popular frameworks such as TensorFlow, PyTorch, Ray,
and others is quite helpful, but you’ll need people who have these
skills.
36 | Chapter 3: AI Skills
decisions are not biased and that they properly represent your
brand.
Troubleshooting in Production
Over and beyond what the MLOps team will need to handle, it will
be required to do some troubleshooting in the live production
environment. Understanding the edge cases for security, privacy,
ethics, bias, and a range of compliance requirements is difficult. It
will probably require the people on your team with the deepest
experience in statistics, plus lots of hands-on experience with
machine learning.
This kind of troubleshooting work, and the risks associated with it,
are quite different from other areas of software engineering. Few
product managers, so far, have much experience directing projects
that depend on machine learning models; these are probabilistic sys‐
tems and require very different skills and processes than, say, web
app development. To learn more about how product management
for AI differs from previous practices, see “Why Machine-Learned
Models Crash and Burn in Production and What to Do about It” by
David Talby, and “Why Managing Machines Is Harder Than You
Think” by Pete Skomoroch.
Building Teams
Engineering teams tend to focus on process: how to use or develop
APIs most effectively, troubleshooting systems, estimating efforts
through time-boxing, paying down technical debt, and so on. All of
these are important for developing software, but those priorities
reflect the software-deployment end goal of iterating on a code base.
In contrast, data science work encompasses a much broader scope
than code and software deployment: organizational decisions must
be made, and risks must be considered, all in the context of domain
expertise. Instead of coding, data science teams emphasize systems
learning: specifically, preparing and curating data so that it engen‐
ders learning, produces insights, and augments decisions.
Team Culture
The culture that emerges within data science teams will be different
from that in engineering teams. Their priorities and risks are
Building Teams | 37
different, so their processes and methods are different. Whereas
software engineering teams many times align closer to technology,
data science teams align closer to business. Data science teams must
evaluate risks; they typically work with probabilistic systems, unlike
the deterministic systems that software engineers use. The results
from data science teams help to drive business outcomes, where
executives are focused on decision-making processes.
Think about how a software engineering team builds a web app.
Typically, the most experienced members of a team work on early
stages, such as requirements, API design, architecture, and so on. As
the project matures, less-experienced members of the team develop
individual features, unit tests, and so on.
In machine learning work this scenario is reversed. Building a
machine learning model from a given dataset is something even the
least experienced member of a data science team should be able to
do. However, after machine learning models are deployed in pro‐
duction, unanticipated problems emerge: fairness and bias issues,
security issues, privacy, model drift, and edge cases that require
advanced statistics to troubleshoot. Those problems require the
most experienced people on the team. They also tend to involve
compliance and regulators, so mistakes at that stage can escalate to
the executive level. In other words, the life cycle and process for
machine learning in production are nearly the opposite of what’s
needed for web app development. It’s no wonder that a substantially
different culture emerges for data science teams.
Even though engineering teams typically take requirements from
product managers and execute on those, data science teams are
often involved in more exploratory work. A kind of “startup mind‐
set” comes into play: the data science team might need to perform a
lot of work at the outset to see whether an idea would even be feasi‐
ble with the data available.
38 | Chapter 3: AI Skills
science” on the job posts, but that’s not what your data science team
needs.
Oddly enough, scientists are often a very good candidate pool. Early
in the history of data science teams in Silicon Valley firms, it became
apparent that roughly half of the people hired had come from phys‐
ics or physical science backgrounds. Physics grad students collect
and prepare data and run analysis, visualizations, and modeling, as
well learn to use a wide range of distributed computing platforms.
There also tend to be many more physics grad students than there
are jobs open for them. So that’s a good way to hire people familiar
with the tools and processes needed in data science teams—people
who can be productive on day one.
Although physics tends to utilize machine learning and high-
performance computing, social science research tends to engage
with statistics, follow protocols for ethical handling of data, and
anticipate outcomes on the people involved. The latter tends to use
smaller datasets, but the data typically concerns confidential infor‐
mation about human subjects: wages, medical history, family back‐
ground, home addresses, and so on. Privacy concerns, biases, ethics,
security, and data governance—these issues, which are innate practi‐
ces for social science research, are rapidly becoming top priorities
for data science teams. Moreover, social science researchers gener‐
ally understand how to study human behavior, which is often where
data science teams in industry need to focus: on customers. Perhaps
even more to the point, and similar to physics grad students, there
are generally many more social science grad students than there are
jobs open for them.
There are many good programs for people interested in “upskilling”
into data science roles. One point to keep in mind: even though
some candidates come from degree programs that emphasize the
skills, tooling, and mindset needed for data science work, there are
still relatively few university programs that focus specifically on this
interdisciplinary field. Also, the field is evolving too rapidly for any
one individual to acquire all of the skills needed; in other words,
data science teams require continual learning and good, ongoing
mentoring programs to be effective. Therefore, plan to provide
training. Foster a culture for ongoing mentoring because there’s
much that your people will need to learn, especially from one
another. Build paths for internal hires; that is, people who want to
“reskill” to move into the field. You’ll get people who are already
Building Teams | 39
familiar with your business and bring extra domain expertise into
your data science team.
One area that is not as easy to capture is the culture of your team. In
our experience, a resume is half the story; the other half is having
the appropriate mindset. A successful team generates endless curios‐
ity and passion: the curiosity to understand problems and make
things better, and the passion in the practice of AI and to drive
change and make an impact. Data science work is not easily prescri‐
bed, and every company will face different challenges. We like to
think that each company must develop its own unique data science
practice for its lines of business. Hiring people with the proper
mindset becomes crucial to support that. You also must hire to
diversify your team’s mindset. Avoid building a team where every‐
one has the same background. Also, less-experienced data science
team members who bring the aforementioned mindset can be
unleashed on business problems and really make a big impact on
your business. Build a team of people who are empowered to make a
difference.
Overall, try to get people who cultivate curiosity and are good com‐
municators. If needed, you might even organize a Toastmasters
group for your team to practice their public speaking skills.
40 | Chapter 3: AI Skills
CHAPTER 4
Summary
As you can see, artificial intelligence (AI) has many moving parts. A
practical Agile approach to AI focuses on the right team mindset of
flexibility, the right set of tools, and the right set of team skills.
Across the first three chapters, we talked broadly about those ideas
as well as identifying the proper uses cases, organizing your data,
and changing your business by teaching it to trust the products
you’re building. We want to take a few last words to close on the sig‐
nificance of these points so that you can go forth leading your team
to achieve its most ambitious AI projects.
Use Cases
Integrating AI into your business happens one use case at a time.
First, work to identify the appropriate use cases: use workshops and
design thinking, and ideate as a team to get many perspectives and
insights. Focus on business problems; don’t get caught in the trap of
building technology for the sake of technology.
After a project, conduct retrospectives on those use case implemen‐
tations. What worked well? What could’ve been handled better?
Identify patterns in your use cases, and share those across the orga‐
nization. Ideally, create a Center of Excellence for AI patterns within
your organization.
41
Data
You can’t deploy AI without first getting your data into shape. Those
are table stakes, the absolute basics. Recent industry surveys about
AI adoption in the enterprise have shown that the majority of firms
become bogged down in the technical debt they must resolve before
they can share data across divisions. Also recognize that data shar‐
ing runs counter to the inertia that tends to arise in large organiza‐
tions. When people have complex tasks to manage, it’s generally
simplest to break those into smaller, simpler tasks and then keep the
parts separate. Organizations respond to complexity in much the
same way over time, establishing silos between divisions. That’s
especially true when it comes to data management and access con‐
trols: the easiest answer to a request for data access is “no.” However,
the easiest answer is not always the best in the long run.
Some people have called data “the new oil”—an economic good
driving enormous value—but it’s better to think of data as an invest‐
ment portfolio. Your data science teams help manage those invest‐
ments in data, and they help realize equity and yield from them.
One idea that’s been making the rounds is data democratization: an
idea in which nearly everyone in a company can run queries, create
analysis, generate reports, and so on. Obviously, not all the data in a
firm will be made available to every employee, given that there will
likely be confidentiality concerns to balance, as well. Data democra‐
tization has drawbacks: making claims about data analysis depends
on having enough of a statistics background to get it right—
although this is where data science teams can help coach and
amplify results from the rest of the organization.
There’s a related term to data democratization: citizen data science.
This is when people throughout your organization become involved
in developing data insights; if you think of data science as a contin‐
uum in which people on one end are just beginning to “upskill” into
these kinds of roles, and experts on the opposite end are continually
learning new skills for their data science practice, citizen data sci‐
ence makes a lot of sense for a large organization. Drawbacks and
critiques aside, both of these approaches can make a lot of inroads
toward breaking down data silos.
42 | Chapter 4: Summary
Tools and Process
To get a job done well, you must have the appropriate tools and
apply the appropriate process to them. Embracing open source is an
excellent first step because there’s so much available and robust
developer communities to learn from. Also, don’t limit which tools a
data science team can use: let it evaluate alternatives and use what’s
appropriate for the people and the challenges involved. Bring in new
tools to support new talent while utilizing the existing approaches
and tooling to support your existing talent. For example, some of
your staff will need to use Python and machine learning frame‐
works, whereas other people might be more focused on SQL and
reporting.
Be agile in your approach! This means keeping in mind that effec‐
tive work is not about building the best possible model for one use
case; it’s about the cadence you establish for creating many effective
models for a range of use cases. Invest time to prove value; however,
time is typically the most expensive resource in business, so don’t
spend too long to prove value in a given use case. Time is not your
friend. Joni Rolenaitis, Experian CDO, said it best, “Go agile or go
home.”
Mindset
You need to foster the proper kind of culture for a data science team
so that the members can develop an effective mindset for the chal‐
lenges they face. Data and dogma don’t mix well. Data science teams
are often in the “hot seat” because they’re the gatekeepers for analy‐
sis that disproves what others in the organization might take as giv‐
ens. On a data science team, we’re fostering an analytic mindset, one
that’s driven by curiosity. To support that, create an environment in
which people can collaborate with their peers, where criticisms
coming from outside the team are constructive, and in which opin‐
ions—both inside and outside the team—are respected but not wor‐
shipped. Ultimately, data and reproducible analysis, and an extra
dose of curiosity, must rule over opinion.
Diversity helps. Diversity builds a wide range of skills and experi‐
ence across a team, which combine to produce effective solutions.
Data science is inherently interdisciplinary. Diversity also helps tem‐
per opinions so that they don’t turn into bias.
44 | Chapter 4: Summary
Conclusion
To close this book and pull together the components described
throughout it, your challenge is to build a data science strategy for
your organization. As Figure 4-1 shows, we approach this challenge
from two directions. We want a culture that supports discovery,
diversity of skills and perspectives; embraces new ideas and insights
about your business; and moves quickly to prove or disprove them.
We also want an effective data platform that supports a data science
team that has diverse needs, along with scaling for customer use
cases.
Conclusion | 45
of your data. Make these platform capabilities meet the needs of
your data science team’s culture and skills, both for immediate pur‐
poses and as their curiosity and insights and organization learning
grows. Also, make your platform capabilities meet your customers
needs as your business grows.
Ultimately, these two approaches must meet in the middle.
Throughout this entire process, there are feedback loops that you
can recognize and nurture so that both your platform and your team
improve continuously. With these practices, skills, and culture
together, you can be agile in building AI in your organization.
46 | Chapter 4: Summary