The AI Human Capital Playbook: 1 Workera
The AI Human Capital Playbook: 1 Workera
Capital
Playbook
Companies everywhere are building AI teams, but it’s
still unclear what aspiring machine learning engineers,
data scientists, and software engineers should focus
on when applying for AI jobs. This report walks you
through different types of organization, different roles
within them, the tasks you’ll work on, and the skills
recruiters are looking for in each role. It is the result
of two large-scale studies of the supply of and
business demand for AI talent. We're here to provide
mentorship and help you find a job that suits your
skills, experience, and aspirations.
1 Workera
CONTENTS
AI organizations.....................................................................................................................................4
The AI project development lifecycle: Tasks and Skills..................................................................6
Overview of the AI project development lifecycle...........................................................7
Data Engineering......................................................................................................................8
Modeling.................................................................................................................................. 10
Deployment............................................................................................................................ 12
Business Analytics................................................................................................................. 14
AI Infrastructure.................................................................................................................... 16
The roles of an AI team.................................................................................................................... 18
Overview................................................................................................................................ 19
Data Scientist......................................................................................................................... 20
Data Analyst........................................................................................................................... 21
Machine Learning Engineer................................................................................................. 22
Machine Learning Researcher............................................................................................ 23
Software Engineer - Machine Learning............................................................................. 24
Software Engineer................................................................................................................. 25
Conclusion........................................................................................................................................... 26
2 Workera
This report is a work in progress and is being
provided to the public for information purposes.
Because it is a work in progress, there are parts that
are either missing or will be progressively revised as
our team learns more about the supply of and the
demand for AI talent.
3 Workera
PART 1
AI
Organizations
We interviewed 100+ data science and machine
learning leaders from companies such as Airbnb,
Amazon, Earnin, Facebook, Google, Landing.ai, Lyft,
Proofpoint, and Upstart to understand the roles, tasks,
and skills that make up a corporate AI organization
(this is opposed to an academic setting, where often
the AI project development lifecycle differs).
4 Workera
Data Science vs.
Machine Learning
Organizations
5 Workera
PART 2
The AI project
development
lifecycle:Tasks
and Skills
From our research, most AI organizations’ work
divides into five tasks: data engineering, modeling,
deployment, business analysis, and AI infrastructure.
Together, these tasks make an AI project development
lifecycle. Each task requires specific skills and can be
the focus of multiple roles.
6 Workera
Overview of the AI
Projects Development
Lifecycle
A DS project starts with (i) data on which you can (ii) fit a
model. These models and (iii) other data analyses help you
make actionable business decisions. The (iv) AI infrastruc-
ture is necessary to support all tasks described above (i, ii
and iii).
7 Workera
Data
Engineering
8 Workera
Subtask Examples Technical Skills Involved
9 Workera
Modeling
10 Workera
Subtask Examples Technical Skills Involved
Using one of the following methods: Linear Regression, Logistic Regres- Machine Learning
Training machine sion, Decision Trees, Random Forest, XGBoost, Support Vector Machines, Algorithmic Coding
learning models K-means, K-Nearest Neighbors, Neural Networks, Principal Component Mathematics
Analysis, Naive Bayes Classifier, Lasso/Ridge regression, etc. Data Science
Defining evalua-
Choosing F1-score to evaluate a machine learning model’s performance on
tion metrics (usu- Machine Learning
a classification task
ally also involves Algorithmic Coding
Implementing evaluation metrics such as accuracy, precision, recall, inter-
a data product Mathematics
section over union, mean average precision (mAP), etc.
manager)
Research
Keeping up with Reading research papers Mathematics
the state-of-the-art Watching conference lectures or attending conferences Data Science
11 Machine Learning Workera
Deployment
Deployment includes all of the activities that make a
model available for use. Given a data stream (from the data
engineering task) and a model (from the modeling task),
individuals in charge of deployment will package and test
models before pushing them to production environments.
Deployment activities require the ability to write produc-
tion code, including strong back-end engineering skills
and understanding of cloud technologies.
12 Workera
Subtask Examples Technical Skills Involved
Improving
response times Setting up load-balancing requirements with engineers in charge of AI
Software Engineering
and saving Infrastructure
bandwidth
Encrypting files
that store model
Understanding encryption at a high level and leveraging existing func-
parameters, Software Engineering
tions
architecture and
data
Building APIs
Setting up HTTP RESTful API services to facilitate communications
for an appli- Software Engineering
between software components
cation to use a
Setting up authorization and authentication to access the API
model
Retraining ma-
chine learning Software Engineering
Monitoring changes in data distribution and staging model updates
models (lifelong Machine Learning
learning)
Fitting mod-
els on a re- Pruning or quantizing a model so it fits memory requirements Software Engineering
source-con- Deploying a model on a mobile device using TensorFlow Machine Learning
strained device
13 Workera
Business
Analysis
14 Workera
Subtask Examples Technical Skills Involved
Visualizing high-dimensional data in lower dimensions using methods such Domain-specific programming
as PCA or t-SNE languages
Building data
Building and presenting graphs produced using Tableau, ggplot or matplot- Data Science
visualizations
lib Mathematics
Building visualizations in Javascript, HTML and CSS Business Acumen
Building dash-
Writing a script that periodically notifies business leaders of trends in the
boards for Busi- Domain-specific programming
data
ness Intelligence
Translating statis-
Data Science
tics into actionable Making marketing decisions based on analysis of various sources
Business Acumen
business insights
Data Science
Optimizing web pages with A/B tests
Running A/B tests Algorithmic Coding
Evaluating systems in production
Business Acumen
15 Workera
AI Infrastructure
AI infrastructure aims to facilitate data engineering, mod-
eling, and deployment by building and maintaining reli-
able, fast, secure, and scalable software systems. Working
on AI infrastructure requires strong and broad software
engineering skills.
16 Workera
Subtask Examples Technical Skills Involved
Making software
Reducing latency by locating a model close to data Software Engineering
design decisions
Designing for scale Adding GPU compute or storage as needed Software Engineering
Maintaining soft- Managing software upgrades such as Python 2’s end of life on 01/01/2020,
Software Engineering
ware infrastructure and driving stability through automated monitoring and alerting
Securing data and Building security features allowing for production deployments into regulat- Software Engineering
models ed organizations, satisfying the needs for privacy and security
Writing unit and functional tests for multiple components across tasks of
Writing tests Software Engineering
the AI project lifecycle
Carrying out vari- Building a labeling software for a client, or key tools such as A/B testing
Software Engineering
ous software tasks frameworks or analysis environments
17 Workera
PART 3
The roles of
an AI team
There is no standard for roles in AI teams. Besides,
the lack of information about the supply of AI talent
makes it difficult for hiring managers to set
reasonable job requirements that correlate with
on-the-job performance. To bridge this gap, we
assessed the skills of thousands of individuals
aspiring to work in AI organizations and analyzed
hundreds of job descriptions for AI roles.
18 Workera
The Six Roles of
an AI Team
We identified six technical roles with distinct skill sets and a necessary step to enhance modeling, deployment, and
focus areas. Each of these roles contributes to a number of business analysis.
tasks in the AI development cycle.
For each role, we list the tasks it may carry out and the
All roles undertake (to a certain extent) the data engi- skills necessary to achieve those tasks.
neering task. That’s because data engineering is usually
Note:
• The dotted line indicates a less signifant involvement
with the task at hand. A Software Engineer - ML uses
out-of-the-box methods to carry out the modeling
task while an MLE, MLR or DS is able to customize
models.
19 Workera
Data
Scientist
TASKS
SKILL PROFILE
20 Workera
Data
Analyst
TASKS
SKILL PROFILE
21 Workera
Machine Learning
Engineer
TASKS
SKILL PROFILE
SKILL PROFILE
SKILL PROFILE
SKILL PROFILE
25 Workera
This report is a work in progress and is being
provided to the public for information purposes.
Because it is a work in progress, there are parts that
are either missing or will be progressively revised as
our team learns more about the supply of and the
demand for AI talent.
26 WORKERA Workera
a deeplearning.ai company