Ai in Semiconductor Manufacturing Paper
Ai in Semiconductor Manufacturing Paper
Ai in Semiconductor Manufacturing Paper
Wafer Map
Process Tool
Pattern
Matching
Classification
Example AI Applications
ML-based
Advanced Deployed In Intel ML-based
Process Root-Cause
Control
Manufacturing Analysis
Each application in Figure 1 fits the n-i, n problem statement. Approaching AI with a Clear Purpose and Goal
Here are some examples:
The internet abounds with articles highlighting the colossal
• RCA. Data from n-i steps is used to find the root cause for failure rates of AI and big data initiatives across industries.
the abnormality observed at step n. The AI project failure rate ranges between 60 percent and 85
• Machine learning for Advanced Process Control (APC). percent, 2 despite huge investments in AI. One of the primary
Data from n-i steps is used to control the process at step n. reasons for failed AI projects is that some are created without
a clear use case in mind. In our opinion, AI should not become
• Predictive Die Kill and Test Reduction. Machine-learning a solution in search of a problem.
models are built using data from n-i steps to predict failures
at some downstream step n. Advanced Analytics Building Blocks
• Automatic Defect Classification (ADC). Applications are
devised to detect anomalies and quantify baseline non-
systematic defects at a given step n.
30%
10% Modeling
Prioritizing AI Use Cases to Optimize Understand
the Problem
Business Value
Defining a framework for prioritizing and deploying AI 10%
Solution Deployment
is critical for success. It’s important because enormous (DevOps)
volumes of data are generated from manufacturing
operations every day. Equally essential is the burgeoning 50%
interest in AI from engineers and executives, as well as the Data Management
pervasiveness of self-learning courses and the relative ease
of creating pilot AI solutions. The prioritization process
turns out to be simple, but not trivial, comprising three main
ingredients: substantial business value, feasibility, and time
to value (see Figure 2). Specifically, it is critical to answer the
following questions to assess the need to automate existing Figure 3. Building blocks of an advanced analytics solution.
business processes and workflows with AI:
• Are there any cost, productivity, or yield benefits?
• Is the targeted application error-tolerant? That is, are
occasional false positives acceptable? As well-documented on the internet, the failure rate for
AI projects ranges between 60 percent and 85 percent
• Can the solution be truly automated, at scale? That is, can
it be integrated into existing manufacturing automation across industries, despite huge investments in AI. We
systems in a way that the AI models can be built, monitored, believe that one of the primary reasons for failed AI
and updated with minimal manual intervention? projects is that a majority of them were created without
• Will the solution be ready in time to produce the intended a clear use case in mind.
business impact?
2. Solution deployment (DevOps) accounts for another typically maintains only 70–85 percent accuracy over time
10 percent of the advanced analytics pie. DevOps is the due to multiple reasons, including:
only way to democratize complex algorithms across • The work is highly repetitive.
the organization. Even at the PoC stage, we are already
• Process changes can result in new defect types that require
planning for HVM deployment, which includes integrating
further training.
the solution into our factory automation systems, to
ensure adoption. • Classifying IC defects is inherently difficult. Some defects
require design layout cross-referencing for accurate
3. Modeling represents 30 percent of the advanced analytics diagnosis, while others simply cannot be perceived by the
pie. Here, we follow two rules: human eye and brain.
n Start with simple interpretable techniques, such as robust
linear models or single decision trees, before employing
less transparent AI methods such as ensembles or much 90%
heavier neural nets.
Accuracy
70%-85%
n Use the best possible AI engines (algorithms) that are
Classification Accuracy
Accuracy
customized for our data domain. Specifically, we have
invested heavily in exceedingly performant engines that
can handle the unique characteristics of semiconductor
data—highly imbalanced data, missing data, categorical
data, and very often, “dirty” data. We constantly
benchmark the performance and accuracy of our custom
AI engines against the open-source engines used by
millions of data scientists around the world. 3 Depending
on the use case, we may use our custom AI engines or
the open-source ones, or a mix of both. 0 3 6 9 12 15 18 21 24
4. Data management is the biggest slice of the advanced Months
analytics pie—about 50 percent. And yet it is often the
least exciting facet of advanced analytics. The problem Figure 4. Defect measurements – accuracy vs. time.
with data is that it lives in different formats—structured
and unstructured, video files, text, and images—and is With the problem statement clearly understood, we worked
stored in different places with different security and privacy with a cross-functional team of process, yield, defect
requirements. Our data challenges are similar to those at metrology, and equipment engineers to implement a
other companies; we have addressed them and continue machine-learning (including deep neural nets) ADC solution.
to improve upon them, through multiple data initiatives. This solution has since been deployed in both TD and HVM
for every technology node manufactured at Intel—including
Intel® Xeon® Scalable processors and Intel® Optane™
An AI Sampler: Three At-Scale AI technology. The deployment itself required enormous effort
Solutions Boosting Manufacturing and investment to integrate the AI algorithms into factory
automation systems. Integration included several levels:
Quality and Productivity
• The input side with the defect inspection systems.
Thousands of AI models are in production at Intel. We used
the project prioritization framework to develop and deploy • The user side to allow defect engineers and technicians
each of the AI solutions shown in Figure 1. Let’s examine to label images and configure corresponding target layout
three AI solutions in our factories, to illustrate how the information.
prioritization framework contributes to project success. • The factory operations side to automatically generate
statistical process control (SPC) alerts and place factory
ADC with Computer Vision and Machine Learning lots on hold.
ADC with computer vision and machine learning was among ADC enables us to measure and classify most of the defects
the first AI solutions that we deployed in HVM more than a on wafers produced by Intel’s factories with a required rate
decade ago.4 Inline defect metrology helps detect excursions of accuracy, and we have not experienced any increase in
and catch issues in nanometric silicon chip layouts before the total cost of ownership compared to other solutions. We
they become serious yield and quality issues. ADC helps have also been able to use existing imaging equipment in
pinpoint issues at their source. the post-wafer manufacturing process to implement ADC
with computer vision and machine learning where it did
As illustrated in Figure 4, it can take six to nine months to not previously exist, helping to prevent errors early and
train people to manually classify defects with 90 percent increase yield at no additional cost.
accuracy. Even after training is complete, an expert operator
White Paper | High-Value AI in Intel’s Semiconductor Manufacturing Environment 5
Root-Cause Analysis
Another good example of at-scale machine learning and
advanced analytics is the RCA solution, which we have
democratized across all technology nodes in Intel factories.
In semiconductor manufacturing, rapidly finding the
root cause of yield and quality issues is critical for both
profitability and customer satisfaction. Probe Card damage defects
automatically detected and classified
The problem: At the scale of Intel’s manufacturing, finding the
Dust, Damage, Scuffs
root cause for a yield issue typically requires mining billions
of parameters across e-tests, SPC, tools, operations, defects,
queue times (QTimes), process times, wafer slot order, Figure 5. Probe Card defects – from manual to AI inspection.
equipment logs, and many other data types. It is similar to
finding the proverbial needle in a haystack. An analytically
We took a multi-stage approach to build a fully automated
savvy engineer, with strong domain knowledge and years of
inspection system for Probe Cards. At each stage, we built
experience, might be able to intelligently mine all available
intermediate applications that reduced the technician’s
data in a few hours or days; but this knowledge is difficult
workload. The overall solution now coalesces these
to share between even two engineers, let alone across all of
applications. For example, one application automatically
Intel’s factories.
collects the image data while the Probe Card is on the test
To democratize RCA, we developed interpretable machine- equipment. Review tools flag only anomalous areas of the
learning engines (including enhanced decision trees, novel Probe Card. Another application allows the technicians
committee methods, feature selection, and rule induction to easily label the data, which in turn allows us to create
techniques) that can handle massive, noisy, heterogeneous, a labeled dataset to train a deep-learning AI system. By
and frequently missing not at random (MNAR) manufacturing starting with a minimum viable product and augmenting the
data. These engines provide solutions for tasks such as RCA, functionality incrementally, we developed a solution that
but a large effort is required to first transform the data into worked for the technicians even as they helped us attain our
an analyzable form. We applied semiconductor domain goal of full automation. The system is now fully automated
expertise to create a custom big data storage infrastructure and deployed at multiple factories, providing significant
that provides very fast data access to the multi-dimensional productivity enhancements: A task that previously took as
data required for RCA; engineers can now find potential root long as 46 hours per week per factory has been reduced to
causes in a few minutes compared to hours or days. And by less than 60 seconds.
White Paper | High-Value AI in Intel’s Semiconductor Manufacturing Environment 6
1
“Spatial Patterns in Sort Wafer Maps and Identifying Fab Tool Commonalities,” Eric R. St. Pierre; Eugene Tuv; Alexander Borisov, Proceedings of the 2008 IEEE/SEMI Conference and
Workshop on Advanced Semiconductor Manufacturing.
2
“85% of big data projects fail, but your developers can help yours succeed,” Matt Asay, TechRepublic, November 10, 2017.
3
Isabelle Guyon et al. “Analysis of the AutoML Challenge Series 2015-2018,” Automated Machine Learning. The Springer Series on Challenges in Machine Learning, (2019): pp 177-219.
4
“Faster, More Accurate Defect Classification Using Machine Vision,” Eugene Tuv et al., IT@Intel white paper, November 2018.
Performance varies by use, configuration and other factors. Learn more at intel.com/PerformanceIndex. Performance results are based on testing as of dates shown in configurations
available updates. See configuration disclosures for details. No product or component can be absolutely secure. Your costs and results may vary. Intel technologies may require enabled
hardware, software, or service activation. Intel, the Intel logo, and other Intel marks are trademarks of Intel Corporation or its subsidiaries. Other names and brands may be claimed as the
property of others. © Intel Corporation 0222/AREY/KC/PDF 349850-001US