Leakage Minimization Technique For Nanoscale Cmos Vlsi: Ecause of The
Leakage Minimization Technique For Nanoscale Cmos Vlsi: Ecause of The
Leakage Minimization Technique For Nanoscale Cmos Vlsi: Ecause of The
Leakage Minimization
Technique for Nanoscale
CMOS VLSI
Kyung Ki Kim and Yong-Bin Kim Nohpill Park
Northeastern University Oklahoma State University, Stillwater
Minsu Choi
University of Missouri-Rolla
322 0740-7475/07/$25.00 G 2007 IEEE Copublished by the IEEE CS and the IEEE CASS IEEE Design & Test of Computers
Therefore, this problem requires a better understand-
ing and a more accurate model of leakage currents
for input pattern control in nanometer CMOS circuits.
In this article, we investigate all the possible leakage
current components and propose a novel macromo-
deling technique to characterize the minimum leak-
age current of each individual leaf cell, considering
fan-out effect, stack effect, and the interaction between
gate-leakage and subthreshold currents. We devel-
oped a heuristic approach to find the input pattern for
minimum leakage current during standby mode of
nanoscale VLSI chips. Our approach uses the func-
Figure 1. Leakage current in a nanoscale CMOS circuit: off-
tional dependencies in VLSI and the controllability of
state (a) and on-state (b) leakage components.
its nodes.
gate-oxide thickness, the tunneling drops exponential-
Leakage power model and analysis ly. This is given by
In the off state, the main components of leakage
Tox
{B VGS a
current are subthreshold leakage (ISubTH), gate- Igate tunneling ~ ðAC ÞðWLÞe ð1Þ
induced drain leakage (IGIDL), gate-tunneling leakage
(IGATE), and band-to-band tunneling (IBTBT), as Fig- where A 5 q3/(8phwb), B 5 8p (2mox)1/2wb3/2/(3hq),
ure 1a shows. Gate-tunneling leakage (IGATE) is the C 5 (VGS/Tox)2, and a is a parameter that ranges
major component during the on state, as Figure 1b from 0.1 to 1, depending on the voltage drop
shows.10 across the oxide. Here, h is Planck’s constant, and
The GIDL is a current from the drain to the substrate wb is the barrier height for electrons and holes in
the conduction and valence band.
caused by the high electric field between the gate and
In nanometer CMOS technology, the gate-tunneling
the drain; thin gate-oxide thickness and a high supply
leakage current is expected to increase to more than
voltage increase GIDL. The gate-tunneling leakage is
twice the subthreshold leakage current. Figure 2a
a current flowing into the transistor’s gate by
shows the gate-tunneling leakage currents produced
a tunneling effect; thin gate-oxide thickness and a high
by a nanoscale n-channel MOS transistor. As Figure 2a
supply voltage also increase gate-tunneling leakage.
shows, the gate-tunneling current consists of four
The subthreshold leakage is a weak inversion conduc-
components: gate-to-channel tunneling (IGC), gate-to-
tion current of the CMOS transistor when VGS is less
drain edge tunneling (IGD), gate-to-source edge
than VTH. It increases exponentially because of the
tunneling (IGS), and gate-to-body tunneling (IGB). The
reduced threshold voltage, and it is a main leakage
gate tunneling’s magnitude depends on the applied
component in the case of a high forward body bias.
voltage VGS. For NMOS, four possible states exist,
Finally, the BTBT leakage is a current by electron
depending on the voltages of the three terminals
tunneling across the reverse-biased pn junction
between the drain or source and the CMOS transistor’s
substrate. Therefore, in the case of a high reverse body
bias, the BTBT leakage becomes a major portion of the
total leakage current. Kuroda et al. show that the
subthreshold leakage current and the BTBT leakage
are more sensitive to the applied body bias than the
other two leakage components.10 The minimum
leakage current is obtained when the subthreshold
leakage current is equal to the BTBT leakage.
Figure 2. The gate-tunneling and subthreshold leakage current
Gate-tunneling leakage current in NMOS and PMOS transistors: maximum gate-tunneling
Gate-tunneling leakage is a current flowing (tun- leakage current state (a) and maximum subthreshold leakage
neling) into the transistor’s gate. With an increase of current state (b).
July–August 2007
323
Computer-Aided Design for Emerging Technologies
Fan-out effect
Depending on the pri-
mary input (PI) pattern,
Figure 4. Leakage current flows in nanoscale CMOS circuits. the subthreshold leakage
current and gate tunnel-
ably, which is known as the stack effect. The leakage ing are affected by the adjacent fan-in and fan-out
current decreases monotonously with the number of logic circuits. Figure 5 illustrates the dependency of
stacked off transistors. the leakage current on the fan-out structures. In
Because of the transistor stack effect, a gate’s Figure 5, the PI is logic 1, the number of fan-outs of
leakage current depends on its input combination. An inverter G2 is two, and the number of fan-outs of
individual CMOS gate shows a variation in the leakage inverter G3 is three. First, the current IgG3 is the gate-
power for different input patterns. Only a few input tunneling leakage of inverter G3. In this circuit, IgG2
patterns, defined as dominant leakage states, cause and IgG4 are the gate-tunneling leakage currents of G2
significant leakage. Therefore, 011, 101, 110, and 111 and G4, respectively. The directions of the three
input patterns of a 3-input NAND gate are dominant currents converge into the input of inverter G3.
leakage states. If the dominant leakage states of all The sum of the gate leakage currents at node N3 is
input patterns are used to generate macromodels, the a function of gate G1’s fan-out and the subthreshold
lookup table’s size for the macromodels is reduced by currents of G2, G3, and G4. The 0 state voltage at node
ignoring the rest. N3 increases as the fan-out of G1 increases, which in
Figure 4 shows the static current paths that appear turn reduces the gate-leakage current of G2, G3, and
when the leakage current is considered in CMOS G4 because the voltages between the inputs and
circuits. In the circuits, each inverter has a few paths of outputs of those gates are reduced. The gate-leakage
subthreshold and gate-tunneling leakage current. We currents of G2, G3, and G4 are also a function of their
can assume that Inv 2 is the device under test (DUT), subthreshold currents because the subthreshold cur-
and the input of Inv 1 is 0. Inv 2 has three leakage rents affect the voltage between those gates’ inputs
components, which depend on the fan-out structures and outputs. Considering these fan-out effects, IgG3 is
of Inv 2: about one-third of the gate-tunneling leakage when G1
has only one fan-out. Consequently, the subthreshold
& the gate-tunneling current Igate_inv2 starting from current is influenced by the number of fan-outs of the
the PMOS of Inv 1, previous driver. However, the fan-out of inverter G3
& the subthreshold leakage of the turned-off state cannot have a significant effect on the leakage current
PMOS in Inv 2 (Isub_inv2), and of inverter G3. As the number of fan-outs for G3
& the gate-tunneling current Igate_inv3 starting from increases, G3’s output voltage decreases, which then
Inv 3. reduces its subthreshold and gate-tunneling leakage
currents.
Therefore, the total leakage current is the sum Therefore, the total leakage of inverter G3 is
of Igate_inv2, Isub_inv2, and Igate_inv3. However, when affected by the fan-outs of G1 and G3. Hence, it is
a cell’s macromodel is generated, one leakage tun- necessary to consider the interaction of each leakage
neling current (Igate_inv3) should be removed to make current component in both previous stages and the
July–August 2007
325
Computer-Aided Design for Emerging Technologies
Input-pattern generation
Based on the fan-out effect in leakage
current, we developed the macromodel
for a leaf cell (inverter, NAND, and NOR
gates) based on an Hspice simulation,
where the controlling variables are the
number of fan-outs, the cell size, and the
Figure 5. Fan-out effect for the G3 gate. The leakage currents depend on the input pattern, considering the stack
fan-out structures of the G3 gate. [W/L of PMOS is (90 nm)/(45 nm), and W/L effect under fixed VDD, VTH, Tox, and
of NMOS is (45 nm)/(45 nm), where W and L are the transistor channel width temperature. On the basis of the accu-
and length.] rate macromodel of the cells, we
implemented a heuristic approach to
generate the minimum leakage test
next stages for an accurate leakage estimate in pattern. The leakage of each cell in the circuit depends
nanoscale CMOS circuits. However, the effects of the on the input pattern applied to the circuits. Research-
leakage current components beyond one logic level ers have proposed several techniques to generate the
from the DUT are negligible. input pattern for minimum leakage current and solve
4–6
Figure 6 presents the fan-out effect on the leakage the NP-hard problem. An easy way to solve the
current for inverter G3 shown in Figure 5. The leakage problem is to use the functional dependencies in the
currents are measured at inverter G3 in Figure 5. The circuits and the controllability of the nodes. In this
number of fan-outs of G1 varies from 1 to 5, as does the article, we improve the methodology to estimate the
number of fan-outs of G3. Figure 6a and Figure 6b accurate leakage current with a fast simulation time.
show the subthreshold leakage and gate tunneling, The functionality of the cells in circuits determines
respectively, when the input of inverter G3 is 1. the states of the internal nodes for any given input
Figure 6c and Figure 6d show the subthreshold vector. A cell is dominated if its input pattern causing
leakage and gate tunneling, respectively, when the the minimum leakage current is a subset of the
input of inverter G3 is 0. minimum leakage input pattern of the other cells. A
As expected, the number of fan-outs of G1 affects cell is conflicting if the input pattern causing the
the leakage current. For the 0 input to G3, the fan-outs minimum leakage conflicts with other cells’ input
of G3 significantly affect the leakage current, but less vectors that cause the minimum leakage.
than the fan-outs of the previous driver. We measured Before finding the optimal input pattern to reduce
the smallest total leakage (0.73 mA) for the 1 input with leakage power dissipation, the functional dependen-
five fan-outs of G1 and five fan-outs of G3. We see the cies between cells should be searched, and each cell’s
highest total leakage (2.33 mA) for the 0 input with one dominated and conflicting cells should be listed in the
fan-out of G1 and one fan-out of G3. If we do not order of the weight function, which is given by
July–August 2007
327
Computer-Aided Design for Emerging Technologies
simulation results. In addition, the simulation time of accurate macromodel, because the leakage current
our method is far faster than that of the Hspice below 90-nm technology is sensitively affected by
simulation. PVT variations. &
It is not easy to compare our method with alternate
approaches, because most of the previously published & References
research requires significant circuit modification and 1. K. Roy, S. Mukhopadhyay, and H. Mahmoodi-Meimand,
too much performance overhead and they do not ‘‘Leakage Current Mechanisms and Leakage Reduction
consider fan-out effects. Some of the previous research Techniques in Deep-Submicrometer CMOS Circuits,’’
requires little or no overhead, but they are NP-hard Proc. IEEE, vol. 91, no. 2, Feb. 2003, pp. 305-327.
problems. Our heuristic approach solves the previous 2. A. Agarwal et al., ‘‘Leakage Power Analysis and
approaches’ problems, considering all the leakage Reduction: Models, Estimation and Tools,’’ IEE Proc.
components and the relationships between them. The Computers & Digital Techniques, vol. 152, no. 3, May
best way to compare the efficiency of the algorithms is 2005, pp. 353-368.
to compare the complexity of the algorithms. The 3. S.G. Narendra and A. Chandrakasan, Leakage in
proposed approach’s complexity is O(n2), whereas the Nanometer CMOS Technologies, Series on Integrated
other algorithms’ complexities are far higher, and the Circuits and Systems, Springer, 2005.
simulation time of our approach is faster than the 4. Y. Xu and Z. Luo, ‘‘Minimum Leakage Pattern Generation
previous approaches by at least a factor of 5. Using Stack Effect,’’ Proc. 5th Int’l Conf. ASIC, IEEE
Press, 2003, pp. 1239-1242.
THE REMAINING ISSUE is to consider process, voltage, 5. F. Gao and J.P. Hayes, ‘‘Exact and Heuristic Approaches to
and temperature (PVT) variations for a more Input Vector Control for Leakage Power Reduction,
July–August 2007
329
Computer-Aided Design for Emerging Technologies
Computer Aided Design,’’ Proc. Int’l Conf. Computer-Aided circuit design. Kim has a BS from Sogang University,
Design (ICCAD 04), IEEE CS Press, 2004, pp. 527-532. Seoul, an MS from the New Jersey Institute of
6. X. Chang et al., ‘‘SoC Leakage Power Reduction Technology, and a PhD from Colorado State Univer-
Algorithm by Input Vector Control,’’ Proc. Int’l Symp. sity, all in electrical engineering. He is senior member
System-on-Chip, IEEE Press, 2005, pp. 86-89. of the IEEE.
7. Y. Xu, Z. Luo, and X. Li, ‘‘A Maximum Total Leakage Current
Estimation Method,’’ Proc. Int’l Symp. Circuits and Systems Minsu Choi is an associate pro-
(ISCAS 04), vol. 2, IEEE Press, 2004, pp. 757-760. fessor in the Department of Electrical
8. Y. Xu et al., ‘‘Average Leakage Current Macromodeling and Computer Engineering at the
for Dual-Threshold Voltage Circuits,’’ Proc. 12th Asian University of Missouri-Rolla. His re-
Test Symp, (ATS 03) IEEE CS Press, 2003, pp. 196-201. search interests include computer
9. R. Kumar and C.P. Ravikumar, ‘‘Leakage Power architecture and VLSI, embedded systems, fault
Estimation for Deep Submicron Circuits in an ASIC tolerance, testing, quality assurance, reliability mod-
Design Environment,’’ Proc. Conf. Asia South Pacific eling and analysis, configurable computing, parallel
Design Automation / VLSI Design (ASP-DAC/VLSID 02), and distributed systems, instrumentation and mea-
IEEE CS Press, 2002, pp. 45-50. surement, and computational nanotechnology. Choi
10. T. Kuroda et al., ‘‘Variable Supply-Voltage Scheme for has a BS, an MS, and a PhD in computer science
Low-Power High Speed CMOS Digital Design,’’ IEEE J. from Oklahoma State University, Stillwater. He is
Solid-State Circuits, vol. 33, no. 3, Mar. 1998, pp. 454-462. a member of Sigma Xi and of the Golden Key
11. S. Yang et al., ‘‘Accurate Stacking Macro-modeling of National Honor Society.
Leakage Power in Sub-100 nm Circuits,’’ Proc. 18th Int’l
Conf. VLSI Design (VLSID 05), IEEE CS Press, 2005, Nohpill Park is an associate pro-
pp. 165-170. fessor in the Computer Science De-
12. D. Lee et al., ‘‘Analysis and Minimization Techniques for partment at Oklahoma State Universi-
Total Leakage Considering Gate Oxide Leakage,’’ Proc. ty, Stillwater. His research interests
40th Design Automation Conf. (DAC), ACM Press, 2003, include computer architecture, de-
pp. 175-180. fect- and fault-tolerant systems, testing and quality
assurance of digital systems, parallel and distributed
computer systems, multichip-module systems, pro-
Kyung Ki Kim is a PhD candidate in grammable digital systems, and SoCs. Park has a BS
the Department of Electrical and and an MS in computer science from Seoul National
Computer Engineering at Northeast- University and a PhD in computer science from Texas
ern University. His research interests A&M University.
include high-speed, low-power VLSI
design; analog VLSI circuit design; electronic CAD; & Direct questions and comments about this article to
and ATE system design. Kim has a BS and an MS in Yong-Bin Kim, Dept. of Electrical and Computer
electronics engineering from Yeungnam University, Engineering, Northeastern University, 442 Dana Re-
Kyungsan, South Korea. search Center, 360 Huntington Ave., Boston, MA
02115; ybk@ece.neu.edu.
Yong-Bin Kim is an associate pro-
fessor in the Department of Electrical
and Computer Engineering at North- For further information on this or any other computing
eastern University. His research fo- topic, visit our Digital Library at http://www.computer.
cuses on high-speed, low-power VLSI org/publications/dlib.