L2 - Complex Static CMOS Gates
L2 - Complex Static CMOS Gates
GATES
Static CMOS
● Pull-Up Network (PUN) and Pull-Down
Network (PDN)
o PUN and PDN are complementary
o PUN: PMOS devices only
o PDN: NMOS devices only
2
Example
3
Analysis of CMOS Gates
● Transistors in parallel → resistances
in parallel
o Effective resistance = R/2
o Effective width = 2W
8
Example: Transistor Sizing
● Draw circuit, size transistors for equal rise/fall times. Assume 2:1
mobility ratio and equal threshold.
9
Start from shortest path Start from longest path
• Smaller widths.
• Larger output cap.
10
Equivalent Inverter
● CMOS gates: many paths to Vdd and Gnd
o Multiple values for VM, VIL, VIH, etc
o Different delays for each input combination
● Equivalent inverter
o Represent each gate as an inverter with appropriate device width
o Include only transistors which are on or switching
o Calculate VM, delays, etc using inverter equations
11
Static Properties of Complementary CMOS Gates
● Symmetrical VTC
● No static power dissipation, since the circuits are designed such that
the pull-down and pullup networks are mutually exclusive
12
● Three possible input combinations switch the
output of the gate from high-to-low:
o (a) A = B = 0 ->1,
o (b) A = 0 -> 1, B = 1
o (c) A = 1, B = 0 -> 1,
13
Propagation Delay of Complementary CMOS
Gates
● For a 2 input NAND gate,
o If both inputs are driven low, the two PMOS devices are on. The delay in this case
is 0.69(Rp/2) CL, since the two resistors are in parallel.
o When only one PMOS device turns on, delay is given by 0.69(Rp)CL – Worst case
o For the pull-down path, the output is discharged only if both A and B are switched
high, and the delay is given by 0.69(2RN) CLto a first order.
● Adding devices in series slows down the circuit, and devices must be made
wider to avoid a performance penalty.
● When sizing the transistors in a gate with multiple fan-in’s, we should pick the
combination of inputs that triggers the worst-case conditions.
● For example, for a NAND gate to have the same pull-down delay (tphl ) as a
minimum-sized inverter, the NMOS devices in the NAND stack must be made
twice as wide so that the equivalent resistance the NAND pull-down is the
same as the inverter. The PMOS devices can remain unchanged.
16
● The delay dependence on input patterns
17
● The delay dependence on input patterns
18
NOR Gate
19
Propagation delay of complex gates
● In more complex logic gates that have large fan-in, the internal node
capacitances can become significant
tpHL = 0.69(R1C1 + (R1 + R2) C2 + (R1 + R2+ R3) C3 + (R1 + R2+ R3 + R4) CL)
M1 appears in all the terms, which makes this device important when
attempting to minimize delay.
20
Elmore Delay Model
● Consider an RC network with the
following properties
o The network has a single input
node (labeled s)
o All the capacitors are between a
node and the ground
o The network does not contain
any resistive loops.
● In this topology, there exists a unique resistive path between the source node s and
any node i of the network.
● For the circuit given, the Elmore delay at node 7 is given by:
τD7 = R1C1 + R1C2 + R1C3 + R1C4 + R1C5 + (R1 + R6) C6 + (R1 + R6+ R7) C7 + (R1 + R6+ R7) C8
● The Elmore delay at node 5 is given by:
τD5 = R1C1 + R1C6 + R1C7 + R1C8 + (R1 + R2) C2 + (R1 + R2) C3 + (R1 + R2+ R4) C4 + (R1 + R2+
R 4+ R 5) C 5
21
Fan In and Fan out
● A CMOS gate with N inputs requires 2N transistors.
o Other circuit styles require at most N+1 transistors, which can be a substantial
advantage in area, e.g., 8 versus 5 for a 4-input gate.
● The propagation delay of a complementary gate deteriorates rapidly as a
function of fan-in.
o Internal capacitance become significant
o First, the larger number of transistors increases the overall capacitance of the gate.
● For an N -input NAND gate, the output capacitance increases linearly with the fan-in, since
the number of PMOS devices connected to the output node increases linearly with the fan-
in.
o Second, the series connection in the PUN and PDN slows the gate because the
effective (dis)charging resistance increase linearly with the fan-in.
● Widening does not improve the performance as much as predicted, since widening
increases gate and diffusion capacitance.
22
Fan In and Fan out (Cont..)
● Fan-out in complementary gates has a larger impact on gate delay
than in other circuit styles.
o Downstream gate capacitance is always two per fan-out in contrast to one
in other styles.
23
Reduction of delays for Large Fan-in
● Transistor sizing
o Increasing size decreases the second-order factor in the tp expression.
o However, if load is dominated by intrinsic capacitance (self-loading),
propagation delay is not improved.
● increasing the transistor size, results in larger parasitic capacitors, which affect
the propagation delay of the gate and also present a larger load to the
preceding gate.
Distributed RC-line
25
Reduction of delays for Large Fan-in (Cont..)
● Transistor ordering
o Not all input signals to a gate arrive at the same time.
o An input signal to a gate is called critical if it is the last signal of all inputs
to assume a stable value.
o The path through the logic which determines the ultimate speed of the
structure is called the critical path.
● Putting the critical-path transistor closer to the output of the gate can
result in a speed-up.
26
Reduction of delays for Large Fan-in (Cont..)
● Logic Restructuring
o Partitioning large fan-in gates into small fan-in gates
27
● Buffering: Isolate Fan-in from Fan-out
28
Optimizing performance in
Combinational Networks - Logical Effort
● Systematic method for optimizing CMOS circuits for speed
o Balances gate delays and logic depth
o Enables systematic method of finding minimum sizes to achieve delay
specification
o Implicitly assumes switch RC model for propagation delay.
29
8-input AND
30
Review: Inverter Delay Model
● Inverter Delay Expression:
with tp0 represents the intrinsic delay of an inverter, and f the ratio between
the external load and the input capacitance of the gate. f is often called the
electrical effort .
31
Logical Effort
● For the complex gates, the basic delay equation of the inverter can be
modified as
32
Logical Effort (Cont..)
● The factor g is called the logical effort,
o For a given load, complex gates have to work harder than an inverter to
produce a similar response.
o The logical effort of a logic gate tells how much worse it is at producing
output current than an inverter, given that each of its inputs may contain
only the same input capacitance as the inverter.
● Reduced output current means slower operation, and thus the logical effort
number for a logic gate tells how much more slowly it will drive a load than
would an inverter.
o Logical effort is how much more input capacitance a gate presents to
deliver the same output current as an inverter.
o Logical effort is a useful parameter, because it depends only on circuit
topology
33
Logical Effort (Cont..)
34
Logical Effort (Cont..)
• The slope of the line is the logical
effort of the gate;
• Its intercept is the intrinsic delay.
• The graph shows that we can adjust
the delay by adjusting the effective
fanout (by transistor sizing) or by
choosing a logic gate with a different
logical effort.
• The fanout and logical effort
contribute to the delay in a similar
way.
• The gate effort/effort delay is given by
Delay as a function of fanout for an h = fg
inverter and a 2-input NAND
40
● Branching effort,
● Con-path is the load capacitance of the gate along the path we are
analyzing and Coff-path is the capacitance of the connections that lead
off the path.
B = b1b2b3……bN
41
● The path electrical effort can be written as
● The gate effort that minimizes the path delay is found to equal
42
8-input AND
43
● Case a: 2*(3.33F)1/2 + 9pinv
● Case b: 2*(3.33F)1/2 + 6pinv
● Case c: 4*(2.96F)1/4 + 7pinv
o pinv – Parasitic delay of the inverter
o Case b is always better than Case a
o When F = 1, Case b will be the best
o For higher values of F, Case c will be the best
How wide should the gates be for least delay?
● s1 is the sizing factor of the first gate in chain, the input capacitance of
the chain, Cg1 = g1s1Cref
45
Example
Consider the network given below. The output of the network is loaded
with a capacitance which is 5 times larger than the input capacitance of
the first gate, which is a minimum-sized inverter.
46
Effective fanout, F = 5
5 𝑐
∗ 1 = 1.93 ⇒ 𝑐 = 2.6 ∗ 5/3 = 1.93 ⇒ 𝑏 = 2.24
𝑐 𝑏
𝑏
∗ 5/3 = 1.93 ⇒ 𝑎 = 1.93
𝑎
Sizing of c => gcscCref = 2.6 => sc = 2.6
Sizing of b => gbsbCref = 2.24 => sb = 1.34
Sizing of a => gasaCref = 1.93 => sa = 1.16
47
● Select gate sizes x and y for least delay from A to B
● Logical Effort G = (4/3)*(5/3)*(5/3) = 100/27
● Electrical Effort F = 45/8
● Branching Effort B = 3 * 2 = 6
● Path Effort H = GBF = 125
● Best Stage Effort = H1/3 = 5
● Parasitic Delay P = 2 + 3 + 2 = 7
● Delay D = 3*5 + 7 = 22
● y = 45 * (5/3) / 5 = 15
● x = (15*2) * (5/3) / 5 = 10
● Now size last stage: x = 15
o 2 input NOR has unit input cap of 5
o To get an input cap of 15 => widths 15/5 = 3X unit!
● Now size 2nd stage: y = 10;
o 3 input NAND unit input cap of 5 => widths 10/5 = 2X unit!
● Now size 1st stage: Cin = 8;
o 2 input NAND has unit input cap of 4 => widths 8/4 = 2X
unit!
● For the circuit given below
o What is the path effort from In to Out?
o What electrical effort/stage minimizes the delay of this chain of gates?
o Size the gates to minimize the delay from In to Out
52
Power Consumption in CMOS Logic Gates
● The power dissipation is a strong function of
o Transistor sizing (which affects physical capacitance),
o Input and output rise/fall times (which affects the short-circuit power)
o Device thresholds and temperature (which affect leakage power)
o Switching activity
● Making a gate more complex mostly affects the switching activity α0->1,
which has two components:
o a static component that is only a function of the topology of the logic
network, and
o a dynamic one that results from the timing behavior of the circuit—the
latter factor is also called glitching.
54
● The transition activity is a strong function of the logic function being
implemented.
α0->1 = p0 (1 - p0)
● Assuming that the inputs are independent and uniformly distributed,
any N -input static gate has a transition probability
55
● Using a uniform input distribution to compute activity is not a good
one, since the propagation through logic gates can significantly modify
the signal statistics.
● For a 2-input static NOR gate, and let pa and pb be the probabilities
that the inputs A and B are one . Also assume that the inputs are not
correlated. The probability that the output node equals one is given by
56
α0->1
AND
OR
XOR
57
α0->1
AND (1-pApB)pApB
OR (1-pA)(1-pB)[1-(1-pA)(1-pB)]
59
Dynamic or Glitching Transitions
● The finite propagation delay from one logic block to the next can
cause spurious transitions, called glitches, critical races, or dynamic
hazards , to occur
o a node can exhibit multiple transitions in a single clock cycle before
settling to the correct logic level.
60
61
Design Techniques to Reduce
Switching Activity
● Glitch Reduction by balancing signal paths
62
Design Techniques to Reduce Switching Activity
(Cont..)
Logic Restructuring
63
Design Techniques to Reduce Switching Activity
(Cont..)
64
Design Techniques to Reduce Switching Activity
(Cont..)
● The chain implementation will have an overall lower switching activity
than the tree implementation for random inputs.
65
Design Techniques to Reduce Switching Activity
(Cont..)
● Input ordering
● Since both circuits implement identical logic functionality, it is clear that the
activity at the output node Z is equal in both cases.
● The difference is in the activity at the intermediate node. In the first circuit, this
activity equals (1 - 0.5 x 0.2) (0.5 x 0.2) = 0.09.
● In the second case, the probability that a 0 -> 1 transition occurs equals (1 –
0.2x0.1) (0.2 x 0.1)= 0.0196. This is substantially lower.
66
CMOS disadvantages
● For N-input CMOS gate, 2N transistors required
o Each input connects to an NMOS and PMOS transistor
o Large input capacitance: limits fan-out
o Large fan-in gates: always have long transistor stack
● in PUN or PDN
o Limits pull-up or pull-down delay
o Requires very large transistors
67
Ratioed Logic
● To reduce the number of transistors
required to implement a given logic
function,
o Reduced robustness
o Extra power dissipation.
68
pseudo-NMOS Logic
● Uses a grounded PMOS load, known as a
pseudo-NMOS gate.
● Reduced number of transistors (N +1 versus 2N
for complementary CMOS)
● VOH = VDD
● VOL ≠ 0
o There is a fight between the devices in the PDN
and the grounded PMOS load device.
o This results in reduced noise margins and more
importantly static power dissipation.
RATIOED LOGIC - The sizing of the load device relative to the pull-down
devices can be used to trade-off parameters such a noise margin ,
propagation delay and power dissipation.
• CMOS – Ratioless - The low and high levels do not depend upon transistor
sizes.
69
pseudo-NMOS Logic
● The value of VOL is obtained by equating the currents through the
driver and load devices for Vin = VDD
70
VTC of pseudo-NMOS inverter
71
Differential Cascode Voltage Switch Logic
(DCVSL)
72
DCVSL - Examples
73
DCVSL – Advantages
● The resulting circuit exhibits a rail-to-rail swing
● Both the output signal (Vout1) and its inverted value (Vout2) are
simultaneously available.
o This is a distinct advantage, as it eliminates the need for an extra inverter to
produce the complementary signal.
74
DCVSL – Disadvantages
● The circuit is still ratioed
o The sizing of the PMOS devices relative to the pull-down devices is critical
to functionality, not just performance.
75
Pass-Transistor Logic
● Implement logic by allowing the primary inputs to drive gate terminals
as well as source/drain terminals.
● When the pass transistor pulls a node high, the output only charges
up to VDD -VTn
76
Complementary/Differential pass-transistor logic
(CPL or DPL)
77
78
Pass-Transistor Logic- Solution 1:
Level Restoring Transistor
80
● For a rising transition at the output (step input)
o NMOS sat, PMOS sat until output reaches |VTP|
o NMOS sat, PMOS linear until output reaches VDD - VTN
o NMOS off, PMOS linear for the final VDD – VTN to VDD voltage swing
81
Equivalent resistance
● Equivalent resistance Req is
parallel combinaton of Req,n
and Req,p
82
2:1 Mux
83
Transmission Gate Full Adder
Propagate signal
85
Positive latch using
transmission gates
86
Edge-Triggered Flip-Flop
87
● Race condition may occur due to overlapping of clocks
88
Two-phase non-overlapping clocks
89
C2MOS Edge Triggered Flip-Flop
● Clocked CMOS
90
91
True Single-Phase Clocked Register (TSPCR)
92