03 Ece621 F17 TX

Download as pdf or txt
Download as pdf or txt
You are on page 1of 58

ECE 621

Signaling & Synchronization


Fall 2017
Topic 3
TX Circuitry
Sameh A. Ibrahim
Ain Shams University
ICL
(Courtesy of S. Pamarti – UCLA, S. Palermo – TAMU,
and E. Alon – UCB)
Outline
 Signaling Basics
▪ Single-ended vs. differential
▪ Voltage-mode vs. current-mode
 Termination Circuits
▪ On-Chip vs. off-chip
▪ AC vs. DC coupled
▪ Active vs. passive
 TX Driver Circuits
▪ Large swing drivers
▪ Small swing drivers
▪ Pre-drivers
 Multiplexing
▪ Multiplexing depth
▪ Multiplexing circuits

TX Circuitry 2
Single-Ended Signaling
 Sources of signal corruption
▪ Return-path impedances,
ZR,TX and ZR,RX
▪ Supply coupling
impedances, ZG,TX and
ZG,RX
▪ Supply mismatch and
noise, VN
 Clean reference (VREF) generation is very difficult
▪ Tx and Rx grounds could be different.
▪ The difference, VN, has a significant high frequency content
because of supply noise generated by high speed switching.
 We can use an additional line to send the reference signal
▪ Matching becomes a problem.

TX Circuitry 3
SE Signaling for Multiple Links
 Finite supply impedance
causes significant
Simultaneous Switching
Output (SSO) noise.
▪ Imagine many TXs
switching at the same
time
 Necessitates large
amounts of decoupling
capacitance for supplies
and reference voltage
▪ Decap limits I/O area
more than circuitry
 Is it still used? Why?
TX Circuitry 4
Classic Debate
 Area constraints mandates single ended.
 “Differential must be twice as fast as single ended in
order to win”
 Reality more complicated
▪ Ex: SSO mandates the use of more ground and supply
pins for SE
▪ i.e. higher supply to signaling pin ratio in SE
 Differential is a lot easier to build and get right the first
time.
 SE can be built to work but needs more care and is
more painful.
 Differential links can possibly save power.

TX Circuitry 5
Differential Signaling

 Use of two transmission lines per I/O link


 No reference voltage is used to extract data. ☺
 Swing is twice as large as SE signaling. ☺
 No return path corruption ☺
 Minimum self-induced power supply noise ☺
 Matching between lines is crucial 
 Low pin/wire utilization is becoming a problem. 
TX Circuitry 6
Voltage-Mode Signaling

 Output Impedance is low.


 Older systems used ZTX << Zo.
 Newer Systems use ZTX = Zo to minimize reflections.
 Thevenin-equivalent series termination

TX Circuitry 7
Current-Mode Signaling

 Output Impedance is high.


 Older systems used ZTX >> Zo.
 Newer Systems use ZTX = Zo to minimize reflections.
 Norton-equivalent parallel termination

TX Circuitry 8
Old “Voltage-Mode” vs. “Current-Mode”

TX Circuitry 9
Gb/s “Voltage-Mode” vs. “Current-Mode”
Voltage-Mode Current-Mode
Single-Ended
Differential

TX Circuitry 10
Voltage-Mode vs. Current-Mode Summary

 An ideal voltage-mode driver with differential RX


termination enables a potential 4x reduction in driver
power.
 Finally, since TL has both voltage and current,
terminology can mean
▪ Whether or not the termination is high
▪ How termination is set
▪ What sets the output swing (supply or current source)
TX Circuitry 11
Outline
 Signaling Basics
▪ Single-ended vs. differential
▪ Voltage-mode vs. current-mode
 Termination Circuits
▪ On-Chip vs. off-chip
▪ AC vs. DC coupled
▪ Active vs. passive
 TX Driver Circuits
▪ Large swing drivers
▪ Small swing drivers
▪ Pre-drivers
 Multiplexing
▪ Multiplexing depth
▪ Multiplexing circuits

TX Circuitry 12
On-Chip vs. Off-Chip Termination

 Package parasitics act as an unterminated stub which


sends reflections back onto the line.
 On-chip termination makes package inductance part of
transmission line.
 But on-chip termination needs control.

TX Circuitry 13
AC vs. DC-Coupled Termination
 DC coupling allows for
uncoded data.
 RX common-mode set by
transmitter signal level.

 AC coupling allows for


independent RX
common-mode level.
 Data must be coded
because of channel’s low
frequency cut-off.

TX Circuitry 14
Passive Termination
 Choice of integrated resistors involves trade-offs in
manufacturing steps, sheet resistance, parasitic
capacitance, linearity, and ESD tolerance.
 Integrated passive termination resistors are typically
realized with unsilicided poly, diffusion, or n-well
resistors.
 Poly resistors are typically used due to better linearity,
but they typically vary +/-30% over process and
temperature.

TX Circuitry 15
Active Termination

 Transistors must be used for


termination in CMOS
processes with no resistors.
 Triode-based FET works well
for low-swing (<500 mV)
▪ Adding a diode-connected
FET increases linear range.
 Pass-gate structure allows for differential termination.
 Not ESD robust, needs adjustment loop (Analog or
Digital), and adds extra capacitance.
TX Circuitry 16
Combination of Active and Passive

 Fixed resistor and digitally-controlled resistor to


account for variations
 Better ESD robustness and linearity
 Device capacitances are partially shielded.
 But adjustment loop is still required.

TX Circuitry 17
Termination Capacitance
 Many parasitic caps: ~500 fF ESD and pads and ~500 fF
driver.
 This gives a BW of ~ 6 GHz for double-terminated links.

50

1000f
Z

TX Circuitry 18
T-Coils
 Old technology recently used in
bandwidth extension and ESD
capacitance compensation.
[Schmatz JSSC Dec. 2008]
[Allstot]
 T-coil consists of two mutually
coupled inductor and a bridged
capacitor.

50

1000f
Z

TX Circuitry 19
Distributed ESD
 At higher data rates (ex 40 Gb/s), even T-coil Extension is
not enough.
 Distributed ESD is a possible solution.
 More on-chip inductors are used but with smaller values.
 Delay through transmission line sections can help
implement equalizers at RX side.
 Impedance of inductors at ESD events is negligible (low-
frequency events).

[Navid, JSSC Apr. 2015]


TX Circuitry 20
Outline
 Signaling Basics
▪ Single-ended vs. differential
▪ Voltage-mode vs. current-mode
 Termination Circuits
▪ On-Chip vs. off-chip
▪ AC vs. DC coupled
▪ Active vs. passive
 TX Driver Circuits
▪ Large swing drivers
▪ Small swing drivers
▪ Pre-drivers
 Multiplexing
▪ Multiplexing depth
▪ Multiplexing circuits

TX Circuitry 21
Conceptual I/O Transmitter

 Multiplexers and
Synchronizers
▪ To facilitate data exchange
between sources and the
link
 Driver  Goals
▪ Generates signals suitable ▪ High bit rate
for signaling ▪ Low power
 Pre-driver consumption
▪ Condition the driver e.g., ▪ Low noise, free of
level shifting etc. unnecessary time-
domain spikes
▪ Low coupling to other
links
TX Circuitry 22
Main Driver Circuit Choices

 Single Ended or Differential


 Voltage mode or current mode
 On-chip or off-chip supplies
 Source termination

 Each choice has an implication on signal fidelity,


power consumption, noise coupling, and data rate.

TX Circuitry 23
A Very Simple Large Swing Driver

 Characteristics
▪ Push-pull architecture
▪ Large signal amplitude, Vswing = VDD
 Supply noise contribution is very large.
▪ VDD on chip can be very noisy
 Reflections are significant due to lack of source
termination
 Large crowbar current
 Causes significant power consumption
TX Circuitry 24
Another Simple Large Swing Driver

 Break-before-make connection
▪ Ensures that NMOS and PMOS are not simultaneously
ON, even for short duration
 Supply noise, reflection problems remain
 These are mainly legacy I/Os
▪ Good for low data rate, generic interfaces with other ICs

TX Circuitry 25
Source Terminated Large Swing Driver

 Add a resistance in series to provide source termination.


▪ Prevents some reflection
 Actual source impedance is a series combination of RTERM
and ZNMOS or ZPMOS.
▪ Increase NMOS and PMOS sizes to maintain switching
speed performance.
 Difficult to terminate precisely
▪ Transistor impedance varies with region of operation and
PVT corners.
TX Circuitry 26
Dedicated Driver Supplies

 VSWING = VDD,I/O > Core-VDD is possible.


▪ I/O signals are better protected from noise on core
supplies.
 The driver supplies could be on-chip or off-chip.
 Large DC driver currents are possible
▪ Depends on relation between Core-VDD and VDD,I/O
TX Circuitry 27
A Very Simple Low Swing Driver

 Replace the pull-up PMOS with a resistance.


▪ Signal swings between VDD,I/O and VDD,I/O – IDSR
 Pull-up time constant is determined by R.
▪ Small R results in quick pull-up.
 Received signal is smaller than with large swing drivers.
▪ Threatens receiver margins and could increase BER.
 More susceptible to variations in R and Idrv.
TX Circuitry 28
Why Low Swing Drivers?

 Higher speed
▪ Devices switch faster.
▪ Smaller devices and hence lower capacitance
 Better linearity
▪ Driver transistor stays in a single region of operation
when ON.
 Lower power
▪ Power is proportional to swing e.g. 0.5CLV2f
TX Circuitry 29
Low Swing Differential Drivers

 Differential pair steers the current into a differential I/O


link.
 High output impedance can cause reflection problems
on the source side.
 Pre-driver works hard
▪ Currents need to be steered completely.
TX Circuitry 30
Terminated Low Swing Differential Drivers

CML Driver

 ZTX are actual termination resistors.


▪ Driver is based on current mode logic. [Mizuno JSSC Jun. 1996]
▪ Triode PMOS could be used instead.
▪ Additional power consumption
 Doubly terminated differential signaling is popular
among very high speed signaling.

TX Circuitry 31
Current-Mode Logic (CML) Driver

 Used in most high performance serial links.


 Low voltage operation relative to push-pull driver
▪ High output common-mode keeps current source saturated.
 Can use DC or AC coupling
 Differential pp RX swing is ±IR/2 with double termination

TX Circuitry 32
CML Power Consumption
VTT
𝑃 = 𝐼𝑏 𝑉𝑇𝑇 RT/2 RT/2
𝑉𝑆𝑊,𝑑𝑖𝑓𝑓 = 𝐼𝑏 𝑅𝑇
𝐼𝑏 𝑅𝑇 D Db
𝑉𝑆𝑊,𝑎𝑚𝑝 =
2
2𝑉𝑆𝑊,𝑎𝑚𝑝 Ib
𝐼𝑏 =
𝑅𝑇
2
𝑃 = 𝑉𝑆𝑊,𝑎𝑚𝑝 𝑉𝑇𝑇
𝑅𝑇
𝑉𝑇𝑇,𝑚𝑖𝑛 = 2𝑉𝐷𝑆𝐴𝑇 + 𝑉𝑆𝑊,𝑎𝑚𝑝
2 2𝑉𝐷𝑆𝐴𝑇 2
𝑃𝑚𝑖𝑛 = 𝑉𝑆𝑊,𝑎𝑚𝑝 1+
𝑉𝑆𝑊,𝑎𝑚𝑝 𝑅𝑇

TX Circuitry 33
Lower Power Using Voltage-Mode Drivers
 Voltage-mode driver implementation depends on
output swing requirements.
 For low-swing (<400-500mVpp), an all NMOS driver is
suitable.
 For high-swing, CMOS driver is used.

TX Circuitry 34
VM Driver Power Consumption
VTT
𝑉𝑇𝑇
RT 𝑉𝑆𝑊,𝑎𝑚𝑝 =
2
𝑉𝑇𝑇
+ 𝐼𝑉𝑇𝑇 =
4𝑅𝑇
2RT 1
𝑃 = 𝑉𝑆𝑊,𝑎𝑚𝑝 𝑉𝑇𝑇 ¼ CML
- 2𝑅𝑇

2 1
RT 𝑃 = 𝑉𝑆𝑊,𝑎𝑚𝑝
𝑅𝑇
 However
▪ Termination is set by devices
▪ Impedance control loops are complicated
▪ Matching pull-up and pull-down is difficult.
TX Circuitry 35
Low-Swing VM Driver Example

[Poulton JSSC Dec. 2007]

 A linear regulator sets the output stage supply, VS.


 Termination is implemented by output NMOS transistors.
 To compensate for PVT and varying output swing levels,
the pre-driver supply is adjusted with a feedback loop.
 The top and bottom output stage transistors need to be
sized differently, as they see a different VDSAT.
TX Circuitry 36
High-Swing VM Driver Example

[Kossel JSSC Dec. 2008]

 High-swing voltage-mode driver termination is


implemented with a combination of output driver
transistors and series resistors.
 To meet termination resistance levels (50 Ω), large
output transistors are required.
▪ Degrades potential power savings.

TX Circuitry 37
TX Swing Control

 Uncertainty in RTERM and IB can degrade output swing


▪ In low-swing drivers, this could be a big problem.
 Extend RTERM control to control the swing too.
 Control is done in two steps
▪ Impedance control
▪ I-control by adaptively changing bias current.
 Typically, a replica driver is used.
TX Circuitry 38
TX Slew-Rate Control
 Output transition times should be controlled.
▪ Too Slow
• Limits maximum data rate
▪ Too fast
• Can excite resonant circuits, resulting in ISI due to ringing
• Cause excessive cross-talk because of large di/dt

 Slew rate control reduces reflections and cross-talk.

 Controlling pre-driver is an effective means of


controlling driver output slew-rate.

TX Circuitry 39
Slew-Rate Control Implementation
 Idea
▪ Break the driver into segments.
▪ Turn on each segment sequentially.
 Delay element
▪ Transistor-based doesn’t track PVT changes well.
▪ Use poly resistor that is relatively constant.
▪ Use a timing element to sequentially time the switching.

TX Circuitry 40
Pre-Drivers
 Pre-drivers are needed for
▪ Capacitance driving
▪ Level shifting
▪ Timing control
▪ Transient waveform improvement
▪ Up and down impedance matching

 A pre-driver is similar to the driver but with much lower


current.

TX Circuitry 41
Pre-Driver Level Shifting

 If swing Vs > Vdd, the pre-driver needs to level shift the


output
▪ If not, the PMOS will not shut off completely when
needed.
 Level shifting can be achieved by a “pump” circuit.
 Reliability issues
▪ Cross-coupled PMOS devices need to have their wells
tied to external voltage.
TX Circuitry 42
Pre-Drivers for CML
 Ideal waveforms for VTT
the CML driver will
cause big drops on the RT/2 RT/2
tail current voltage and D
current. D Db
X
 This affects overall Db
performance. VX Ib
 Skewed inverters can
be used as pre-drivers.
 CML pre-drivers can D VDD
be used as well to
have a smaller input Db Db D
swing.
▪ Low current
TX Circuitry 43
Outline
 Signaling Basics
▪ Single-ended vs. differential
▪ Voltage-mode vs. current-mode
 Termination Circuits
▪ On-Chip vs. off-chip
▪ AC vs. DC coupled
▪ Active vs. passive
 TX Driver Circuits
▪ Large swing drivers
▪ Small swing drivers
▪ Pre-drivers
 Multiplexing
▪ Multiplexing depth
▪ Multiplexing circuits

TX Circuitry 44
TX Circuit Speed Limitations
 High-speed links can be limited by both the channel
and the circuits.
 Clock generation and distribution is key circuit
bandwidth bottleneck.
 Multiplexing circuitry also limits maximum data rate.

TX Circuitry 45
Full-Rate Multiplexing
 Tree-mux architecture with
cascaded 2:1 stages is
often used.
 Full-rate architecture
relaxes clock duty-cycle,
but limits max data rate.
▪ Needs to generate and
distribute high-speed
clock.
▪ Needs to design high-
speed flip-flop.
 CML logic is sometimes
used in last stages.
▪ Minimize CML to save
power. TX Circuitry 46
Clock Distribution Speed Limitations
 Max clock frequency that can be
efficiently distributed is limited by
clock buffers ability to propagate
narrow pulses.
 CMOS buffers are limited to a min
clock period near 8FO4 inverter
delays.
▪ About 4GHz in typical 90nm CMOS
▪ Full-rate architecture limited to this
data rate in Gb/s.
 Need a faster clock, use faster clock
buffers.
▪ CML
▪ CML w/ inductive peaking
TX Circuitry 47
Multiplexing Techniques – ½ Rate
 Full-rate architecture is
limited by maximum clock
frequency to 8FO4 Tb.
 To increase data rates
eliminate final retiming and
use multiple phases of a
slower clock to mux data.
 Half-rate architecture uses 2
clock phases separated by
180° to mux data.
▪ Allows for 4FO4 Tb
▪ 180° phase spacing (duty
cycle) critical for uniform
output eye
TX Circuitry 48
Half-Rate Multiplexing Timing Waveforms

 Half-rate architecture
eliminates high-speed
clock and flip-flop.
 Output eye is sensitive
to clock duty cycle.
 Critical path no longer
has flip-flop setup
time.

TX Circuitry 49
2:1 CMOS Multiplexer

 2:1 CMOS mux is able to propagate a minimum pulse


near 2FO4 Tb.
 However, with a ½-rate architecture it is still limited by
clock distribution to 4FO4 Tb.
▪ 8Gb/s in typical 90nm

TX Circuitry 50
2:1 CML Multiplexer

 CML mux can achieve higher speeds due to reduced self-


loading factor.
▪ Cost is higher power consumption that is independent of
data rate (static current).

TX Circuitry 51
Increasing Multiplexing Factor – ¼ Rate

 Increase multiplexing factor


to allow for lower frequency
clock distribution.
 ¼-rate architecture
▪ 4-phase clock distribution
spaced at 90° allows for
2FO4 Tb.
▪ 90° phase spacing and
duty cycle critical for
uniform output eye.

TX Circuitry 52
Mux Speed vs. Fan-in
 Higher fan-in muxes run slower
due to increased cap at mux node.
 1/4-rate architecture
▪ 4:1 CMOS mux can potentially
achieve 2FO4 Tb with low fanout.
• An aggressive CMOS-style
design has potential for 16Gb/s in
typical 90nm CMOS
 1/8-rate architecture
▪ 8-phase clock distribution spaced
at 45° allows for 1FO4 Tb
▪ No way a CMOS mux can achieve
this!!

TX Circuitry 53
Current-Mode Output-Multiplexed
 8:1 current-mode mux directly at
output pad.
 Makes sense if output time constant
smaller than on-chip time constant.
 Very sensitive to clock phase spacing.
 Yang achieved 6Gb/s in 0.35µm CMOS.
 Equivalent to 33Gb/s in 90nm CMOS
(now channel (not circuit) limited)

TX Circuitry 54
Current-Mode Input-Multiplexed

[Lee JSSC Nov. 2000]

 Reduces output capacitance relative to output-


multiplexed driver
▪ Easier to implement TX equalization
 Not sensitive to output stage current mismatches
 Reduces power due to each mux stage not having to be
sized to deliver full output current

TX Circuitry 55
Voltage-Mode Output-Multiplexed

[Song, JSSC May 2013]

 8:1 voltage-mode mux directly at output pad.


 Makes sense if output time constant smaller than on-chip
time constant.
 Allowed running from lower supply with lower jitter
performance.
 Song achieved 6.4Gb/s in 65nm CMOS with only 0.3 pJ/bit
excluding clocking power.
TX Circuitry 56
Assignment 2 (Transmitter Design)
 Due Date: 7/11/2014
 It is required to design a serial link transmitter similar
to the paper “A 2.8 mW/Gb/s, 14 Gb/s Serial Link
Transceiver” by Saxena in JSSC May 2017 with the
following parameter:
▪ Minimum data-rate: 8 Gb/s
▪ 16 random-input data streams
▪ Use of at most 2 independent external power supplies.
▪ Operating temperature: 25° C
▪ 0.25 V pp differential output swing
▪ You can assume a pad/pin capacitance of 250 fF and a
RX termination of 100 ohm differential.
▪ You can assume any missing specifications.
TX Circuitry 57
Submission Requirements

 A table of the design targets proposal (architecture,


signaling type, termination, clocking scheme, etc.)
 Transistor-level schematics
 A plot of the simulated eye diagram at the output of the
transmitter
 A table of the achieved transmitter characteristics
(current consumption of each block, S11, slew rate, etc.)

TX Circuitry 58

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy