03 Ece621 F17 TX
03 Ece621 F17 TX
03 Ece621 F17 TX
TX Circuitry 2
Single-Ended Signaling
Sources of signal corruption
▪ Return-path impedances,
ZR,TX and ZR,RX
▪ Supply coupling
impedances, ZG,TX and
ZG,RX
▪ Supply mismatch and
noise, VN
Clean reference (VREF) generation is very difficult
▪ Tx and Rx grounds could be different.
▪ The difference, VN, has a significant high frequency content
because of supply noise generated by high speed switching.
We can use an additional line to send the reference signal
▪ Matching becomes a problem.
TX Circuitry 3
SE Signaling for Multiple Links
Finite supply impedance
causes significant
Simultaneous Switching
Output (SSO) noise.
▪ Imagine many TXs
switching at the same
time
Necessitates large
amounts of decoupling
capacitance for supplies
and reference voltage
▪ Decap limits I/O area
more than circuitry
Is it still used? Why?
TX Circuitry 4
Classic Debate
Area constraints mandates single ended.
“Differential must be twice as fast as single ended in
order to win”
Reality more complicated
▪ Ex: SSO mandates the use of more ground and supply
pins for SE
▪ i.e. higher supply to signaling pin ratio in SE
Differential is a lot easier to build and get right the first
time.
SE can be built to work but needs more care and is
more painful.
Differential links can possibly save power.
TX Circuitry 5
Differential Signaling
TX Circuitry 7
Current-Mode Signaling
TX Circuitry 8
Old “Voltage-Mode” vs. “Current-Mode”
TX Circuitry 9
Gb/s “Voltage-Mode” vs. “Current-Mode”
Voltage-Mode Current-Mode
Single-Ended
Differential
TX Circuitry 10
Voltage-Mode vs. Current-Mode Summary
TX Circuitry 12
On-Chip vs. Off-Chip Termination
TX Circuitry 13
AC vs. DC-Coupled Termination
DC coupling allows for
uncoded data.
RX common-mode set by
transmitter signal level.
TX Circuitry 14
Passive Termination
Choice of integrated resistors involves trade-offs in
manufacturing steps, sheet resistance, parasitic
capacitance, linearity, and ESD tolerance.
Integrated passive termination resistors are typically
realized with unsilicided poly, diffusion, or n-well
resistors.
Poly resistors are typically used due to better linearity,
but they typically vary +/-30% over process and
temperature.
TX Circuitry 15
Active Termination
TX Circuitry 17
Termination Capacitance
Many parasitic caps: ~500 fF ESD and pads and ~500 fF
driver.
This gives a BW of ~ 6 GHz for double-terminated links.
50
1000f
Z
TX Circuitry 18
T-Coils
Old technology recently used in
bandwidth extension and ESD
capacitance compensation.
[Schmatz JSSC Dec. 2008]
[Allstot]
T-coil consists of two mutually
coupled inductor and a bridged
capacitor.
50
1000f
Z
TX Circuitry 19
Distributed ESD
At higher data rates (ex 40 Gb/s), even T-coil Extension is
not enough.
Distributed ESD is a possible solution.
More on-chip inductors are used but with smaller values.
Delay through transmission line sections can help
implement equalizers at RX side.
Impedance of inductors at ESD events is negligible (low-
frequency events).
TX Circuitry 21
Conceptual I/O Transmitter
Multiplexers and
Synchronizers
▪ To facilitate data exchange
between sources and the
link
Driver Goals
▪ Generates signals suitable ▪ High bit rate
for signaling ▪ Low power
Pre-driver consumption
▪ Condition the driver e.g., ▪ Low noise, free of
level shifting etc. unnecessary time-
domain spikes
▪ Low coupling to other
links
TX Circuitry 22
Main Driver Circuit Choices
TX Circuitry 23
A Very Simple Large Swing Driver
Characteristics
▪ Push-pull architecture
▪ Large signal amplitude, Vswing = VDD
Supply noise contribution is very large.
▪ VDD on chip can be very noisy
Reflections are significant due to lack of source
termination
Large crowbar current
Causes significant power consumption
TX Circuitry 24
Another Simple Large Swing Driver
Break-before-make connection
▪ Ensures that NMOS and PMOS are not simultaneously
ON, even for short duration
Supply noise, reflection problems remain
These are mainly legacy I/Os
▪ Good for low data rate, generic interfaces with other ICs
TX Circuitry 25
Source Terminated Large Swing Driver
Higher speed
▪ Devices switch faster.
▪ Smaller devices and hence lower capacitance
Better linearity
▪ Driver transistor stays in a single region of operation
when ON.
Lower power
▪ Power is proportional to swing e.g. 0.5CLV2f
TX Circuitry 29
Low Swing Differential Drivers
CML Driver
TX Circuitry 31
Current-Mode Logic (CML) Driver
TX Circuitry 32
CML Power Consumption
VTT
𝑃 = 𝐼𝑏 𝑉𝑇𝑇 RT/2 RT/2
𝑉𝑆𝑊,𝑑𝑖𝑓𝑓 = 𝐼𝑏 𝑅𝑇
𝐼𝑏 𝑅𝑇 D Db
𝑉𝑆𝑊,𝑎𝑚𝑝 =
2
2𝑉𝑆𝑊,𝑎𝑚𝑝 Ib
𝐼𝑏 =
𝑅𝑇
2
𝑃 = 𝑉𝑆𝑊,𝑎𝑚𝑝 𝑉𝑇𝑇
𝑅𝑇
𝑉𝑇𝑇,𝑚𝑖𝑛 = 2𝑉𝐷𝑆𝐴𝑇 + 𝑉𝑆𝑊,𝑎𝑚𝑝
2 2𝑉𝐷𝑆𝐴𝑇 2
𝑃𝑚𝑖𝑛 = 𝑉𝑆𝑊,𝑎𝑚𝑝 1+
𝑉𝑆𝑊,𝑎𝑚𝑝 𝑅𝑇
TX Circuitry 33
Lower Power Using Voltage-Mode Drivers
Voltage-mode driver implementation depends on
output swing requirements.
For low-swing (<400-500mVpp), an all NMOS driver is
suitable.
For high-swing, CMOS driver is used.
TX Circuitry 34
VM Driver Power Consumption
VTT
𝑉𝑇𝑇
RT 𝑉𝑆𝑊,𝑎𝑚𝑝 =
2
𝑉𝑇𝑇
+ 𝐼𝑉𝑇𝑇 =
4𝑅𝑇
2RT 1
𝑃 = 𝑉𝑆𝑊,𝑎𝑚𝑝 𝑉𝑇𝑇 ¼ CML
- 2𝑅𝑇
2 1
RT 𝑃 = 𝑉𝑆𝑊,𝑎𝑚𝑝
𝑅𝑇
However
▪ Termination is set by devices
▪ Impedance control loops are complicated
▪ Matching pull-up and pull-down is difficult.
TX Circuitry 35
Low-Swing VM Driver Example
TX Circuitry 37
TX Swing Control
TX Circuitry 39
Slew-Rate Control Implementation
Idea
▪ Break the driver into segments.
▪ Turn on each segment sequentially.
Delay element
▪ Transistor-based doesn’t track PVT changes well.
▪ Use poly resistor that is relatively constant.
▪ Use a timing element to sequentially time the switching.
TX Circuitry 40
Pre-Drivers
Pre-drivers are needed for
▪ Capacitance driving
▪ Level shifting
▪ Timing control
▪ Transient waveform improvement
▪ Up and down impedance matching
TX Circuitry 41
Pre-Driver Level Shifting
TX Circuitry 44
TX Circuit Speed Limitations
High-speed links can be limited by both the channel
and the circuits.
Clock generation and distribution is key circuit
bandwidth bottleneck.
Multiplexing circuitry also limits maximum data rate.
TX Circuitry 45
Full-Rate Multiplexing
Tree-mux architecture with
cascaded 2:1 stages is
often used.
Full-rate architecture
relaxes clock duty-cycle,
but limits max data rate.
▪ Needs to generate and
distribute high-speed
clock.
▪ Needs to design high-
speed flip-flop.
CML logic is sometimes
used in last stages.
▪ Minimize CML to save
power. TX Circuitry 46
Clock Distribution Speed Limitations
Max clock frequency that can be
efficiently distributed is limited by
clock buffers ability to propagate
narrow pulses.
CMOS buffers are limited to a min
clock period near 8FO4 inverter
delays.
▪ About 4GHz in typical 90nm CMOS
▪ Full-rate architecture limited to this
data rate in Gb/s.
Need a faster clock, use faster clock
buffers.
▪ CML
▪ CML w/ inductive peaking
TX Circuitry 47
Multiplexing Techniques – ½ Rate
Full-rate architecture is
limited by maximum clock
frequency to 8FO4 Tb.
To increase data rates
eliminate final retiming and
use multiple phases of a
slower clock to mux data.
Half-rate architecture uses 2
clock phases separated by
180° to mux data.
▪ Allows for 4FO4 Tb
▪ 180° phase spacing (duty
cycle) critical for uniform
output eye
TX Circuitry 48
Half-Rate Multiplexing Timing Waveforms
Half-rate architecture
eliminates high-speed
clock and flip-flop.
Output eye is sensitive
to clock duty cycle.
Critical path no longer
has flip-flop setup
time.
TX Circuitry 49
2:1 CMOS Multiplexer
TX Circuitry 50
2:1 CML Multiplexer
TX Circuitry 51
Increasing Multiplexing Factor – ¼ Rate
TX Circuitry 52
Mux Speed vs. Fan-in
Higher fan-in muxes run slower
due to increased cap at mux node.
1/4-rate architecture
▪ 4:1 CMOS mux can potentially
achieve 2FO4 Tb with low fanout.
• An aggressive CMOS-style
design has potential for 16Gb/s in
typical 90nm CMOS
1/8-rate architecture
▪ 8-phase clock distribution spaced
at 45° allows for 1FO4 Tb
▪ No way a CMOS mux can achieve
this!!
TX Circuitry 53
Current-Mode Output-Multiplexed
8:1 current-mode mux directly at
output pad.
Makes sense if output time constant
smaller than on-chip time constant.
Very sensitive to clock phase spacing.
Yang achieved 6Gb/s in 0.35µm CMOS.
Equivalent to 33Gb/s in 90nm CMOS
(now channel (not circuit) limited)
TX Circuitry 54
Current-Mode Input-Multiplexed
TX Circuitry 55
Voltage-Mode Output-Multiplexed
TX Circuitry 58