Cmos, Analog, Design-41
Cmos, Analog, Design-41
Cmos, Analog, Design-41
1
24.1
A 12.5Gb/s SerDes in 65nm CMOS Using a BaudRate ADC with Digital Receiver Equalization and
Clock Recovery.
A key challenge in designing high-bandwidth systems such as datarouters and super-computers is to transferring large amounts of
data between ICs either on the same circuit board or between
boards. Analysis of typical backplane channel attenuation (-24dB)
and package losses (-1 to -2dB) in the presence of crosstalk predict
that an un-equalized transceiver provides inadequate performance
and that DFE is needed to achieve error rates of <10-17.
Traditional DFE methods for SerDes receivers rely on either modifying the input signal based on the data history [1-3] or on having
an adaptive analog slicing level [4]. This paper describes an alternative approach. The input data is sampled at the baud-rate, digitized, and the equalization and CDR are performed using numerical digital processing techniques. This approach enables
power/area scaling with process, simplifies production testing,
allows integration of a FFE, and provides a flexible design with a
configurable number of filter taps. Implemented in 65nm CMOS,
the 12.5Gb/s transceiver uses the baud-rate sampling ADC, a RX
with digital 2-tap FFE and digital 5-tap DFE to correct channel
impairments, and a TX with a 4-tap FIR filter to pre-compensate
for channel impairments.
The RX is shown in Fig. 24.1.1. The received data is digitized at the
baud-rate, 1.0 to 12.5Gb/s, using a pair of interleaved T/H stages
and 23-level (4.5b) full-flash ADCs. The 2 T/H circuits enable interleaving of the half-rate ADCs and reduce signal-related aperture
timing errors. Two ADCs, each running at 6.25Gb/s, provide baudrate quantization of the received data. The dynamic range of the
ADC is normalized to the full input amplitude using a 7b AGC circuit. A loss-of-signal indication is obtained by detecting an out-ofrange AGC. An optional attenuator is included in the termination
block to enable reception of large signals whilst minimizing signal
overload.
The detailed full-flash ADC is shown in Fig. 24.1.2(A). The first linear transconductor stage (24.1.2(B)) is optimized for handling large
input signals combined with low supply voltages. Resistive interpolation is used to reduce the input DNL. The second CML latchstage (C) is conventional, but optimized for speed and metastability performance. The final sense-amp latch (D) uses a cross-coupled
NMOS pair with PMOS loads. Cross-coupled inverters further
enhance the gain and reduce the meta-stability occurrence probability by adding hysteresis.
The output from the ADCs is fed into a custom DSP data-path that
performs the numerical FFE and DFE, as shown in Fig. 24.1.3(A).
The digital FFE/DFE is implemented using standard library gates.
An advantage of digital equalization is that it is straightforward to
include FFE as a delay-and-add function without any noise-sensitive analog delay elements. The FFE tap weight is selected to compensate for pre-cursor ISI and can be bypassed to reduce latency.
While many standards require pre-cursor de-emphasis at the TX,
inclusion at the RX allows improved BER performance with existing legacy transmitters.
The DFE uses an unrolled non-linear cancellation method [4]. The
FFE output is compared with a stored slicer-level to generate the
data output. The slicer-level is selected from one of 2n possible
options depending on the previous n bits of data. Unrolled tap
adaption is performed using an LMS method where the optimum
slicing position is defined to be the average of the 2 possible symbol amplitudes (1) when preceded by identical history bits.
Although 5-taps of DFE are chosen for this implementation, this
436
Figure 24.1.1: A 12.5Gb/s receiver with 2 ADCs, a 2-tap FFE and a 5-tap DFE.
Figure 24.1.2: A) Sub-sections of the ADC including 3 comparator sections, and B) the
linear transconductor.
24
Figure 24.1.4: Transmitter with weighted 4-tap FIR filter and current-summing CML
output stage.
437
Figure 24.1.6: Quad SerDes macro with the key components of a single lane and the
central PLL indicated.
613