Razavi 2018
Razavi 2018
Behzad Razavi
D
Delay-locked loops (DLLs) can be
considered as feedback circuits that
phase lock an output to an input
without the use of an oscillator. In
some applications, DLLs are neces-
to adjust the delay and force DT
toward zero. This conjecture leads
us to the arrangement depicted in
Fig u r e 2(c). He r e , a
phase detector mea-
of the phase/frequency detector (PFD),
charge pump (CP), and capacitor pro-
vides an infinite gain, thus driving the
skew toward zero. The
variable-delay stage is
sary or preferable over phase-locked sures the skew and The origins of realized as a voltage-
loops (PLLs), with their advantages adjusts t he delay of DLLs can be controlled delay line
including lower sensitivity to supply B 2 t o r e d u c e DT. A s traced to a paper (VCDL). Figure 2(e)
noise and lower phase noise. This with PLLs, the low-pass published shows an example of
article deals with fundamental DLL filter attenuates the in 1961. VCDL design emp
design concepts. high-frequency compo- loying varactors for de-
The origins of DLLs can be traced nents generated by the lay control. While the
to a paper published in 1961 [1]. The PD. This circuit exemplifies a simple DLL does not require frequency
authors present the topology shown delay-locked loop. detection, the PFD provides a con-
in Figure 1 as a “delay-lock discrimi- The residual phase error in Fig- venient interface with the CP. As ex-
nator” operating on random signals. ure 2(c) depends on the loop gain, i.e., the plained next, no resistor is necessary
The feedback loop consists of a con- gain of the PD, K PD, and the gain of the in series with C 1 . This DLL archi-
trolled delay line, a multiplier acting variable-delay stage. The latter is de- tecture is commonly used in high-
as a phase detector (PD), and a low- fined as K DL = 2z/2Vcont, where z is speed systems.
pass filter. The use of DLLs in mod- the stage’s delay in rad ia ns. Rath- The DLL of Figure 2(d) is of first
ern CMOS design evidently began er than attempt to maximize order, facing no stability issues. More-
with the work by Bazes in 1985 [2] K PD K DL, we can add an integrator to over, it benefits from the lower phase
and Johnson and Hudson in 1988 [3]. the loop. Drawing upon our knowl- noise and supply sensitivity of delay
e d g e o f PLLs, we thus construct lines compared to oscillators.
Basic Idea the architecture shown in Figure 2(d), In contrast to PLLs, delay-locked
Suppose, as shown in Figure 2(a), an where the cascade consisting loops do not generate a frequency;
input clock travels on a long inter-
connect, experiencing a significant
skew, DT. How do we align CK out with
Received Waveform Multiplier
CK in ? Since the clock is periodic, As [t + T (t )] + n (t )
we surmise that an additional delay x (t ) Low-Pass Delay Estimate
can be introduced to make the total Filter F (p )
"
aT (t )
delay equal to one clock cycle [Fig-
ure 2(b)]. To set the delay properly, ks′ [t + T (t )]
we can view DT as an error that must
be suppressed by means of negative Delay Control
feedback. That is, if the phase of
CK out is compared to that of CK in, Reference Controlled Delay Line
the resulting er ror ca n be used Differentiate Gain gd = 1/α sec/Volt
ks (t )
ks′ (t )
Digital Object Identifier 10.1109/MSSC.2018.2844615
Date of publication: 13 August 2018 Figure 1: An early DLL reported in [1].
t
∆T
(a)
Variable−Delay
Stage CKin
CKin B1 B2 CKout
CKout
Vcont
TCK t
(b)
VCDL
R1 Vcont Vcont
PD PFD CP
C1 C1
(c) (d)
CKin CKout
Vcont
(e)
Figure 2: (a) An interconnect with skew, (b) the correction of skew by a delay stage, (c) a simple feedback system for controlling the delay
line, (d) a basic DLL, and (e) a VCDL implementation example.
rather, they simply delay the input. Loop Dynamics time constant, allowing the approxi-
As such, DLLs are less versatile than We wish to analyze the dynamic be mation exp (- TCK s) . 1.
PLLs. For example, in practice, a DLL havior of the DLL shown in Figure 3(a). How about the path from Vcont
would not be able to generate a 5-GHz In the locked state, the phase differ- to CK out ? If we apply a step at Vcont
clock from a 20-MHz reference. ence between CK in and CK out is con- in Figure 2(e), how long does it take
Another drawback of DLLs is stant and, in principle, equal to zero. to affect the output phase? From the
that they allow the input duty cycle Thus, the VCDL provides a delay of waveforms shown in Figure 3(c), we
error to propagate to the output. one clock period, TCK . recognize that this path too has a
In fact, the delay line may further Before delving into the overall delay of at most one TCK . Based on
increase this error. Thus, the VCDL loop dynamics, let us understand these observations, we can construct
is typically preceded or followed those of the VCDL itself. The circuit an approximate, static model for the
by a duty cycle correction stage. A has a clock input and a control input. VCDL; as shown in Figure 3(d), it sim-
third drawback of DLLs is that they What happens if CK in in Figure 2(e) ply adds a phase equal to K DL Vcont
operate the PFD a nd the CP at incurs a phase step? This step propa- to the input phase. (The one-cycle
high speeds. gates through the chain and emerges delay is neglected here.)
The dynamic behavior of DLLs at the output TCK seconds later [Fig- It is instructive to first examine
determines how they respond to such ure 3(b)]. That is, the transfer func- the overall DLL’s response qualita-
effects as input phase noise, and sup- tion associated with this path can be tively. If the phase of CK in in Fig-
ply noise. We therefore study this expressed as exp (-TCK s) . In practice, ure 3(a) fluctuates slowly, the DLL
behavior in the next section. TCK is much less than the overall DLL maintains a high loop gain, keeping
Input Phase
CKin VCDL CKout Step
φ out
φ in CKin
Vcont
PFD CP
CKout
C1
t
(a) (b)
CKin
Vcont
Vcont
t
(c) (d)
Figure 3: (a) A DLL with a VCDL, (b) the propagation of input phase step to output, (c) the propagation of step on Vcont to output, and
(d) a linear model of a VCDL.
CK out aligned with CK in . That is, the We should remark that some DLLs Effect of Supply Noise
closed-loop transfer function has apply an independent reference The principal effect of supply noise,
a unity magnitude for slow phase clock to the PFD and do not follow VDD (t), in DLLs is to modulate the
variations. Now suppose CK in expe- these dynamics [4]. delay of the VCDL. How does the DLL
riences very fast phase changes. The aforementioned study reveals of Figure 3(a) respond to VDD (t) ? If
Then, the DLL has little loop gain, two points: 1) DLLs do not generally the noise varies slowly, the loop has
Vcont d o e s n ot c h a n ge , a n d CK in face stability issues and can oper- enough “strength” to keep z out close to
simply propagates to CK out . In this ate with a wide range of values for z in, i.e., Vcont opposes VDD (t) and z out
case, too, the closed-loop response I p and C 1, and 2) the lack of filtering
is around unity because the input ability precludes the use of the fore-
phase changes appear at the output going DLLs in applications where the
VDD
with only a delay of TCK seconds. We input jitter must be removed. The
thus conclude that DLLs exhibit an latter issue is resolved by a different
KVDD
all-pass response, a point of contrast DLL architecture [4].
to the low-pass behavior of PLLs. φ out
φ in = 0
The all-pass nature of DLLs can also
be confirmed mathematically. For the
KDL
DLL of Figure 3(a), we draw the phase +
Ip
model as shown in Figure 4(a), noting that φ in φ out
– c1s Vcont
Vcont is given by (z in - z out) [I p / (C 1 s)],
where I p denotes the charge pump cur- KDL
+ (a)
rent, and hence Ip
φ out
c1s Vcont
Ip – VDD
z in + (z in - z out) K DL = z out .(1)
C1 s (a) KVDD
That is, φ out
φ in
K DL I p ω
- (z in - z out) = (z in - z out) , (2) IpKDL
C1 s
ω c1
(b) (b)
which implies z in = z out . In prac-
tice, the response exhibits a small Figure 4: (a) A linear model of DLL and Figure 5: (a) The supply noise in a DLL and
amount of peaking [Figure 4(b)] [4]. (b) the DLL phase response. (b) the DLL phase response.
Figure 9: (a) The DLL of Figure 8 with output inverters added and (b) uniform fanouts at all VCDL taps.
experience a nearly twofold change in it can do so if the phase difference this amount, thereby avoiding false
its delay as a function of PVT. between these two signals reaches lock. The filter preceding Vcoarse sup-
The multiphase DLL shown in Fig- 2TCK rather than TCK . As a result, presses the ripple and noise present
ure 8 faces several issues. First, due the phase spacings will be equal to in Vcont1 . This architecture approxi-
to unequal loading, the phase spac- 2TCK /N. mately doubles the area and power
ings at the boundaries of the VCDL Avoiding false lock generally consumption. Another method is
can be different from those in the requires substantial added complex- described next.
middle. To understand this point, ity, especially if the DLL must oper-
consider the situation illustrated in ate across a wide frequency range.
Figure 9(a), where inverters Inv N - 1 Depicted in Figure 11 is a solution
and Inv N see different fanouts; the employ ing a PLL. A replica of the Start−Up Condition
former drives two inverters but the VCDL is configured as a ring oscil- 2TCK + ε
latter, an inverter and a PFD. As a lator and phase locked to the main VN
result, the phase difference between input, thus g ua ra nteeing that
VN - 2 and VN - 1 is not the same as that the delay from A to B is equal to After Lock
bet ween VN - 1 and VN . This issue TCK and hence Vcont1 reaches the 2TCK
is overcome as shown in Figure 9(b), desired value. Now, this voltage V0
with two inverters inserted at the serves as the coarse control for the
PF D inputs. A ssum ing a ll of t he main VCDL, allowing the DLL to pro- 2TCK
VN
inverters are identical, we observe vide only a fine adjustment through
that 1) the fanout at V4 is equal to Vfine . For example, if the two VCDLs t
that at V1, V2, and V3, and 2) the loop have a delay mismatch of 10%, then
drives the phase difference between Vfine must var y the delay by only Figure 10: The problem of false lock.
Va and Vb to zero, thus aligning V0
and V4 as well. One assumption here
is that the waveform arriving at V0
has approximately the same rise
and fall times as that at V4; other- CKin VCDL CKout
wise, the delays through Inv a and Vfine Vcoarse
Inv b are slightly different.
PFD CP
Another issue in the DLL of Fig- Replica VCDL
C1
ure 8 is the problem of “false lock.” A B
Assume the circuit is designed to
provide a total delay of TCK at the
PFD CP
typical-typical (TT), 27 c C corner. Vcont1
Now, suppose the DLL operates in
R1
the slow-slow (SS), high-temperature PLL
C1
corner, and, upon startup, the total
VCDL delay is slightly greater than
2TCK (Figure 10). Then, the DLL sim-
ply attempts to align V0 and VN , and Figure 11: The addition of a PLL to a DLL to avoid false lock.
(a) (b) t
Figure 12: (a) An edge-combining circuit for frequency multiplication and (b) its waveforms.
C2 R2
RF
but shifted by TCK /2, we conclude in Vcont . What is the effect of this 2) Suppose the Tow –Thomas bi-
that V1 · V2 + V3 · V4 + V5 · V6 + V7 · V8 ripple on the output waveform? quad of Figure 16 senses a large,
is a signal with four times the input narrowband undesired chan-
frequency. The implementation is Answers to Last Issue’s Questions nel at ~ = ~ -3dB . Which of the two
shown in Figure 13(b). In reality, the 1) Figure 15 shows a noninverting inte- integrators produces greater volt-
AND and OR gates are replaced grator. Derive the condition for the age swings and hence experiences
by NANDs. elements so that the circuit acts as more nonlinearity?
The multiplication factor in Fig- an ideal integrator. What is the prin- We have Vout /VX = - 1/ (R 3 C 2 s) .
ures 12 and 13 is difficult to change, cipal difficulty with this topology? The relative swings in the two
a point of contrast to PLLs. Moreover, W e h a v e VB . VA = Vout R 1 / integrator outputs depend on the
delay mismatches among the stages (R 1 +R 2). Also, (Vin -VB)/R 3 + (Vout - component values. For example,
give rise to jitter and spurs. VB) /R 4 = VB C 2 s. Thus, i f C 1 = C 2, R 2 = R 3 = R F , a n d
The frequency multiplication Q = 1, t h e n ~ n = 1/ (R 3 C 2) a n d
Vin = V R1 C2 s
ability of DLLs can be exploited to R3 out
R1 + R2 ~ -3dB = 1.27~ n = 1.27/ (R 3 C 2) .
detect false locking. Consider the T h a t i s , | Vout /VX | = 1/1.27. I n
+ Vout c R 1
architecture shown in Figure 14, R1 + R2 this case, the first integrator com-
R + R4
where an edge combiner multiplies (8) $ 3 - 1 m. presses first. If the undesired
R3 R4 R4
the frequency by a factor of N. This channel occurs at 2~ -3dB, we have
result, fmult, is then divided by N and For ideal integration, the second | Vout /VX | = 1/2.54, observing even
compared to fin . With correct lock- term on the right-hand side must a greater swing disparity.
ing, fmult = N fin, leading to a low aver- vanish, yielding R 2 /R 1 = R 4 /R 3 .
age value for the PFD output. In the Another perspective provides References
[1] J. J. Spilker and D. T. Magill, “The delay-
presence of false lock, on the other additional insight. If Vout is pro- lock discriminator: An optimum tracking
hand, the total delay from CK in to portional to the integral of Vin, device,” Proc. IEEE, vol. 49, pp. 1403–1416,
Sept. 1961.
CK out is equa l to or g reater tha n t h e n s o a r e VA a n d VB . T h u s , [2] M. Bazes, “A novel precision MOS synchro-
2TCK, and fmult < N fin . As a result, the I C2 = C 2 dVB /dt is also proportion nous delay line,” IEEE J. Solid-State Cir-
cuits, vol. 20, pp. 1265–1271, Dec. 1985.
PFD output exhibits a higher aver- a l to Vin, a condition that is met [3] M. G. Johnson and E. I. Hudson, “A vari-
age. The false lock flag can then be only if the Norton equivalent of the able delay line PLL for CPU-coprocessor
synchronization,” IEEE J. Solid-State Cir-
used to adjust the tuning range of circuit in the dashed box reduces cuits, vol. 23, pp. 1218–1223, Oct. 1988.
the delay line so that the total delay to an ideal current source. Since [4] M. J. E. Lee, W. J. Dally, T. Greer, H. T. Ng,
R. Farjad-Rad, J. Poulton, and R. Senthi-
remains less than 2TCK. the Norton resistance is given by nathan, “Jitter transfer characteristics of
R 3 | | R eq, we set R eq to - R 3 and delay-locked loops: Theories and design
techniques,” IEEE J. Solid-State Circuits,
Questions for the Reader hence obtain R 2 /R 1 = R 4 /R 3 . vol. 38, pp. 614–621, Apr. 2003.
1) Suppose the up and down currents The pr i n c ip a l issue here is [5] A. Homayoun and B. Razavi, “Relation
between delay line phase noise and ring
in the charge pump of Figure 2(d) that the circuit relies on equal oscillator phase noise,” IEEE J. Solid-State
have a mismatch of DI.How does positive and negative feedback Circuits, vol. 49, pp. 384–391, Feb. 2014.
[6] J. Sonntag and R. Leonowich, “A monolithic
the DLL react to this mismatch? factors and is prone to latch up CMOS 10 MHz DPLL for burst-mode data reti-
2) The CP imperfections in Fig- in the presence of component ming,” in Proc. Int. Solid-State Circuits Conf.
Dig. Tech. Papers, Feb. 1990, pp. 194–195.
ure 2(d) create a periodic ripple mismatches.