# Analysis and Design of a Low-Voltage Low-Power Double-Tail Comparator

Samaneh Babayan-Mashhadi, Student Member, IEEE, and Reza Lotfi, Member, IEEE

Abstract—The need for ultra low-power, area efficient, and high speed analog-to-digital converters is pushing toward the use of dynamic regenerative comparators to maximize speed and power efficiency. In this paper, an analysis on the delay of the dynamic comparators will be presented and analytical expressions are derived. From the analytical expressions, designers can obtain an intuition about the main contributors to the comparator delay and fully explore the tradeoffs in dynamic comparator design. Based on the presented analysis, a new dynamic comparator is proposed, where the circuit of a conventional doubletail comparator is modified for low-power and fast operation even in small supply voltages. Without complicating the design and by adding few transistors, the positive feedback during the regeneration is strengthened, which results in remarkably reduced delay time. Post-layout simulation results in a 0.18-µm CMOS technology confirm the analysis results. It is shown that in the proposed dynamic comparator both the power consumption and delay time are significantly reduced. The maximum clock frequency of the proposed comparator can be increased to 2.5 and 1.1 GHz at supply voltages of 1.2 and 0.6 V, while consuming 1.4 mW and 153  $\mu$ W, respectively. The standard deviation of the input-referred offset is 7.8 mV at 1.2 V supply.

Index Terms—Double-tail comparator, dynamic clocked comparator, high-speed analog-to-digital converters (ADCs), low-power analog design.

#### I. INTRODUCTION

**→** OMPARATOR is one of the fundamental building blocks in most analog-to-digital converters (ADCs). Many highspeed ADCs, such as flash ADCs, require high-speed, lowpower comparators with small chip area. High-speed comparators in ultra deep submicrometer (UDSM) CMOS technologies suffer from low supply voltages especially when considering the fact that threshold voltages of the devices have not been scaled at the same pace as the supply voltages of the modern CMOS processes [1]. Hence, designing high-speed comparators is more challenging when the supply voltage is smaller. In other words, in a given technology, to achieve high speed, larger transistors are required to compensate the reduction of supply voltage, which also means that more die area and power is needed. Besides, low-voltage operation results in limited common-mode input range, which is important in many high-speed ADC architectures, such as flash

Manuscript received September 4, 2012; revised December 7, 2012; accepted January 15, 2013. Date of publication February 11, 2013; date of current version January 17, 2014.

The authors are with the Electrical Engineering Group of Engineering Department, Ferdowsi University of Mashhad, Mashhad 91775-111, Iran (e-mail: babayan@ieee.org; rlotfi@ieee.org).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TVLSI.2013.2241799

ADCs. Many techniques, such as supply boosting methods [2], [3], techniques employing body-driven transistors [4], [5], current-mode design [6] and those using dual-oxide processes, which can handle higher supply voltages have been developed to meet the low-voltage design challenges. Boosting and bootstrapping are two techniques based on augmenting the supply, reference, or clock voltage to address input-range and switching problems. These are effective techniques, but they introduce reliability issues especially in UDSM CMOS technologies. Body-driven technique adopted by Blalock [4], removes the threshold voltage requirement such that bodydriven MOSFET operates as a depletion-type device. Based on this approach, in [5], a 1-bit quantizer for sub-1V  $\Sigma \Delta$ modulators is proposed. Despite the advantages, the bodydriven transistor suffers from smaller transconductance (equal to  $g_{mb}$  of the transistor) compared to its gate-driven counterpart while special fabrication process, such as deep n-well is required to have both nMOS and pMOS transistors operate in the body-driven configuration. Apart from technological modifications, developing new circuit structures which avoid stacking too many transistors between the supply rails is preferable for low-voltage operation, especially if they do not increase the circuit complexity. In [7]-[9], additional circuitry is added to the conventional dynamic comparator to enhance the comparator speed in low supply voltages. The proposed comparator of [7] works down to a supply voltage of 0.5 V with a maximum clock frequency of 600 MHz and consumes 18  $\mu$ W. Despite the effectiveness of this approach, the effect of component mismatch in the additional circuitry on the performance of the comparator should be considered. The structure of double-tail dynamic comparator first proposed in [10] is based on designing a separate input and crosscoupled stage. This separation enables fast operation over a wide common-mode and supply voltage range [10].

In this paper, a comprehensive analysis about the delay of dynamic comparators has been presented for various architectures. Furthermore, based on the double-tail structure proposed in [10], a new dynamic comparator is presented, which does not require boosted voltage or stacking of too many transistors. Merely by adding a few minimum-size transistors to the conventional double-tail dynamic comparator, latch delay time is profoundly reduced. This modification also results in considerable power savings when compared to the conventional dynamic comparator and double-tail comparator.

The rest of this paper is organized as follows. Section II investigates the operation of the conventional clocked regenerative comparators and the pros and cons of each



Fig. 1. Schematic diagram of the conventional dynamic comparator.

structure is discussed. Delay analysis is also presented and the analytical expressions for the delay of the comparators are derived. The proposed comparator is presented in Section III. Section IV discusses the design issues. Simulation results are addressed in Section V, followed by conclusions in Section VI.

#### II. CLOCKED REGENERATIVE COMPARATORS

Clocked regenerative comparators have found wide applications in many high-speed ADCs since they can make fast decisions due to the strong positive feedback in the regenerative latch. Recently, many comprehensive analyses have been presented, which investigate the performance of these comparators from different aspects, such as noise [11], offset [12], [13], and [14], random decision errors [15], and kick-back noise [16]. In this section, a comprehensive delay analysis is presented; the delay time of two common structures, i.e., conventional dynamic comparator and conventional dynamic double-tail comparator are analyzed, based on which the proposed comparator will be presented.

# A. Conventional Dynamic Comparator

The schematic diagram of the conventional dynamic comparator widely used in A/D converters, with high input impedance, rail-to-rail output swing, and no static power consumption is shown in Fig. 1 [1], [17]. The operation of the comparator is as follows. During the reset phase when CLK = 0 and  $M_{\text{tail}}$  is off, reset transistors  $(M_7 - M_8)$  pull both output nodes Outn and Outp to  $V_{DD}$  to define a start condition and to have a valid logical level during reset. In the comparison phase, when  $CLK = V_{DD}$ , transistors  $M_7$  and  $M_8$ are off, and  $M_{\text{tail}}$  is on. Output voltages (Outp, Outn), which had been pre-charged to  $V_{\rm DD}$ , start to discharge with different discharging rates depending on the corresponding input voltage (INN/INP). Assuming the case where  $V_{\text{INP}} > V_{\text{INN}}$ , Out p discharges faster than Outn, hence when Outp (discharged by transistor  $M_2$  drain current), falls down to  $V_{DD}-|V_{thp}|$ before Outn (discharged by transistor  $M_1$  drain current), the corresponding pMOS transistor  $(M_5)$  will turn on initiating the latch regeneration caused by back-to-back inverters  $(M_3, M_5)$ 



Fig. 2. Transient simulations of the conventional dynamic comparator for input voltage difference of  $\Delta V_{\rm in}=5$  mV,  $V_{\rm cm}=0.7$  V, and  $V_{\rm DD}=0.8$  V.

and  $M_4$ ,  $M_6$ ). Thus, Outn pulls to  $V_{\rm DD}$  and Outp discharges to ground. If  $V_{\rm INP} < V_{\rm INN}$ , the circuits works vice versa.

As shown in Fig. 2, the delay of this comparator is comprised of two time delays,  $t_0$  and  $t_{\text{latch}}$ . The delay  $t_0$  represents the capacitive discharge of the load capacitance  $C_L$  until the first p-channel transistor  $(M_5/M_6)$  turns on. In case, the voltage at node INP is bigger than INN (i.e.,  $V_{\text{INP}} > V_{\text{INN}}$ ), the drain current of transistor  $M_2$  ( $I_2$ ) causes faster discharge of Out p node compared to the Outp node, which is driven by  $M_1$  with smaller current. Consequently, the discharge delay ( $t_0$ ) is given by

$$t_0 = \frac{C_L |V_{\text{thp}}|}{I_2} \cong 2 \frac{C_L |V_{\text{thp}}|}{I_{\text{tail}}}.$$
 (1)

In (1), since  $I_2 = I_{\text{tail}}/2 + \Delta I_{\text{in}} = I_{\text{tail}}/2 + g_{\text{m1},2}\Delta V_{\text{in}}$ , for small differential input ( $\Delta V_{\text{in}}$ ),  $I_2$  can be approximated to be constant and equal to the half of the tail current.

The second term,  $t_{\text{latch}}$ , is the latching delay of two cross-coupled inverters. It is assumed that a voltage swing of  $\Delta V_{\text{out}} = V_{\text{DD}}/2$  has to be obtained from an initial output voltage difference  $\Delta V_0$  at the falling output (e.g.,  $\text{Out}\,p$ ). Half of the supply voltage is considered to be the threshold voltage of the comparator following inverter or SR latch [17]. Hence, the latch delay time is given by, [18]

$$t_{\text{latch}} = \frac{C_{\text{L}}}{g_{m,\text{eff}}} \cdot \ln\left(\frac{\Delta V_{\text{out}}}{\Delta V_{0}}\right) = \frac{C_{\text{L}}}{g_{m,\text{eff}}} \cdot \ln\left(\frac{V_{\text{DD}}/2}{\Delta V_{0}}\right) \quad (2)$$

where  $g_{m,\text{eff}}$  is the effective transconductance of the back-to-back inverters. In fact, this delay depends, in a logarithmic manner, on the initial output voltage difference at the beginning of the regeneration (i.e., at  $t=t_0$ ). Based on (1),  $\Delta V_0$  can be calculated from (3)

$$\Delta V_0 = |V_{\text{out}p}(t = t_0) - V_{\text{out}n}(t = t_0)|$$

$$= |V_{\text{thp}}| - \frac{I_2 t_0}{C_L} = |V_{\text{thp}}| \left(1 - \frac{I_2}{I_1}\right).$$
 (3)

The current difference,  $\Delta I_{\rm in} = |I_1 - I_2|$ , between the branches is much smaller than  $I_1$  and  $I_2$ . Thus,  $I_1$  can be

approximated by  $I_{\text{tail}}/2$  and (3) can be rewritten as

$$\Delta V_{0} = \left| V_{\text{thp}} \right| \frac{\Delta I_{\text{in}}}{I_{1}}$$

$$\approx 2 \left| V_{\text{thp}} \right| \frac{\Delta I_{\text{in}}}{I_{\text{tail}}}$$

$$= 2 \left| V_{\text{thp}} \right| \frac{\sqrt{\beta_{1,2} I_{\text{tail}}}}{I_{\text{tail}}} \Delta V_{\text{in}}$$

$$= 2 \left| V_{\text{thp}} \right| \sqrt{\frac{\beta_{1,2}}{I_{\text{tail}}}} \Delta V_{\text{in}}.$$
(4)

In this equation,  $\beta_{1,2}$  is the input transistors' current factor and  $I_{\text{tail}}$  is a function of input common-mode voltage ( $V_{\text{cm}}$ ) and  $V_{\text{DD}}$ . Now, substituting  $\Delta V_0$  in latch delay expression and considering  $t_0$ , the expression for the delay of the conventional dynamic comparator is obtained as

$$t_{\text{delay}} = t_0 + t_{\text{latch}}$$

$$= 2 \frac{C_L |V_{\text{thp}}|}{I_{\text{tail}}} + \frac{C_L}{g_{m,\text{eff}}} \cdot \ln \left( \frac{V_{\text{DD}}}{4 |V_{\text{thp}}| \Delta V_{\text{in}}} \sqrt{\frac{I_{\text{tail}}}{\beta_{1,2}}} \right). \quad (5)$$

Equation (5) explains the impact of various parameters. The total delay is directly proportional to the comparator load capacitance  $C_{\rm L}$  and inversely proportional to the input difference voltage ( $\Delta V_{\rm in}$ ). Besides, the delay depends indirectly to the input common-mode voltage ( $V_{\rm cm}$ ). By reducing  $V_{\rm cm}$ , the delay  $t_0$  of the first sensing phase increases because lower  $V_{\rm cm}$  causes smaller bias current ( $I_{\rm tail}$ ). On the other hand, (4) shows that a delayed discharge with smaller  $I_{\rm tail}$  results in an increased initial voltage difference ( $\Delta V_0$ ), reducing  $t_{\rm latch}$ . Simulation results show that the effect of reducing the  $V_{\rm cm}$  on increasing of  $t_0$  and reducing of  $t_{\rm latch}$  will finally lead to an increase in the total delay. In [17], it has been shown that an input common-mode voltage of 70% of the supply voltage is optimal regarding speed and yield.

In principle, this structure has the advantages of high input impedance, rail-to-rail output swing, no static power consumption, and good robustness against noise and mismatch [1]. Due to the fact that parasitic capacitances of input transistors do not directly affect the switching speed of the output nodes, it is possible to design large input transistors to minimize the offset. The disadvantage, on the other hand, is the fact that due to several stacked transistors, a sufficiently high supply voltage is needed for a proper delay time. The reason is that, at the beginning of the decision, only transistors  $M_3$  and  $M_4$  of the latch contribute to the positive feedback until the voltage level of one output node has dropped below a level small enough to turn on transistors  $M_5$  or  $M_6$  to start complete regeneration. At a low supply voltage, this voltage drop only contributes a small gate-source voltage for transistors  $M_3$  and  $M_4$ , where the gatesource voltage of  $M_5$  and  $M_6$  is also small; thus, the delay time of the latch becomes large due to lower transconductances.

Another important drawback of this structure is that there is only one current path, via tail transistor  $M_{\text{tail}}$ , which defines the current for both the differential amplifier and the latch (the cross-coupled inverters). While one would like a small tail current to keep the differential pair in weak inversion and obtain a long integration interval and a better Gm/I ratio, a



Fig. 3. Schematic diagram of the conventional double-tail dynamic comparator.

large tail current would be desirable to enable fast regeneration in the latch [10]. Besides, as far as  $M_{\text{tail}}$  operates mostly in triode region, the tail current depends on input common-mode voltage, which is not favorable for regeneration.

# B. Conventional Double-Tail Dynamic Comparator

A conventional double-tail comparator is shown in Fig. 3 [10]. This topology has less stacking and therefore can operate at lower supply voltages compared to the conventional dynamic comparator. The double tail enables both a large current in the latching stage and wider  $M_{\rm tail2}$ , for fast latching independent of the input common-mode voltage ( $V_{\rm cm}$ ), and a small current in the input stage (small  $M_{\rm tail1}$ ), for low offset [10].

The operation of this comparator is as follows (see Fig. 4). During reset phase (CLK = 0,  $M_{\text{tail1}}$ , and  $M_{\text{tail2}}$  are off), transistors  $M_3$ - $M_4$  pre-charge fn and fp nodes to  $V_{\text{DD}}$ , which in turn causes transistors  $M_{\text{R1}}$  and  $M_{\text{R2}}$  to discharge the output nodes to ground. During decision-making phase (CLK =  $V_{\text{DD}}$ ,  $M_{\text{tail1}}$  and  $M_{\text{tail2}}$  turn on),  $M_3$ - $M_4$  turn off and voltages at nodes fn and fp start to drop with the rate defined by  $I_{\text{Mtail1}}/C_{\text{fn(p)}}$  and on top of this, an input-dependent differential voltage  $\Delta V_{\text{fn(p)}}$  will build up. The intermediate stage formed by  $M_{\text{R1}}$  and  $M_{\text{R2}}$  passes  $\Delta V_{\text{fn(p)}}$  to the crosscoupled inverters and also provides a good shielding between input and output, resulting in reduced value of kickback noise [10].

Similar to the conventional dynamic comparator, the delay of this comparator comprises two main parts,  $t_0$  and  $t_{\text{latch}}$ . The delay  $t_0$  represents the capacitive charging of the load capacitance  $C_{L\text{out}}$  (at the latch stage output nodes, Outn and Outp) until the first n-channel transistor  $(M_9/M_{10})$  turns on, after which the latch regeneration starts; thus  $t_0$  is obtained



Fig. 4. Transient simulations of the conventional double-tail dynamic comparator for input voltage difference of  $\Delta V_{\rm in}=5$  mV,  $V_{\rm cm}=0.7$  V, and  $V_{\rm DD}=0.8$  V.

from

$$t_0 = \frac{V_{\text{Thn}} C_{Lout}}{I_{\text{B1}}} \approx 2 \frac{V_{\text{Thn}} C_{Lout}}{I_{\text{tail2}}}$$
 (6)

where  $I_{\rm B1}$  is the drain current of the  $M_9$  (assuming  $V_{\rm INP} > V_{\rm INN}$ , see Fig. 3) and is approximately equal to the half of the tail current ( $I_{\rm tail2}$ ).

After the first n-channel transistor of the latch turns on (for instance,  $M_9$ ), the corresponding output (e.g., Outn) will be discharged to the ground, leading front p-channel transistor (e.g.,  $M_8$ ) to turn on, charging another output (Out p) to the supply voltage ( $V_{\rm DD}$ ). The regeneration time ( $t_{\rm latch}$ ) is achieved according to (2). For the initial output voltage difference at time  $t_0$ ,  $\Delta V_0$  we have

$$\Delta V_0 = |V_{\text{out}p}(t = t_0) - V_{\text{out}n}(t = t_0)| = V_{\text{Thn}} - \frac{I_{\text{B2}}t_0}{C_{L_{\text{out}}}}$$

$$= V_{\text{Thn}} \left( 1 - \frac{I_{\text{B2}}}{I_{\text{B1}}} \right)$$
(7)

where  $I_{\rm B1}$  and  $I_{\rm B2}$  are the currents of the latch left- and right-side branches of the second stage, respectively.

Considering  $\Delta I_{\text{latch}} = |I_{\text{B1}} - I_{\text{B2}}| = g_{mR1,2} \Delta V_{\text{fn/fp}}$ , (7) can be rewritten as

$$\Delta V_0 = V_{\text{Thn}} \frac{\Delta I_{\text{latch}}}{I_{\text{B1}}} \approx 2V_{\text{Thn}} \frac{\Delta I_{\text{latch}}}{I_{\text{tail2}}} = 2V_{\text{Thn}} \frac{g_{\text{mR1,2}}}{I_{\text{tail2}}} \Delta V_{\text{fn/fp}}$$
(8)

where  $g_{mR1,2}$  is the transconductance of the intermediate stage transistors ( $M_{R1}$  and  $M_{R2}$ ) and  $\Delta V_{fn/fp}$  is the voltage difference at the first stage outputs (fn and fp) at time  $t_0$ . Thus, it can be concluded that two main parameters which influence the initial output differential voltage ( $\Delta V_0$ ) and thereby the latch regeneration time are the transconductance of the intermediate stage transistors ( $g_{mR1,2}$ ) and the voltage difference at the first stage outputs (fn and fp) at time  $t_0$ . In fact, intermediate stage transistors amplify the voltage difference of  $\Delta V_{fn/fp}$  causing the latch to be imbalanced.

The differential voltage at nodes fn/fp ( $\Delta V_{\rm fn/fp}$ ) at time  $t_0$  can be achieved from

$$\Delta V_{\text{fn/fp}} = |V_{\text{fn}}(t = t_0) - V_{\text{fp}}(t = t_0)|$$

$$= t_0 \cdot \frac{I_{\text{N1}} - I_{\text{N2}}}{C_{L,\text{fn(p)}}}$$

$$= t_0 \cdot \frac{g_{\text{m1},2} \Delta V_{\text{in}}}{C_{L,\text{fn(p)}}}.$$
(9)

In this equation,  $I_{\rm N1}$  and  $I_{\rm N2}$  refer to the discharging currents of input transistors ( $M_1$  and  $M_2$ ), which are dependent on the input differential voltage (i.e.,  $\Delta I_{\rm N} = g_{\rm m1,2} \Delta V_{\rm in}$ ). Substituting (9) in (8),  $\Delta V_0$  will be

$$\Delta V_0 = 2V_{\text{Thn}} \frac{g_{\text{mR1,2}}}{I_{\text{tail2}}} \Delta V_{\text{fn/fp}}$$

$$= \left(\frac{2V_{\text{Thn}}}{I_{\text{tail2}}}\right)^2 \cdot \frac{C_{L_{\text{out}}}}{C_{L,\text{fn(p)}}} \cdot g_{\text{mR1,2}} g_{\text{m1,2}} \Delta V_{\text{in}}. \quad (10)$$

This equation shows that  $\Delta V_0$  depends strongly on the transconductance of input and intermediate stage transistors, input voltage difference ( $\Delta V_{\rm in}$ ), latch tail current, and the capacitive ratio of  $C_{L{\rm out}}$  to  $C_{L{\rm ,fn(p)}}$ . Substituting  $\Delta V_0$  in latch regeneration time (2), the total delay of this comparator is achieved as follows:

$$t_{\text{delay}} = t_0 + t_{\text{latch}} = 2 \frac{V_{\text{Thn}} C_{L\text{out}}}{I_{\text{tail2}}} + \frac{C_{L\text{out}}}{g_{m,\text{eff}}} \cdot \ln \left( \frac{V_{\text{DD}}/2}{\Delta V_0} \right)$$

$$= 2 \frac{V_{\text{Thn}} C_{L\text{out}}}{I_{\text{tail2}}} + \frac{C_{L\text{out}}}{g_{m,\text{eff}}}$$

$$\cdot \ln \left( \frac{V_{\text{DD}} \cdot I_{\text{tail2}}^2 \cdot C_{L,\text{fn(p)}}}{8V_{\text{Thn}}^2 \cdot C_{L\text{out}} g_{\text{mR1},2} \Delta V_{\text{in}}} \right). \tag{11}$$

From the equations derived for the delay of the double-tail dynamic comparator, some important notes can be concluded.

- 1) The voltage difference at the first stage outputs ( $\Delta V_{\rm fn/fp}$ ) at time  $t_0$  has a profound effect on latch initial differential output voltage ( $\Delta V_0$ ) and consequently on the latch delay. Therefore, increasing it would profoundly reduce the delay of the comparator.
- 2) In this comparator, both intermediate stage transistors will be finally cut-off, (since fn and fp nodes both discharge to the ground), hence they do not play any role in improving the effective transconductance of the latch. Besides, during reset phase, these nodes have to be charged from ground to V<sub>DD</sub>, which means power consumption. The following section describes how the proposed comparator improves the performance of the double-tail comparator from the above points of view.

# III. PROPOSED DOUBLE-TAIL DYNAMIC COMPARATOR

Fig. 5 demonstrates the schematic diagram of the proposed dynamic double-tail comparator. Due to the better performance of double-tail architecture in low-voltage applications, the proposed comparator is designed based on the double-tail structure. The main idea of the proposed comparator is to increase  $\Delta V_{\rm fn/fp}$  in order to increase the latch regeneration speed. For this purpose, two control transistors ( $M_{\rm c1}$  and  $M_{\rm c2}$ ) have been added to the first stage in parallel to  $M_3/M_4$  transistors but in a cross-coupled manner [see Fig. 5(a)].



Fig. 5. Schematic diagram of the proposed dynamic comparator. (a) Main idea. (b) Final structure.

#### A. Operation of the Proposed Comparator

The operation of the proposed comparator is as follows (see Fig. 6). During reset phase (CLK = 0,  $M_{\text{tail1}}$  and  $M_{\text{tail2}}$  are off, avoiding static power),  $M_3$  and  $M_4$  pulls both fn and fp nodes to  $V_{\text{DD}}$ , hence transistor  $M_{\text{c1}}$  and  $M_{\text{c2}}$  are cut off. Intermediate stage transistors,  $M_{\text{R1}}$  and  $M_{\text{R2}}$ , reset both latch outputs to ground.

During decision-making phase (CLK =  $V_{DD}$ ,  $M_{tail1}$ , and  $M_{\text{tail2}}$  are on), transistors  $M_3$  and  $M_4$  turn off. Furthermore, at the beginning of this phase, the control transistors are still off (since fn and fp are about  $V_{DD}$ ). Thus, fn and fp start to drop with different rates according to the input voltages. Suppose  $V_{\rm INP} > V_{\rm INN}$ , thus fn drops faster than fp, (since  $M_2$  provides more current than  $M_1$ ). As long as fn continues falling, the corresponding pMOS control transistor ( $M_{c1}$  in this case) starts to turn on, pulling fp node back to the  $V_{\rm DD}$ ; so another control transistor  $(M_{c2})$  remains off, allowing fn to be discharged completely. In other words, unlike conventional double-tail dynamic comparator, in which  $\Delta V_{\text{fn/fp}}$  is just a function of input transistor transconductance and input voltage difference (9), in the proposed structure as soon as the comparator detects that for instance node fn discharges faster, a pMOS transistor  $(M_{c1})$  turns on, pulling the other node fp back to the  $V_{DD}$ . Therefore by the time passing, the difference between fn and fp  $(\Delta V_{\text{fn/fp}})$  increases in an exponential manner, leading to the reduction of latch regeneration time (this will be shown in Section III-B). Despite the effectiveness of the proposed idea, one of the points which should be considered is that in this circuit, when one of the control transistors (e.g.,  $M_{c1}$ ) turns on, a current from  $V_{DD}$  is drawn to the ground via input and tail transistor (e.g.,  $M_{c1}$ , M1, and  $M_{tail1}$ ), resulting in static power consumption. To overcome this issue, two nMOS switches are used below the input transistors [ $M_{sw1}$  and  $M_{sw2}$ , as shown in Fig. 5(b)].

At the beginning of the decision making phase, due to the fact that both fn and fp nodes have been pre-charged to  $V_{\rm DD}$ 



Fig. 6. Transient simulations of the proposed double-tail dynamic comparator for input voltage difference of  $\Delta \textit{V}_{in}=5$  mV,  $\textit{V}_{cm}=0.7$  V, and  $\textit{V}_{DD}=0.8$  V.

(during the reset phase), both switches are closed and fn and fp start to drop with different discharging rates. As soon as the comparator detects that one of the fn/fp nodes is discharging faster, control transistors will act in a way to increase their voltage difference. Suppose that fp is pulling up to the  $V_{\rm DD}$  and fn should be discharged completely, hence the switch in the charging path of fp will be opened (in order to prevent any current drawn from  $V_{\rm DD}$ ) but the other switch connected to fn will be closed to allow the complete discharge of fn node. In other words, the operation of the control transistors with the switches emulates the operation of the latch. This will be more discussed in the following section.

# B. Delay Analysis

In order to theoretically demonstrate how the delay is reduced, delay equations are derived for this structure as

previously done for the conventional dynamic comparator and the conventional double-tail dynamic comparator. The analysis is similar to the conventional double-tail dynamic comparator, however; the proposed dynamic comparator enhances the speed of the double-tail comparator by affecting two important factors: first, it increases the initial output voltage difference  $(\Delta V_0)$  at the beginning of the regeneration  $(t=t_0)$ ; and second, it enhances the effective transconductace  $(g_{\text{meff}})$  of the latch. Each of these factors will be discussed in detail.

1) Effect of Enhancing  $\Delta V_0$ : As discussed before, we define  $t_0$ , as a time after which latch regeneration starts. In other words,  $t_0$  is considered to be the time it takes (while both latch outputs are rising with different rates) until the first nMOS transistor of the back-to-back inverters turns on, so that it will pull down one of the outputs and regeneration will commence. According to (2), the latch output voltage difference at time  $t_0$ , ( $\Delta V_0$ ) has a considerable impact on the latch regeneration time, such that bigger  $\Delta V_0$  results in less regeneration time. Similar to the equation derived for the  $\Delta V_0$  of the double-tail structure, in this comparator we have

$$\Delta V_0 = V_{\text{Thn}} \frac{\Delta I_{\text{latch}}}{I_{\text{B1}}}$$

$$\approx 2V_{\text{Thn}} \frac{\Delta I_{\text{latch}}}{I_{\text{tail2}}}$$

$$= 2V_{\text{Thn}} \frac{g_{\text{mR1,2}}}{I_{\text{tail2}}} \Delta V_{\text{fn/fp}}.$$
(12)

In order to find  $\Delta V_{\rm fn/fp}$  at  $t=t_0$ , we shall notice that the combination of the control transistors ( $M_{\rm c1}$  and  $M_{\rm c2}$ ) with two serial switches ( $M_{\rm sw1}$ ,  $M_{\rm sw2}$ ) emulates the operation of a backto-back inverter pair; thus using small-signal model presented in [18],  $\Delta V_{\rm fn/fp}$  is calculated by

$$\Delta V_{\rm fn/fp} = \Delta V_{\rm fn(p)0} \exp((A_{\rm v} - 1)t/\tau). \tag{13}$$

In this equation,  $\frac{\tau}{A_{\rm v}-1} \cong \frac{C_{\rm L,fn(p)}}{G_{\rm m,effl}}$  and  $\Delta V_{\rm fn(p)0}$  is the initial fn/fp node difference voltage at the time when the corresponding pMOS control transistor is started to be turned on. Hence, it can be shown that  $\Delta V_{\rm fn(p)0}$  is obtained from

$$\Delta V_{\text{fn(p)0}} = 2 \left| V_{\text{Thp}} \right| \frac{g_{\text{m1,2}} \Delta V_{\text{in}}}{I_{\text{tail1}}}.$$
 (14)

Substituting (13) in (12),  $\Delta V_0$  will be

$$\begin{split} \Delta V_{0} &= 2 V_{\text{Thn}} \frac{g_{\text{mR1,2}}}{I_{\text{tail2}}} \Delta V_{\text{fn/fp}} \\ &= 4 V_{\text{Thn}} \left| V_{\text{Thp}} \right| \frac{g_{\text{mR1,2}}}{I_{\text{tail2}}} \frac{g_{\text{m1,2}} \Delta V_{\text{in}}}{I_{\text{tail1}}} \exp \left( \frac{G_{\text{m,eff1}} \cdot t_{0}}{C_{\text{L,fn(p)}}} \right) . \end{split}$$
(15)

Comparing (15) with (10), it is evident that  $\Delta V_0$  has been increased remarkably (in an exponential manner) in compare with the conventional dynamic comparator.

2) Effect of Enhancing Latch Effective Transconductance: As mentioned before, in conventional double-tail comparator, both fin and fp nodes will be finally discharged completely. In our proposed comparator, however, the fact that one of the first stage output nodes (fn/fp) will charge up back to the  $V_{\rm DD}$  at the beginning of the decision making phase, will turn on one of the intermediate stage transistors, thus the effective

transconductance of the latch is increased. In other words, positive feedback is strengthened. Hence,  $t_{\text{latch}}$  will be

$$t_{\text{latch}} = \frac{C_{L\text{out}}}{g_{\text{m,eff}} + g_{\text{mR1,2}}} \cdot \ln\left(\frac{\Delta V_{\text{out}}}{\Delta V_0}\right)$$
$$= \frac{C_{L\text{out}}}{g_{\text{m,eff}} + g_{\text{mR1,2}}} \cdot \ln\left(\frac{V_{\text{DD}}/2}{\Delta V_0}\right). \tag{16}$$

Finally, by including both effects, the total delay of the proposed comparator is achieved from

$$t_{\text{delay}} = t_{0} + t_{\text{latch}}$$

$$= 2 \frac{V_{\text{Thn}} C_{\text{Lout}}}{I_{\text{tail2}}} + \frac{C_{\text{Lout}}}{g_{\text{m,eff}} + g_{\text{mr1,2}}} \cdot \ln\left(\frac{V_{\text{DD}}/2}{\Delta V_{0}}\right)$$

$$= 2 \frac{V_{\text{Thn}} C_{\text{Lout}}}{I_{\text{tail2}}} + \frac{C_{\text{Lout}}}{g_{\text{m,eff}} + g_{\text{mR1,2}}}$$

$$\times \ln\left(\frac{V_{\text{DD}}/2}{4V_{\text{Thn}} |V_{\text{Thp}}| \frac{g_{\text{mR1,2}}}{I_{\text{tail2}}} \frac{g_{\text{m1,2}} \Delta V_{\text{in}}}{I_{\text{tail1}}} \exp\left(\frac{G_{\text{m,eff1}} \cdot I_{0}}{C_{\text{L,fn(p)}}}\right)}\right).$$
(17)

By comparing the expressions derived for the delay of the three mentioned structures, it can be seen that the proposed comparator takes advantage of an inner positive feedback in double-tail operation, which strengthen the whole latch regeneration. This speed improvement is even more obvious in lower supply voltages. This is due to the fact that for larger values of  $V_{\rm Th}/V_{\rm DD}$ , the transconductance of the transistors decreases, thus the existence of an inner positive feedback in the architecture of the first stage will lead to the improved performance of the comparator. Simulation results confirm this matter.

3) Reducing the Energy Per Comparison: It is not only the delay parameter which is improved in the modified proposed comparator, but the energy per conversion is reduced as well. As discussed earlier, in conventional double-tail topology, both fn and fp nodes discharge to the ground during the decision making phase and each time during the reset phase they should be pulled up back to the  $V_{\rm DD}$ . However, in our proposed comparator, only one of the mentioned nodes (fn/fp) has to be charged during the reset phase. This is due to the fact that during the previous decision making phase, based on the status of control transistors, one of the nodes had not been discharged and thus less power is required. This can be seen when being compared with conventional topologies [see Figs. 9(b) and 10(b)].

#### IV. DESIGN CONSIDERATIONS

In designing the proposed comparator, some design issues must be considered. When determining the size of tail transistors ( $M_{\text{tail}1}$  and  $M_{\text{tail}2}$ ), it is necessary to ensure that the time it takes that one of the control transistors turns on must be smaller than  $t_0$  (start of regeneration)

$$t_{\text{on,Mc1(2)}} < t_0 \rightarrow \frac{\left| V_{\text{Thp}} \right| \cdot C_{\text{L,fn(p)}}}{I_{\text{n1,2}}} < \frac{V_{\text{Thn}} C_{L\text{out}}}{I_{\text{B1}}}$$
$$\rightarrow \frac{\left| V_{\text{Thp}} \right| \cdot C_{\text{L,fn(p)}}}{\frac{I_{\text{Tail1}}}{I_{\text{2}}}} < \frac{V_{\text{Thn}} \cdot C_{L\text{out}}}{\frac{I_{\text{Tail2}}}{I_{\text{2}}}}. (18)$$

This condition can be easily achieved by properly designing the first and second stage tail currents. Even if possible in the fabrication technology, low-threshold pMOS devices can be used as control transistors leading to faster turn on.

In designing the nMOS switches, located below the input transistors, the drain-source voltage of these switches must be considered since it might limit the voltage headroom, restricting the advantage of being used in low-voltage applications. In order to diminish this effect, low-on-resistance nMOS switches are required. In other words, large transistors must be used. Since the parasitic capacitances of these switches do not affect the parasitic capacitances of the fn/fp nodes (delay bottlenecks), it is possible to optimally select the size of the nMOS switch transistors in a way that both low-voltage and low-power operations are maintained.

The effect of mismatch between controlling transistors on the total input-referred offset of the comparator is another important issue. When determining the size of controlling transistors ( $M_{\rm C1}-M_{\rm C2}$ ), two important issues should be considered. First, the effect of threshold voltage mismatch and current factor mismatch of the controlling transistors on the comparator input-referred offset voltage. Second, the effect of transistor sizing on parasitic capacitances of the fn/fp nodes, i.e.,  $C_{\rm L,fn(p)}$ , and consequently the delay of the comparator. While larger transistors are required for better matching; however, the increased parasitic capacitances are delay bottlenecks. In order to study the effect of threshold and current factor mismatch of control transistors on the total input-referred offset voltage, a brief mismatch analysis is presented here.

#### A. Mismatch Analysis

In principle, the effect of threshold voltage mismatch and current factor mismatch of controlling transistors is almost negligible in most cases except for the situation where input differential voltage ( $\Delta V_{\rm in}$ ) is very small where fn and fp have approximately similar discharging rates. This is true because by the time that the controlling transistor ( $M_{\rm c1}$  or  $M_{\rm C2}$ ) turns on, the differential input signal is already amplified to large amplitude compared to the mismatches. In other words, offset due to the controlling transistor mismatches is divided by the gain from the input to the output. However, in case of small  $\Delta V_{\rm in}$ , when fn and fp follow each other tightly, the mismatch of the controlling transistors might influence the result of the comparison. Hence, the following brief analyzes the effect of threshold and current factor mismatches of controlling transistors on the total input-referred offset voltage.

1)Effect of Threshold Voltage Mismatch of  $M_{C1}$ ,  $M_{C2}$ , i.e.,  $\Delta V_{ThC1,2}$ :

The differential current due to the threshold voltage mismatch can be obtained from

$$i_{\text{diff}} = g_{\text{mc1,2}} \Delta V_{\text{Thc1,2}} \tag{19}$$

where  $g_{\text{mc1},2}$  is the transconductance of the controlling transistors. So, the input-referred offset voltage due to the  $M_{\text{c1},2}$ 

threshold voltage mismatch is obtained as follows:

$$\Delta V_{\text{eq,due}} \Delta V_{\text{Thc1,2}} = \frac{g_{\text{mc1,2}} \Delta V_{\text{Thc1,2}}}{g_{\text{m1,2}}}$$
$$= \frac{\mu_{\text{p}} W_{\text{C1,2}} V_{\text{ODC1,2}}}{\mu_{\text{n}} W_{\text{1,2}} V_{\text{OD1,2}}} \Delta V_{\text{Thc1,2}}$$
(20)

where  $V_{\text{OD}}$  refers to the overdrive voltage of the transistors. 2) Effect of Current-Factor Mismatch  $M_{\text{C1}}$ ,  $M_{\text{C2}}$ , i.e.,  $\Delta \beta_{\text{C1},2}$ :

In order to calculate the input-referred offset due to the current factor mismatch of  $M_{\text{Cl},2}$ ,  $\Delta\beta_{\text{Cl},2}$  is modeled as a channel width mismatch  $\Delta W$ , i.e.,  $\Delta\beta/\beta = \Delta W/W$ . The differential current that  $\Delta W$  generates can be obtained as expressed in (21).

$$i_{\text{diff}} = \frac{1}{2} \mu_{\text{p}} C_{\text{ox}} \frac{\Delta W}{L} (V_{\text{gsc1},2} - V_{\text{thc1},2})^2.$$
 (21)

Note that the controlling transistors are in saturation since  $|V_{\rm GDc1,2}| = |V_{\rm fn} - V_{\rm fp}| < |V_{\rm thp}|$ . So the input-referred offset voltage due to the current factor mismatch is calculated from

$$\Delta V_{\text{eq,due}} \Delta \beta_{\text{C1,2}} = \frac{i_{\text{diff}}}{g_{\text{m1,2}}} 
= \frac{0.5 \mu_{\text{p}} W_{\text{C1,2}} (V_{\text{gsc1,2}} - V_{\text{thp}})^2}{\mu_{\text{n}} W_{1,2} (V_{\text{gs1,2}} - V_{\text{thn}})} 
= \frac{0.5 \mu_{\text{p}} W_{\text{C1,2}} V_{\text{ODc1,2}}^2}{\mu_{\text{n}} W_{1,2} (V_{\text{cm}} - R_{\text{CLK}} K_{\text{nC1,2}} V_{\text{ODc1,2}}^2 - V_{\text{thn}})}$$
(22)

where  $V_{\rm cm}$  is the input common mode voltage and  $R_{\rm clk}$  is the equivalent on resistance of the tail transistor.

Assuming both mismatch factors, the total input-referred offset due to the mismatch of the controlling transistors can be found from

$$\sigma_{\text{total}} = \sqrt{\sigma_{\Delta V_{\text{ThCI},2}}^2 + \sigma_{\Delta \beta_{\text{CI},2}}^2}.$$
 (23)

From (20) and (22), it can be concluded that the ratio of the controlling transistor sizes to the input transistor size, i.e.,  $(W_{C1,2}/W_{1,2})$ , is effective in reducing the offset. Due to the fact that the transconductance of the input transistors  $(g_{m1,2})$  is important in amplifying the input differential voltage and due to the dominant role of the size of these transistors on total input-referred offset, usually large input transistors are designed, which results in diminishing the effect of controlling transistors mismatch.

# B. Kickback Noise

Principally in latched comparators, the large voltage variations on the regeneration nodes are coupled, through the parasitic capacitances of the transistors, to the input of the comparator. Since the circuit preceding it does not have zero output impedance, the input voltage is disturbed, which may degrade the accuracy of the converter. This disturbance is usually called "kickback noise." In [16], it has been shown that the fastest and most power efficient comparators generate more kickback noise. This is true about our proposed dynamic comparator. Although it improves the double-tail



Fig. 7. Peak input voltage error due to kickback noise.



Fig. 8. Layout schematic diagram of the proposed dynamic comparator.

topology in terms of operation speed and thus energy per comparison, the kickback noise is increased in comparison to conventional double-tail structure (Fig. 3). Fig. 7 presents the peak disturbance as a function of differential input voltage of the comparator in three studied architectures. While doubletail structure takes advantage of input-output isolation and thus the minimum kickback noise, the conventional dynamic comparator and our proposed structure has nearly similar kickback noise. However, in our proposed comparator since control transistors are not supposed to be as strong as the latch transistors in conventional dynamic comparator, it is possible to determine the size of those transistors in a way that keeps the advantages of the speed enhancement and power reduction, while reducing kickback noise. Besides, for some applications where kickback becomes important, it is possible to apply simple kickback reduction techniques, such as neutralization [16] to remarkably reduce the kickback noise (See Fig. 7, proposed dynamic comparator with neutralization).

# V. SIMULATION RESULTS

In order to compare the proposed comparator with the conventional and double-tail dynamic comparators, all circuits have been simulated in a 0.18- $\mu$ m CMOS technology with  $V_{\rm DD}=1.2$  V. The comparators were optimized and the transistor dimensions were scaled to get an equal offset standard variation of  $\sigma_{\rm OS}=8$  mV at the input commonmode voltage of  $V_{\rm cm}=1.1$  V (the same conditions that are



Fig. 9. (a) Post-layout simulated delay and (b) energy per conversion as a function of supply voltage ( $\Delta V_{\rm in}=50$  mV,  $V_{\rm cm}=V_{\rm DD}-0.1$ ).



Fig. 10. (a) Post-layout simulated delay and (b) energy per conversion as a function of input common-mode voltage ( $\Delta V_{in} = 50$  mV,  $V_{DD} = 1.2$ ).

 $\label{eq:table I} \textbf{Summary of the Comparator Performance}$ 

| Item                                                                                   | Value       |
|----------------------------------------------------------------------------------------|-------------|
| Technology                                                                             | 180-nm CMOS |
| Supply voltage                                                                         | 1.2 V       |
| Average power dissipation per conversion @ freq. = 500 MHz                             | 329 μW      |
| Worst case delay ( $V_{\rm cm} = 0.6 \text{ V}$ , $\Delta V_{\rm in} = 1 \text{ mV}$ ) | 550 ps      |
| Delay/log( $\Delta V_{ m in}$ )                                                        | 69 ps/dec   |
| Offset standard deviation (1-sigma) $(\sigma_{os})$                                    | 7.8 mV      |
| Energy efficiency                                                                      | 0.66 pJ     |

found in [10]). Fig. 8 shows the layout of the comparator. Particular care was taken in the layout to avoid affecting delay and power of the comparator. Fig. 9(a) and (b) demonstrates

| TABLE II               |  |  |  |  |  |  |
|------------------------|--|--|--|--|--|--|
| PERFORMANCE COMPARISON |  |  |  |  |  |  |

| Comparator structure                                                | Conventional Dynamic<br>Comparator | Double-tail Dynamic<br>Comparator | Proposed Dynamic Comparator |                                           |
|---------------------------------------------------------------------|------------------------------------|-----------------------------------|-----------------------------|-------------------------------------------|
| Technology CMOS                                                     | 180 nm                             | 180 nm                            | 180 nm                      |                                           |
| Supply voltage (V)                                                  | 0.8 V                              | 0.8 V                             | 0.8 V                       |                                           |
| Maximum sampling frequency                                          | 900 MHz                            | 1.8 GHz                           | 2.4 GHz                     |                                           |
| Delay/log( $\Delta V_{\rm in}$ ) (ps/dec.)                          | 940                                | 358                               | 294                         |                                           |
| Peak transient noise voltage at regeneration time(nV)               | 215 n                              | 221 n                             | 219 n                       |                                           |
| Kickback noise voltage (at $\Delta V_{\text{in}} = 10 \text{ mV}$ ) | 51.3 mV                            | 5.3 mV                            | 43 mV                       | With neutralization: 13 mV                |
| Energy per conversion (J)                                           | 0.3 p                              | 0.27 p                            | 0.24 p                      | Without $M_{sw1}$ and $M_{sw2}$ : 0.265 p |
| Input-referred offset voltage (mV)                                  | 7.89 mV                            | 7.91 mV                           | 7.8 m                       |                                           |
| Estimated area                                                      | $16 \mu \times 16 \mu$             | $28 \mu \times 12 \mu$            | $28 \mu \times 14 \mu$      |                                           |

the post-layout simulation results of the delay and the energy per conversion of the mentioned dynamic comparators versus supply voltage variation. As shown in Fig. 9(a), in comparison with the other two structures, the delay of the proposed doubletail dynamic comparator is significantly reduced in low-voltage supplies. It is obvious that at high supply voltages, all structures have approximately similar performances, about 200 ps clk-to-output delay (including clock buffer) with 0.65 pJ/bitconversion for 8-mV offset. However, by decreasing the supply voltage, three structures start to behave differently. It is evident that the double-tail topology can operate faster and can be used in lower supply voltages, while consuming nearly the same power as the conventional dynamic comparator. The case is even much better for the proposed comparator when compared to the conventional double-tail topology. For instance, the proposed comparator can operate in 0.6 V supply at the cost of 106 fJ/conversion with 840 ps delay versus 1.81 ns for the conventional double-tail comparator and 3.5 ns for the conventional topology. Our simulations show that if the circuit is optimized for  $V_{\rm DD} = 0.6$  V, the results would be even better for the proposed circuit. Fig. 10 shows the simulated performance as a function of input common-voltage  $(V_{\rm cm})$ . Generally in the double-tail topologies, the delay of the comparator is less influenced by the variation of the input common-mode voltage in comparison with the conventional dynamic topology and thus has a wider common-mode range. The power consumption is nearly equal.

Fig. 11 depicts the dependence of the comparator delay on power supply level at various differential input voltages. For  $\Delta V_{\rm in}=10$  mV, the delay is 460 ps at  $V_{\rm DD}=0.9$  V. This delay drops from 460 to 162 ps when  $V_{\rm DD}$  changes from 0.9 to 1.5 V. In addition, at a given  $V_{\rm DD}$ , the larger the differential input voltage, the smaller the comparator delay will be. Fig. 12 shows the simulated delay of the comparator versus differential input voltage under different conditions of input common-mode voltage ( $V_{\rm cm}$ ) at  $V_{\rm DD}=1.2$  V. The delay of the comparator at  $\Delta V_{\rm in}=1$  mV and  $V_{\rm cm}=700$  m is 413 ps. For a given value of  $V_{\rm cm}$ , the delay decreases as differential input voltage increases. Furthermore, the delay is also dependent on the variation of common-mode voltage. For example, at  $\Delta V_{\rm in}=10$  mV, the delay increases by 64 ps, from 239 to 303 ps, as  $V_{\rm cm}$  decreases from 900 to 700 mV. As shown in



Fig. 11. Delay of the proposed comparator versus supply voltage  $(V_{\rm DD})$ .



Fig. 12. Delay of the proposed comparator versus input voltage difference  $(\Delta V_{\rm in})$ .

Fig. 13, the standard deviation of the offset of the proposed comparator is achieved to be  $\sigma_{OS} = 7.8$  mV using Monte Carlo simulations for a run of 300 samples. Table I summarizes the performance of the proposed dynamic comparator. Finally,



Fig. 13. Histogram diagram of Monte Carlo simulation of the offset in proposed dynamic comparator.

Table II compares the performance of the proposed comparator with the conventional dynamic and double-tail comparators. In 0.18- $\mu$ m CMOS, the proposed comparator provides the maximum sampling of 2.4 GHz at 0.8-V supply voltage.

#### VI. CONCLUSION

In this paper, we presented a comprehensive delay analysis for clocked dynamic comparators and expressions were derived. Two common structures of conventional dynamic comparator and conventional double-tail dynamic comparators were analyzed. Also, based on theoretical analyses, a new dynamic comparator with low-voltage low-power capability was proposed in order to improve the performance of the comparator. Post-layout simulation results in 0.18- $\mu$ m CMOS technology confirmed that the delay and energy per conversion of the proposed comparator is reduced to a great extent in comparison with the conventional dynamic comparator and double-tail comparator.

# REFERENCES

- [1] B. Goll and H. Zimmermann, "A comparator with reduced delay time in 65-nm CMOS for supply voltages down to 0.65," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 56, no. 11, pp. 810–814, Nov. 2009.
- [2] S. U. Ay, "A sub-1 volt 10-bit supply boosted SAR ADC design in standard CMOS," *Int. J. Analog Integr. Circuits Signal Process.*, vol. 66, no. 2, pp. 213–221, Feb. 2011.
- [3] A. Mesgarani, M. N. Alam, F. Z. Nelson, and S. U. Ay, "Supply boosting technique for designing very low-voltage mixed-signal circuits in standard CMOS," in *Proc. IEEE Int. Midwest Symp. Circuits Syst. Dig. Tech. Papers*, Aug. 2010, pp. 893–896.
- [4] B. J. Blalock, "Body-driving as a Low-Voltage Analog Design Technique for CMOS technology," in *Proc. IEEE Southwest Symp. Mixed-Signal Design*, Feb. 2000, pp. 113–118.
- [5] M. Maymandi-Nejad and M. Sachdev, "1-bit quantiser with rail to rail input range for sub-1V ΔΣ modulators," *IEEE Electron. Lett.*, vol. 39, no. 12, pp. 894–895, Jan. 2003.
- [6] Y. Okaniwa, H. Tamura, M. Kibune, D. Yamazaki, T.-S. Cheung, J. Ogawa, N. Tzartzanis, W. W. Walker, and T. Kuroda, "A 40Gb/s CMOS clocked comparator with bandwidth modulation technique," *IEEE J. Solid-State Circuits*, vol. 40, no. 8, pp. 1680–1687, Aug. 2005.
- [7] B. Goll and H. Zimmermann, "A 0.12 μm CMOS comparator requiring 0.5V at 600MHz and 1.5V at 6 GHz," in *Proc. IEEE Int. Solid-State Circuits Conf., Dig. Tech. Papers*, Feb. 2007, pp. 316–317.

- [8] B. Goll and H. Zimmermann, "A 65nm CMOS comparator with modified latch to achieve 7GHz/1.3mW at 1.2V and 700MHz/47μW at 0.6V," in Proc. IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers, Feb. 2009, pp. 328–329.
- [10] D. Shinkel, E. Mensink, E. Klumperink, E. van Tuijl, and B. Nauta, "A double-tail latch-type voltage sense amplifier with 18ps Setup+Hold time," in *Proc. IEEE Int. Solid-State Circuits Conf., Dig. Tech. Papers*, Feb. 2007, pp. 314–315.
- [11] P. Nuzzo, F. D. Bernardinis, P. Terreni, and G. Van der Plas, "Noise analysis of regenerative comparators for reconfigurable ADC architectures," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 55, no. 6, pp. 1441–1454, Jul. 2008.
- [12] A. Nikoozadeh and B. Murmann, "An analysis of latched comaprator offset due to load capacitor mismatch," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 53, no. 12, pp. 1398–1402, Dec. 2006.
- [13] S. Babayan-Mashhadi and R. Lotfi, "An offset cancellation technique for comparators using body-voltage trimming," *Int. J. Analog Integr. Circuits Signal Process.*, vol. 73, no. 3, pp. 673–682, Dec. 2012.
- [14] J. He, S. Zhan, D. Chen, and R. J. Geiger, "Analyses of static and dynamic random offset voltages in dynamic comparators," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 56, no. 5, pp. 911–919, May 2009.
- [15] J. Kim, B. S. Leibowits, J. Ren, and C. J. Madden, "Simulation and analysis of random decision errors in clocked comparators," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 56, no. 8, pp. 1844–1857, Aug. 2009.
- [16] P. M. Figueiredo and J. C. Vital, "Kickback noise reduction technique for CMOS latched comapartors," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 53, no. 7, pp. 541–545, Jul. 2006.
- [17] B. Wicht, T. Nirschl, and D. Schmitt-Landsiedel, "Yield and speed optimization of a latch-type voltage sense amplifier," *IEEE J. Solid-State Circuits*, vol. 39, no. 7, pp. 1148–1158, Jul. 2004.
- [18] D. Johns and K. Martin, Analog Integrated Circuit Design, New York, USA: Wiley, 1997.



circuits.

Samaneh Babayan-Mashhadi was born in Mashhad, Iran, in 1984. She received the B.S. and M.S. degrees (Hons.) in electrical engineering from the Ferdowsi University of Mashhad, Mashhad, Iran, in 2006 and 2008, respectively, where she is currently pursuing the Ph.D. degree, from the Electrical Engineering Department, with research on design of lowpower high-resolution, high-speed analog-to-digital converters.

Her current research interests include low-power low-voltage analog and mixed-signal integrated



**Reza Lotfi** received the B.Sc. degree from Ferdowsi University of Mashhad, Mashhad, Iran, the M.Sc. degree from the Sharif University of Technology, Tehran, Iran, and the Ph.D. degree from the University of Tehran, Tehran, in 1997, 1999, and 2004, respectively, all in electrical engineering.

Since 2004, he has been with the Ferdowsi University of Mashhad, where he is currently an Associate Professor. From 2008 to 2009, he was with the Electronics Research Laboratory, Delft University of Technology, Delft, the Netherlands, as a Post-

Doctoral Scientific Researcher involved in research on ultralow-power analog and mixed-signal integrated circuits for biomedical applications. His current research interests include low-voltage low-power analog integrated circuit design for biomedical applications and high-speed data converters for telecommunication systems.

Dr. Lotfi has been an Associate Editor of the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—PART I: REGULAR PAPERS since 2010.