## VISS MINI PROJECT

## BCD TO EXCESS 3 CODE CONVERTER

SUBMITTED BY:
AGNISH JAIN
(M.E. Microelectronics- $1^{\text {st }}$ Yr.)

SR.NO.4610-411-021-02172

Guided By:
Dr. Navakant Bhat
Dr. H.S. Jamdagni
I.I.Sc, Bangalore.

HEMANT H PARATE
(M.E Microelectronics- $1^{\text {st }}$ Yr.)

SR.NO.4610-411-021-02195

## BCD to EXCESS-3 CODE Converter

## Design Implementation

## - STATIC CMOS LOGIC

Parameters of Technology Node.

- Transistor Count - $\mathbf{3 8}$ (without Buffers including ERROR signal) -42(with buffers)
- Power Dissipation $=7.5256 \mathrm{e}-011 \mathrm{~W}$
- Propagation Delay TPHL=2.035 nsec. ( worst case)
- Area
$B$ Active=10325 sq. units
B Total=52000 sq. units
Additional Resources- More no. of Transistors.
- DYNAMIC LOGIC
- Transistor Count - $\mathbf{3 3}$ (without Buffers)
-39(with buffers)
- Power Dissipation $=1.4275 \mathrm{e}-010 \mathrm{~W}$
- Propagation Delay TPHL=2.539 nsec. ( worst case)
- Area

B Active=8670 sq. units
B Total=46200 sq. units
Additional Resources- Clock signal with larger Fan In

NOTE: All the Readings are taken for 4 bit Excess- 3 code + one ERROR signal which is used to indicate that the input is other than BCD.

## Design Steps:

1. DIGITAL DESIGN
2. VHDL DESIGN
3. TRANSISTOR SIZING
4. LAYOUT DESIGN

## 1.DIGITAL DESIGN:

- Truth table
- Karnaugh's map
- Logic diagram


## Truth Table:

| BCD INPUT | EXCESS-3 OUTPUT | ERROR |
| :---: | :---: | :---: |
| ABCD | W3W2W1W0(WXYZ) | ER |
| 0000 | 0011 | 0 |
| 0001 | 0100 | 0 |
| 0010 | 0101 | 0 |
| 0011 | 0110 | 0 |
| 0100 | 0111 | 0 |
| 0101 | 1000 | 0 |
| 0110 | 1001 | 0 |
| 0111 | 1010 | 0 |
| 1000 | 1011 | 0 |
| 1001 | 1100 | 0 |

Illegal input(ER ROR)

| 1010 | XXXX | 1 |
| :---: | :---: | :---: |
| 1011 | XXXX | 1 |
| 1100 | XXXX | 1 |
| 1101 | XXXX | 1 |
| 1110 | XXXX | 1 |
| 1111 | XXXX | 1 |

## Karnaugh's Map:

## Equations:

(W)W3=A OR (B AND (C OR D))
(X)W2=B XOR (C OR D)
(Y)W1=C XOR D
(Z)W0=NOT D
(ER)ERROR=A AND (B OR C)

## GRAPHICAL REPRESENTATIONS OF K-Map



$$
W=A+B C+B D
$$


$Y=C D+\bar{C} \bar{D}$


$$
\mathrm{X}=\overline{\mathrm{B}} \mathrm{C}+\mathrm{BD}+\mathrm{B} \overline{\mathrm{C}} \overline{\mathrm{D}}
$$


$Z=\overline{\mathrm{D}}$

Note: The Signals_ $\mathbf{W}=\mathbf{W}(\mathbf{3}), \mathbf{X}=\mathbf{W}(2), Y=W(1), Z=W(0)$ are used interchangeably through out the design.
Logic Diagram:

(Generated From Sinplicity)
2.VHDL :(Software Used is Cypress -Warp and Sinplify)

- code
- equations generated
- delays
- worst case path


## *Code

library ieee;
use ieee.std_logic_1164.all;
use ieee.std_logic_unsigned.all;
entity bcd_exc_3 is
port( A,B,C,D : in std_logic;
Error,W3,W2,W1,W0 : out std_logic
);
end bcd_exc_3;
architecture arch_bcd_exc_3 of bcd_exc_3 is
begin
W3<=A or (B and (C or D));
$\mathrm{W} 2<=\mathrm{B}$ xor ( C or D );
W1<=C xnor D;
W0<=not D;
Error $<=\mathrm{A}$ and (B or C);
end arch_bcd_exc_3;

## Signal information:

ABCD: -input signals with A as MSB and D as LSB
W3W2W1W0:-output signals with W3 as MSB and W0 as LSB.
ERROR:- signal used to indicate invalid input BCD number

## Equations Generated

error $=a * c+a * b$
$\mathrm{w} 0=/ \mathrm{d}$
$\mathrm{w} 1=\mathrm{c} * \mathrm{~d}+/ \mathrm{c} * / \mathrm{d}$
$\mathrm{w} 2=\mathrm{b} * / \mathrm{c} * / \mathrm{d}+/ \mathrm{b} * \mathrm{~d}+/ \mathrm{b} * \mathrm{c}$
$/ \mathrm{w} 3=/ \mathrm{a} * / \mathrm{c} * / \mathrm{d}+/ \mathrm{a} * / \mathrm{b}$

## Delays



## Worst C ase Path Summary

$\mathrm{tPD}=8.5 \mathrm{~ns}$ for w 2

## Critical path:

The critical Path is shown in RED colour

(generated by software simplicity)

## 3. Schematic Design:

- S-Edit Schematic
- Transistor sizing
- Load Testing
- Problems and Solution of Load Testing


## S-E DIT Diagram

1. Static CMOS Implementation:

## Static CMOS Design Schematic



## Design process followed in Schematic Design:

a. Design of circuit with minimum size transistors
ie $\operatorname{Ln}=0.8 \mathrm{um}$ and $\mathrm{Wn}=2 \mathrm{um}$ for NMOS
ie $\mathrm{Lp}=0.8 \mathrm{um}$ and $\mathrm{Wp}=6 \mathrm{um}$ for PMOS
The ratio of $\mathrm{Wp} / \mathrm{Wn}=3$.

(output waveforms without transistor sizing)
b.The problem faced is the glitches in the result as shown below-
without transistor sizing:
The glitches in the output are due to the charge sharing and the late arriving signals.

- The late arriving signals are connected near the output but it is not the complete remedy. So the next cause is charge sharing .The rising glitches are due to the PMOS and falling Glitches are due to NMOS .by transistor sizing this problem is solved.

2. Transistor sizing is done by iterative measurements i.e. by keeping one parameter constant and other changing i.e. varying the W/L ratio of PMOS for high to low glitch and of NMOS for low to high glitch the glitch is minimized.

## Load Testing:

The Circuit is Tested using the load of 200ff at each node. from waveforms obtained it is observed that the delay is increased due to output load.
This degradation of output is removed by designing a Buffer. by trial and error method the no of buffers used in cascade are 2 so that the logic is maintained and the out quality also increases. The output waveforms obtained with load is given below.

output waveform with load of 200ff without Buffer

## Static CMOS LAYOUT:

## Waveforms:

Input sequence for testing:

| A(msb) | 0000000011111111 |
| :--- | :--- |
| B | 0000111100001111 |
| C | 0011001100110011 |
| D(lsb) | 1010101010101010 |



LAYOUT OF BCD to EXCESS-3 CODE Converter using Static Logic.

With transistor sizing:
Input waveforms:


Output waveforms without load and with BUFFER


Output waveforms with load 200ff(with Buffer at the output):


## Delays in BCD to excess-3 code converter

Without load:

|  |  |  |  |  |  |
| :--- | :--- | :--- | :--- | :--- | :--- |
|  | W3 (ns) | W2 (ns) | W1 (ns) | W0 $(\mathrm{ns})$ | Error $(\mathrm{ns})$ |
| tplh | 7.3766 | 5.0983 | 3.2339 | 1.6268 | 4.1887 |
| tphl | 3.1156 | 4.3801 | 5.1374 | 2.1958 | 0.91456 |

## With load of 200ff :

|  |  |  |  |  |  |
| :--- | :--- | :--- | :--- | :--- | :--- |
|  | W3 (ns) | W2 (ns) | W1 (ns) | W0 (ns) | Error (ns) |
| tplh | 9.2220 | 6.6782 | 4.8016 | 3.1111 | 4.1959 |
| tphl | 5.6831 | 6.9615 | 7.7534 | 4.7236 | 9.1406 |

## Worst-case delays:

for W3:
When PMOS B_bar and C_bar or D_bar are conducting tplh__W3 = 7.5084e-009
when NMOS A_bar C_bar and D_bar are conducting tphl__W3 = 5.7456e-009
for W2:
when PMOS B_bar and C and D are conducting tplh__W2 $=4.5541 \mathrm{e}-009$
when NMOS B_bar C_bar and D_bar are conducting tphl__W2 $=4.5805 \mathrm{e}-009$
for W1:
when PMOS C_bar and D_bar or C and D are conducting tplh__W1 $=2.4564 \mathrm{e}-009$
when NMOS C_bar and D or C and D_bar are conducting tphl__W1 = 6.3411e-009
for W0:
depends on only D
tplh__W0 $=1.5708 \mathrm{e}-009$
tphl__W0 $=2.1738 \mathrm{e}-009$

## Power Consumed:

```
power = 7.4048e-011
From \(=0.0000 \mathrm{e}+000\)
To \(=2.0000 \mathrm{e}-007\)
```


## DYNAMIC LOGIC IMPLENTATION

## Dynamic Circuit BLOCK Schematic



$\mathrm{W}=\mathrm{A}+\mathrm{BC}+\mathrm{BD}$


X=B_bar.C+B.D+B.C_bar.D_bar


$\mathrm{Y}=\mathrm{CD}+\mathrm{C}$ _bar.D_bar


W=D_bar

$\mathrm{Er}=\mathrm{A}(\mathrm{B}+\mathrm{C})$

Schematic Circuit for BCD to EXCESS-3 code convertor using Dynamic Logic.

## Transistor sizing in Dynamic logic.

W varies from $2-8$ for inv. \& 4-16 for clk NMOs \& PMOS inv by 4-16u Measurement result summary $-\mathrm{a}=1 \mathrm{e}-006$
$\mathrm{tph}=3.6666 \mathrm{e}-010$
tphl $=7.3252 \mathrm{e}-010$
tphl $=6.0366 \mathrm{e}-008$
$\mathrm{tphl}=1.2066 \mathrm{e}-009$
$\mathrm{tplh}=9.6248 \mathrm{e}-010$
tplh $=3.5994 \mathrm{e}-010$
tphl $=6.1638 \mathrm{e}-010$

Measurement result summary - $\mathrm{a}=2 \mathrm{e}-006$
tphl $=3.5909 \mathrm{e}-010$
tphl $=7.9951 \mathrm{e}-010$
tphl $=3.8034 \mathrm{e}-010$
tphl $=1.0505 \mathrm{e}-009$
$\mathrm{tplh}=8.9149 \mathrm{e}-010$
tplh $=2.9400 \mathrm{e}-010$
tphl $=3.8159 \mathrm{e}-010$

Measurement result summary $-\mathrm{a}=3 \mathrm{e}-006$
tphl $=3.3364 \mathrm{e}-010$
tphl $=7.6720 \mathrm{e}-010$
tphl $=3.3674 \mathrm{e}-010$
tphl $=1.0021 \mathrm{e}-009$
tplh $=8.3782 \mathrm{e}-010$
$\mathrm{tplh}=2.6435 \mathrm{e}-010$
tphl $=3.2539 \mathrm{e}-010$
Measurement result summary $-\mathrm{a}=4 \mathrm{e}-006$
tphl $=3.0872 \mathrm{e}-010$
tphl $=7.9940 \mathrm{e}-010$
tphl $=3.0806 \mathrm{e}-010$
$\mathrm{tphl}=9.5909 \mathrm{e}-010$
tplh $=8.3673 \mathrm{e}-010$
tplh $=2.4374 \mathrm{e}-010$
tphl $=2.9360 \mathrm{e}-010$
Power Results for $\mathrm{pw}=40 \mathrm{n} \& \mathrm{clk} \mathrm{pw}=20 \mathrm{n}$
vdd from time 0 to $4 \mathrm{e}-008$
integral power consumed -> $8.840991 \mathrm{e}-011$ watts
The above reading show a favourable trend towards increasing the size of $\mathrm{W}=16 \mathrm{u}$ for Clk NMOS Transistor since it needs to pass current in small time to reduce Tphl.


Layout of BCD to EXCESS-3 code converter using dynamic Logic.


Input signals clk, D, C, B, A

contd.


Without using Buffer $Z, X, W, Y$ respectively. Measurement result summary
tphl__Y $=2.0327 e-009$ tphl_X $=2.7627 e-009$
tphl__W $=2.4527 e-009$
tphl__Z $=1.0307 e-009$
power $=1.1839 \mathrm{e}-010$


With using Buffer $\mathrm{Z}, \mathrm{X}, \mathrm{W}, \mathrm{Y}$ respectively

```
Measurement result summary
```

$$
\begin{aligned}
& \operatorname{tph} 1 \_Y=1.4528 \mathrm{e}-009 \\
& \operatorname{tph} \_\_X=2.7153 \mathrm{e}-009 \\
& \text { tphl_K }=2.5934 \mathrm{e}-009 \\
& \text { tphl_Z }=1.0255 \mathrm{e}-009 \\
& \text { power }= \\
& 1.4275 \mathrm{e}-010
\end{aligned}
$$

## Comparison of Static vs. dynamic Logic

## Combinational Logic: Static versus D ynamic

- Static:
- At every point in time (except during the switching transient), each gate output is connected to either V do or V ss via a lowresistance path.
- Slower and more complex than dynamic but "safer".
- Dynamic:
- Rely on the temporary storage of signal values on the capacitance of highimpedance circuit nodes.
- Simplier in design and faster than static but more complicated in operation and are sensitive to noise.


Complementary Pseudo-NMOS


## Combinational (Non-Regenerative) Circuits

- We've already looked at full complementary design.
- In summary.


Suppose PDN implements G.
But G is connected to GND, so it implements the inverse $F=\bar{G}$

The PUN must implement $F$, since it's connected to $\mathrm{V}_{\mathrm{DD}}$.

- Therefore, the following must hold.

$$
\overline{G\left(I n_{1}, I n_{2}, I n_{3}, \ldots\right)}=F\left(\overline{I n_{1}}, \overline{I n_{2}}, \overline{I n_{3}}, \ldots\right)
$$

- This condition is met if (but not only if) $F$ and $G$ are dual equations, e.g., AND s in $F$ are OR s in $G$.


## Complementary CMOS G ates

- Static CMOS gates inherit the nice properties of the basic CMOS inverter.
- High noise margins.
- No static power consumption.
- Comparable rise and fall times (under the appropriate scaling conditions).
- The last point needs further clarification:
- This is true if the PUN and PDN networks have identical current-driving capabilities.
- For the inverter, this required that p-transistors be widened by a factor of $m_{n} / m_{p}$.
- This is complicated for complex gates since the current driving capabilities are determined by the values of the input signals as well.
-As we've done in the lab, characterize based on the worst case.


## Complementary CMOS G ates

- Performing a manual analysis of the dynamic behavior of complex gates is only tractable via a switch model.
- Here, the transistor is modeled as a switch with an infinite offresistance and a finite on resistance, $\mathbf{R}$ on .
- R on is chosen so that the equivalent RC-circuit has a propagation delay identical to the original transistor-capacitor model.
- R on is inversely proportional to the $\mathrm{W} / \mathrm{L}$ ratio but varies during the switching transient.
- As we did for (dis)charge currents, we estimate $R$ on at the endpoints of the transitions, e.g., for $\mathrm{t}_{\mathrm{pHL}}$ :

$$
R_{o n}=\frac{1}{2}\left(R_{N M O S}\left(V_{o u t}=V_{D D}\right)+R_{N M O S}\left(V_{o u t}=\frac{V_{D D}}{2}\right)\right)
$$

## Complementary CMOS G ates

- This simplifies to:

$$
R_{o n}=\frac{1}{2}\left[\left(\frac{V_{D S}}{I_{D}}\right)_{V_{o u t}=V_{D D}}+\left(\frac{V_{D S}}{I_{D}}\right)_{V_{o u t}=\frac{V_{D D}}{2}}\right]
$$

- Thetext gives an example using 1.2 mm CMOS process with V DD $=5 \mathrm{~V}$.
- $R_{n}$ is found to be 9.7 k W for $t_{p H L}$ and $R_{p}$ is 9.6 kW for a W/ L eff of 2 and 6 respectively.
- Deriving propagation delay can be doneby analyzing the RC network.



## Complementary CMOS G ates

- Propagation delay is computed for the worst-case delay over all possible input combinations.
-For the two-input NAND, the worst-case rise time occurs for one PMOS:

$$
t_{p L H}=0.69 R_{p} C_{L}
$$

- H owever, the worst-case (only) fall time occurs for two series NMOS:
$t_{p H L}=2 \times 0.69 R_{n} C_{L}$
-This suggests the a 2-to-1 width scaling factor of NMOS to PMOS.


1 is a unit-sized transistor.

Assumes PMOS is triple the resistance of NMOS

## Complementary CMOS G ates

- This analysis indicates the deficiencies of implementing gates with largefan-in values:
- A gate with N inputs requires 2 N transistors.
- Other circuit styles require at most $\mathrm{N}+1$ transistors, which can be a substantial advantage in area, e.g., 8 versus 5 for a 4-input gate.
- The propagation delay of a complementary gate deteriorates rapidly as a function of fan-in.
- First, the larger number of transistors increases the overall capacitance of the gate.
- Second, the series connection in the PUN and PDN slows the gate.
ß Widening does not improve the performance as much as predicted, since widening increases
gate and diffusion capacitance.
- Fan-out in complementary gates has a larger impact on gate delay than in other circuit styles.
- Downstream gate capacitance is always two per fan+ut in contrast to one in other styles.


## Complementary CMOS G ates

- Fan-in and fan-out modeling:
$t_{p}=a_{1} F I+a_{2} F I^{2}+a_{3} F O$
- $\mathrm{a}_{1}, \mathrm{a}_{2}$ and $\mathrm{a}_{3}$ are technology-dependent weighting factors.
- The linear dependence on fan-out is easy to understand since load increases linearly with fanout.
- There is a quadratic dependence on fan-in since increasing fan-in raises both $C_{L}$ and (dis)charging resistance in a linear way (under no scaling).



## Complementary CMOS G ates

- Several approaches may be used to alleviate this problem:
- Transistor sizing
- Increasing size decreases the second-order factor in thet ${ }_{p}$ expression.
- However, as indicated above, if load is dominated by intrinsic capacitance (self-loading), propagation delay is not improved.
- Progressive transistor sizing
- Previous analysis lumped capacitance at the output nodeand internal node capacitance was ignored.
- This model becomes increasingly inaccuratefor largefan-in.


While $\mathrm{M}_{\mathrm{N}}$ has to conduct the discharge current of the load capacitance, $\mathrm{C}_{\mathrm{L}}, \mathrm{M}_{1}$ has to carry the discharge current $\mathrm{C}_{\text {tot }}=\mathrm{C}_{\mathrm{L}}+\ldots+\mathrm{C}_{2}+\mathrm{C}_{1}$

Therefore, progressive scaling is beneficial:
$\mathrm{M}_{1}>\mathrm{M}_{2}>\ldots>\mathrm{M}_{\mathrm{N}}$

## Complementary CMOS G ates

- Transistor ordering
- N ot all input signals to a gate arrive at the same time.
- Let's call the last arriving input signal critical , which is propagated by a critical path .
- Putting the critical-path transistor closer to the output of the gate can result in a speed-up.



## Complementary CMOS G ates

- Improved Logic Design

Alternative



Alternative
2 :


Alternative


## Complementary CMOS G ates

- Use A nother Circuit Style
- Ratioed
- Pass-transitor logic
- Plus others to be discussed
- These techniques deal with improving performance of gates with largefan-ins.
-Often speed is dominated by thefan-out factor.
- Scaling the transistors up in complex logic gates to drive large loads is expensive in terms of area.
- Instead, a buffer (an inverter, or sequence of inverters) can be inserted between the complex gate and thefan-out.
- Scaling is applied to the buffer transistors -- the complex gate uses minimum size transistors.


## Conclusion

Although one of the goals of Ultra-Low-Power CMOS is to maintain portability of existing circuits, this can be achieved only to a limited extent. This is due to the fact that most high-performance digital VLSI systems employ dynamic circuits (such as domino logic) to achieve higher speed and density Unfortunately, dynamic circuits require a small off-state current which sets a lower limit to the threshold voltage. Although dynamic circuits would work down to 500 mV supply voltage with high power efficiency, they would not meet the performance requirements. Static logic, on the other hand, is very robust against high leakage currents and can therefore be scaled to much lower voltages than dynamic logic. Unfortunately, static logic consumes more space on a chip than dynamic logic. There are, however, alternatives to pure static or dynamic circuits, which combine the robustness of static logic with the speed and density of dynamic logic. For example, one could use latched dynamic logic, pseudo-NMOS logic (for large-fan-in NOR gates), or cascade voltage switch logic (CVSL) in time-critical signal paths

## BIBLOGRAPHY

1. DIGITAL INTEGRATED CIRCUITS- JAN M. RAEBAY.
2. CMOS VLSI DESIGN -WESTE AND ESHRAGHIAN
3. CMOS VLSI CIRCUIT LAB LABACEE.
