560 likes | 899 Views
Chapter 14. Arithmetic Circuits. Rev. 1.0 05/12/2003 Rev. 2.0 06/05/2003. A Generic Digital Processor. Building Blocks for Digital Architectures. Arithmetic and Unit. Bit-sliced datapath. ( adder, multiplier, shifter, comparator, etc.). -. Memory. - RAM, ROM, Buffers, Shift registers.
E N D
Chapter 14 Arithmetic Circuits Rev. 1.0 05/12/2003 Rev. 2.0 06/05/2003
Building Blocks for Digital Architectures Arithmetic and Unit Bit-sliced datapath (adder, multiplier, shifter, comparator, etc.) - Memory - RAM, ROM, Buffers, Shift registers Control - Finite state machine (PLA, random logic.) - Counters Interconnect - Switches - Arbiters - Bus
Intel Microprocessor Itanium has 6 integer execution units like this
Itanium Integer Datapath Fetzer, Orton, ISSCC’02
Full-Adder (FA) Generate (G) = AB Propagate (P) = A B Å Delete = A B
Boolean Function of Binary Full-Adder CMOS Implementation
Express Sum and Carry as a function of P, G, D Define 3 new variable which ONLY depend on A, B Generate (G) = AB Propagate (P) = A B Å Delete = A B S C D and P Can also derive expressions for and based on o Note that we will be sometimes using an alternate definition for + Propagate (P) = A B
A B A B A B A B 0 0 1 1 2 2 3 3 C C C C C i ,0 o ,0 o ,1 o ,2 o ,3 FA FA FA FA = ( C ) i ,1 S S S S 0 1 2 3 Ripple-Carry Adder Critical Path Worst-case delayis linear with the number of bits tadder = (N-1)tcarry + tsum td = O(N) • Propagation delay (or critical path) is the worst-case delay over all possible input patterns • A= 0001, B=1111, trigger the worst-case delay • A: 0 1, and B= 1111 fixed to set up the worst-case delay transition.
Complimentary Static CMOS Full Adder 28 Transistors • Logic effort of Ci is reduced to 2 (c.f., A and B signals) • Ci is late arrival signal near the output signal • Co needs to be inverted Slow down the ripple propagate
Minimize Critical Path by Reducing Inverting Stages • Exploit Inversion Property • Reduce One inverter delay in each Full-adder (FA) unit
A Better Structure: The Mirror Adder Exploring the “Self-Duality” of the Sum and Carry functions
Mirror Adder Design • The NMOS and PMOS chains are completely symmetrical • A maximum of two series transistors can be observed in the carry-generation circuitry for good speed. • When laying out the cell, the most critical issue is the minimization of the capacitance at node Co. • The capacitance at node Co is composed of four diffusion capacitances, two internal gate capacitances, and six gate capacitances in the connecting adder cell . • The transistors connected to Ci are placed closest to the output.
Transmission-Gate Full Adder (24T) • Same delay for Sum and Carry Multiplier design
Manchester Carry-Chain Adder Static Circuits Dynamic Circuits
Carry-Bypass Adder Also called Carry-Skip
Carry-Bypass Adder (cont.) tadder = tsetup + Mtcarry + (N/M-1)tbypass + (M-1)tcarry + tsum M bits form a Section (N/M) Bypass Stages
Carry Ripple versus Carry Bypass Wordlength (N) > 4~8 is better for Bypass Adder
Square Root Carry Select N-bit adder with P stages, 1st stage adds M bits
Look-Ahead: Topology Expanding Lookahead equations: All the way:
MxN Array Multiplier — Critical Path Critical Path 1 & 2
Multipliers —Summary • Identify Critical Paths • Other Possible techniques: • Data Encoding (Booth) • Logarithmic v.s. Linear (Wallace Tree Multiplier) • Pipelining
The Barrel Shifter Area Dominated by Wiring
4x4 barrel shifter Widthbarrel ~ 2 pm M
0-7 bit Logarithmic Shifter A 3 Out3 A 2 Out2 A 1 Out1 A 0 Out0
Summary • Datapath designs are fundamentals for high-speed DSP, Multimedia, Communication digital VLSI designs. • Most adders, multipliers, division circuits are now available in Synopsys Designware under different area/speed constraint. • For details, check “Advanced VLSI” notes, or “Computer Arithmetic” textbooks