480 likes | 611 Views
This lecture discusses the current state of VLSI scaling, focusing on scaling rules, historical trends, and future predictions in semiconductor technology. Key topics include the effects of scaling on features such as channel length, width, oxide thickness, and voltage. Additionally, the importance of predicting technological advancements and their impact on design decisions is emphasized. The session highlights the need to understand the limits of scaling and the potential for competing effectively in a rapid technology landscape.
E N D
CS184a:Computer Architecture(Structure and Organization) Day 6: January 22, 2003 VLSI Scaling
Today • VLSI Scaling Rules • Effects • Historical/predicted scaling • Variations (cheating) • Limits
Why Care? • In this game, we must be able to predict the future • Rapid technology advance • Reason about changes and trends • re-evaluate prior solutions given technology at time X.
Why Care • Cannot compare against what competitor does today • but what they can do at time you can ship • Careful not to fall off curve • lose out to someone who can stay on curve
Scaling • Premise: features scale “uniformly” • everything gets better in a predictable manner • Parameters: • l (lambda) -- Mead and Conway (class) • S -- Bohr • 1/k -- Dennard
Feature Size l is half the minimum feature size in a VLSI process [minimum feature usually channel width]
Scaling • Channel Length (L) • Channel Width (W) • Oxide Thickness (Tox) • Doping (Na) • Voltage (V)
Scaling • Channel Length (L) l • Channel Width (W) l • Oxide Thickness (Tox) l • Doping (Na) 1/l • Voltage (V) l
Area Capacitance Resistance Threshold (Vth) Current (Id) Gate Delay (tgd) Wire Delay (twire) Power Effects?
l l/k A = L * W A A/k2 0.35mm 0.25mm 50% area 2x capacity same area Area
Area Perspective [2000 tech.] 18mm18mm 0.18mm 60G l2
Capacitance per unit area Cox= eSiO2/Tox ToxTox/k Cox k Cox Capacitance
Gate Capacitance Cgate= A*Cox A A/k2 Cox k Cox Cgate Cgate /k Capacitance
VTHVTH /k Threshold Voltage
Saturation Current Id=(mCOX/2)(W/L)(Vgs-VTH)2 Vgs=VV/k VTHVTH /k WW/k Cox k Cox IdId/k Current
tgd=Q/I=(CV)/I VV/k IdId/k CC/k tgd tgd /k Gate Delay
R=rL/(W*t) WW/k L, t similar R k R Resistance
twire=RC R -> k R C-> C/k twire -> twire …assuming (logical) wire lengths remain constant... Assume short wire or buffered wire (unbuffered wire ultimately scales as length squared) Wire Delay
Resistive Power P=V*I VV/k IdId/k PP/k2 Power Dissipation (Static)
Capacitive (Dis)charging P=(1/2)CV2f VV/k CC/k PP/k3 Increase Frequency? f kf ? P P/k2 Power Dissipation (Dynamic)
Area 1/k2 Capacitance 1/k Resistance k Threshold (Vth) 1/k Current (Id) 1/k Gate Delay (tgd) 1/k Wire Delay (twire) 1 Power 1/k21/k3 Effects?
ITRS Roadmap • Semiconductor Industry rides this scaling curve • Try to predict where industry going • (requirements…self fulfilling prophecy) • http://public.itrs.net
S=0.7 [0.5x per 2 nodes] Pitch Gate MOS Transistor Scaling(1974 to present) [from Andrew Kahng] Source: 2001 ITRS - Exec. Summary, ORTC Figure
Poly • Pitch • Metal • Pitch (Typical MPU/ASIC) (Typical DRAM) Half Pitch (= Pitch/2) Definition [from Andrew Kahng] Source: 2001 ITRS - Exec. Summary, ORTC Figure
1994 NTRS - .7x/3yrs Log Half-Pitch Actual - .7x/2yrs 0.7x 0.7x Linear Time 250 -> 180 -> 130 -> 90 -> 65 -> 45 -> 32 -> 22 -> 16 0.5x Node Cycle Time (T yrs): *CARR(T) = [(0.5)^(1/2T yrs)] - 1 CARR(3 yrs) = -10.9% CARR(2 yrs) = -15.9% N N+1 N+2 * CARR(T) = Compound Annual Reduction Rate (@ cycle time period, T) Scaling Calculator + Node Cycle Time: [from Andrew Kahng] Source: 2001 ITRS - Exec. Summary, ORTC Figure
[from Andrew Kahng] Source: 2001 ITRS - Exec. Summary, ORTC Figure
Delays? • If delays in gates/switching? • If delays in interconnect? • Logical interconnect lengths?
Delays? • If delays in gates/switching? • Delay reduce with 1/k [l]
Delays • Logical capacities growing • Wirelengths? • No locallityk (slower!) • Rent’s Rule • L n(p-0.5) • [p>0.5]
Compute Density • Density = compute / (Area * Time) • k3>compute density scaling>k • k3: gates dominate, p<0.5 • k2: moderate p, good fraction of gate delay • [p from Rent’s Rule again – more on Day12] • k: large p (wires dominate area and delay)
Power Density • P-> P/k2 (static, or increase frequency) • P-> P/k3 (dynamic, same freq.) • A -> A/k2 • P/A P/A … or … P/kA
Cheating… • Don’t like some of the implications • High resistance wires • Higher capacitance • Need for more wiring • Not scale speed fast enough
R=rL/(W*t) WW/k L, t similar R k R Improving Resistance • Don’t scale t quite as fast. • Decrease r (copper)
Capacitance per unit area Cox= eSiO2/Tox ToxTox/k Cox k Cox Improving Capacitance Reduce Dielectric Constant e
Typical chip cross-section illustrating hierarchical scaling methodology Passivation Dielectric Wire Etch Stop Layer Via Global (up to 5) Dielectric Capping Layer Copper Conductor with Barrier/Nucleation Layer Intermediate (up to 4) Local (2) Pre Metal Dielectric Tungsten Contact Plug [from Andrew Kahng]
tgd=Q/I=(CV)/I VV/k Id=(mCOX/2)(W/L)(Vgs-VTH)2 IdId/k CC/k tgd tgd /k Improving Gate Delay Don’t scale V: VV IkI tgdtgd /k2 • Lower C. • Don’t scale V.
Capacitive (Dis)charging P=(1/2)CV2f VV/k CC/k PP/k3 Increase Frequency? f kf ? P P/k2 …But Power Dissipation (Dynamic) If not scale V, power dissipation not scale.
…AndPower Density • PP(increase frequency) • P> P/k(dynamic, same freq.) • A A/k2 • P/A kP/A … or … k2P/A • Power Density Increases
Physical Limits • Doping? • Features?
Physical Limits • Depended on • bulk effects • doping • current (many electrons) • mean free path in conductor • localized to conductors • Eventually • single electrons, atoms • distances close enough to allow tunneling
What Is A “Red Brick” ? • Red Brick = ITRS Technology Requirement with no known solution • Alternate definition: Red Brick = something that REQUIRES billions of dollars in R&D investment [from Andrew Kahng]
The “Red Brick Wall” - 2001 ITRS vs 1999 Source: Semiconductor International - http://www.e-insite.net/semiconductor/index.asp?layout=article&articleId=CA187876 [from Andrew Kahng]
Big Ideas[MSB Ideas] • Moderately predictable VLSI Scaling • unprecedented capacities/capability growth for engineered systems • change • be prepared to exploit • account for in comparing across time
Big Ideas[MSB-1 Ideas] • Uniform scaling reasonably accurate for past couple of decades • Area increase k2 • Real capacity maybe a little less? • Gate delay decreases (1/k) • Wire delay not decrease, maybe increase • Overall delay decrease less than (1/k)