Binding, Allocation and Floorplanning in Low Power High-Level Synthesis

Binding, Allocation and Floorplanning in Low Power High-Level Synthesis A. Stammermann, D. Helms, M. Schulte OFFIS Research Institute A. Schulz, W. Nebel Univ. Of Oldenburg

Outline • Motivation • Background • Optimization Algorithm • Evaluation • Results • Conclusion ICCAD 2003

Motivation Within actual CMOS-technologies (0.1m) 80-95% of the power consumption emerges from charging and discharging capacitances. VDD Input Output Switching activity  Capacitance of • gates at the output • connecting wires C GND Pdyn =  C VDD2 f Optimization target ICCAD 2003

Capacitance Distribution (ITRS Roadmap 2001) Increasing share of interconnect capacitance Interconnect has to be considered during high-level synthesis. ICCAD 2003

Summary • Capacitance contributes linear to power dissipation • Wire capacitance is increasing • Wire capacitance is dominated by its length  Thus it is important that accurate physical information is used during high-level synthesis ICCAD 2003

rough Floorplanning detailed Floorplanning ASIC-Cells Chip Design RT-netlist Generation adder, multiplier Gate-netlist Generation Increasing level of details Floorplanning ASIC-Cells Interconnect ICCAD 2003

+1 +2 +3 +4 +5 cstep 1 +1 +1 cstep 2 +2 +2 cstep 3 +3 +4 +3 +4 +5 cstep 4 +5 RT-Netlist Generation Scheduling When are operations executed Allocation How many resources Binding Which operation is executed on which resource ICCAD 2003

+1 +1 +2 +2 +3 +4 +3 +4 +5 +5 RT-Netlist Generation Reg1 Reg2 Reg3 Reg1 Reg2 Reg3 ADD1,2 ADD3 ADD4,5 ADD1 ADD2,3 ADD4,5 ICCAD 2003

0 0 1 1 0 0 1 1 1 1 0 0 1 1 0 0 +1 +2 0 1 0 1 1 0 1 0 0 1 0 1 1 0 1 0 ADD Impact on Power Dissipation • Binding/Allocation • Interleaving changes the characteristic of input streams:The switching activity  at the input of resources (and according wires) can increase or decrease. • Influences netlist topology:The wire length resp. capacitance can increase or decrease. ICCAD 2003

Power Estimation • Switching activity Simulation of behaviour description • Power consumption of resources (e.g. adder) Power models • Wire capacitance Capacitance model Floorplan ICCAD 2003

x x y y 3 * 1 + + * 5 3 5 2 4 1 2 4 Floorplanning • Slicing Floorplans • Standard • Supports softmacros (leafs are flexible in their aspect-ratio) • Efficient representation as binary tree vertical horizontal ICCAD 2003

Optimization Algorithm Power optimization algorithm for RT-level netlist with regard to interconnect power • Performs simultaneoulsy • Slicing-tree structured based floorplanning • Functional unit binding and allocation • Changes in the netlist topology are mended immediately in the actual floorplan • In contrast to previous approaches a generation from scratch is not necessary • Non-deterministic, iterative optimization technique • Simulated Annealing • Cost function: PFE + PInter + Area ICCAD 2003

2 1 3 4 * + 3 * + + + 2 3 2 4 1 4 1 1 2 4 3 Floorplan-Annealing Simulated Annealing based floorplanner Floorplan moves • F1: Exchange two leafs • F2: Exchange leaf and node • F3: Change direction • F4: Exchange two nodes • F5: Displace a leaf / node F2: Exchange leaf and node ICCAD 2003

Initial Floorplan new costs < old costs Stopping criteria met? Optimized floorplan Floorplan-Annealing Floorplan move Fi random number < e-Cost/T Undo move Fi N N Y Y Y N Y ICCAD 2003

Architecture-Annealing Architecture moves (mutate binding/allocation) • A1: Share • Merge two resources res1 and res2 to one single resource • res1 und res2 must be instances of the same type (e.g. ripple adder) • A2: Split • Inverse of Share • Split a single resource into two resources • A3: Swap • Swap the inputs of commutative operations A1-A3 in combination are able to create every possible binding solution ICCAD 2003

Architecture-Annealing Changes in the architecture are mended immediately in the actual floorplan. • Moves A1-A3 may cause resources to vanish or appear. • Moves A1-A3 changes the netlist topology. The floorplan is not optimal anymore after a move A1-A3. Solution: • Each move A1-A3 is followed by a floorplan update (annealing). • Supporting softmacros. • Optimal inserting of new resources into floorplan. ICCAD 2003

new costs < old costs Stopping criteria met? Optimized architecture/floorplan solution Algorithm Initial architecture/floorplan solution Each architecture move is followed by a floorplan-annealing Architecture move Aj floorplan- annealing random number < e-Cost/T • Undo move Aj • Undo floorplan update N N Y Y N Y ICCAD 2003

new 1 1 3 3 2 new 4 1 2 4 hardmacros 8.2 LE 3 1 2 1 4 3 3 best position 2 new 4 2 new 4 4.6 LU softmacros Inserting new resources ICCAD 2003

Evaluation 3 experiments are performed: • Full parallelEach operation is mapped on one single resource (no resource sharing). • ConsecutiveBinding optimization and floorplan optimization are executed consecutively. This interconnect unaware optimization is similarly to the traditionally procedure in high-level synthesis. • SimultaneousBinding and floorplanning is optimized simultaneously. ICCAD 2003

Results ICCAD 2003

Conclusion • Compared to consecutively (traditionally) procedure the proposed technique • reduces interconnect power almost by half • increases the functional unit power insignificantly • The CPU times vary from 6 seconds (diffeq) to 138 seconds (turbo_decoder).[1,0 GHz Athlon™ based PC with 256 MB memory] ICCAD 2003

Thank You! ICCAD 2003

Capacitance model Capacitance extracted from layout vs. capacitance from our model (0,35 m) ICCAD 2003

ITRS Roadmap 2001International Technology Roadmap for Semiconductors ICCAD 2003

Cross Section on Die ICCAD 2003

Binding, Allocation and Floorplanning in Low Power High-Level Synthesis