1 / 24

A Diagonal-Interconnect Architecture and Its Application to RISC Core Design

A Diagonal-Interconnect Architecture and Its Application to RISC Core Design. Mutsunori Igarashi, Takashi Mitsuhashi, Andy Le, Shardul Kazi, Yang-Trung Lin, Aki Fujimura, Steve Teig ISSCC 2002 Additional material from http://www.xinitiative.org/. 141< 100+ 100 ~30% less wirelength.

mei
Download Presentation

A Diagonal-Interconnect Architecture and Its Application to RISC Core Design

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Diagonal-Interconnect Architecture and Its Application to RISC Core Design Mutsunori Igarashi, Takashi Mitsuhashi, Andy Le, Shardul Kazi, Yang-Trung Lin, Aki Fujimura, Steve Teig ISSCC 2002 Additional material from http://www.xinitiative.org/

  2. 141< 100+ 100 ~30% less wirelength 141.4214 100 100

  3. benefits • Circuit efficiency • Wirelength reduction = 20% • Via-count reduction= 30% • Core-area reduction = 10% • Wirelength reduction makes the routing problem easier to solve, resulting in faster timing closure, improved reliability and a reduction in signal integrity problems.

  4. W:21% V:39% W:13% V:28% W:22% V:32% W:25% V:37% Register-to-register path delays RISC CORE DESIGN Technology = 0.18um metal layers = 6 750Kgates + SRAM Target Frequency = 200 MHz • Critical path delay improvement of 0.99ns • 19.8% faster operating frequencies

  5. X routing in commercial products • June 2004 – Toshiba Corp. X architecture diagonal routing for metal layers 4 and 5. • TC90400XBG : 130-nanometer, 2.7-million-gate design for the digital video broadcast and home entertainment markets. • "The X design was timing out at 11 percent faster — about 180 MHz — than the conventional design, and the logic area was 10 percent smaller.” • Takashi Yoshimori. Toshiba technology executive for SoC design Yield = 80% (for first wafer)

  6. hurdles • Yield questions • Mask generation and cost • Design and verification tools

  7. Simple Routing Example R R D Ito et al “Diagonal routing in high performance microprocessor design,”ASP-DAC 24-27 Jan. 2006

  8. Interested in X-routing algorithms? • Das et al“Manhattan-diagonal routing in channels and switchboxes,”ACM Trans. Des. Autom. Electron. Syst. 9, 1 (Jan. 2004), 75-104 • Ito et al “Diagonal routing in high performance microprocessor design,”ASP-DAC 24-27 Jan. 2006

  9. Track Assignment: A Desirable Intermediate Step Between Global Routing and Detailed Routing Shabbir Batterywala, Narendra Shenoy, William Nicholls and Hai Zhou Presenter: Yiwen Shi

  10. Background & Motivation • Given the entire subregions by the global router, searching in detailed routing stage and rip-up & re-route stage are very time consuming • Simple two stages of global routing & detailed routing cannot appropriately address problems arising from signal delay, cross-talk and process constraints • Integrate an intermediate stage of track assignment between global and detailed routing! Benefit: • Fully utilizes the information generated by the global router • The efficient usage of routing resources since most of the routes are laid in straight lines • Track assignment introduces a stage where nets are routed in parallel • The neighborhood information for large nets can be easily calculated  optimize DFM issues

  11. Track Assignment • Assign long global routes (routes running over at least one whole global cell), iroutes, to underlying routing resources • The routing resources are present in gridlines (tracks) and vias

  12. Cost Metrics • Plane: put longer iroutes on higher layers • Track obstruction: capture assignability of an iroute to a track • Via obstruction: prevent iroutes from being assigned to tracks that will be hard to connect • Planar anchoring: minimize jogs & wire length – assign iroute as close as possible to other net components and assignd iroutes of its underlying net • Via anchoring: minimize vias – similar to planar anchoring, in z-direction • Other consideration: DFM issues & interplay across these cost components (weights)

  13. Select a set of mutually conflicting iroutes (largest clique) and use a minimum weighted bipartite matching (shortest augmenting path) to assign them to different tracks Look-Ahead: If any iroute becomes uniquely assignable to a track, we accept this assignment immediately Weighted Bipartite Matching & Look-Ahead Heuristic Iroute overlap & Bipartite assignment & Combination Without Look-Ahead With Look-Ahead NP complete problem

  14. Top level track assignment algorithm

  15. Experimental Results & Conclusions • The suggested track assignment stage reduces the run-time of overall routing process Future work: • Incorporate other costs which address crosstalk, timing, congestion, antennae effect and other DFM problems • Calculate gridlines efficiently in an environment where wirewidths could differ on the same layer

  16. BoxRouter: A New Global Router Based on Box Expansion and Progressive ILPMinsik Cho and David Z. PanECE Dept. Univ. of Texas at AustinDAC 2006, July 24-28

  17. Introduction • Global Routing – plans approximate route of each net to reduce complexity of detailed router • Goal: Optimize wire density during global routing • Improve manufacturability • Potential to feedback interconnect information

  18. Steps • PreRouting captures congested areas • BoxRouting starts in most congested area and expands box to cover entire chip • Progressive integer linear programming (ILP) technique to route wires in box • Maze routing algorithm for rest of wires • PostRouting reroutes wires without rip-up • Parameter controls trade-off between length and routability

  19. BoxRouter • Route as many wires inside box as possible with ILP • Maze routing algorithm if ILP fails

  20. Algorithm

  21. Steps 2) Box around congested area 3) ILP routing between G-cells 1) PreRouting – Identify congested area 6) Repeat 4) Maze routing 5) Expand box

  22. PostRouting • Start from congested area • Reroute wires to reduce length (if possible) • Reroute surrounding wires • Repeat • Parameter controls cost function • Wirelength vs. routability

  23. Experimental Results • Larger box expansion can improve results at a cost of runtime • Compared to Labyrinth • Reduce wirelength by 14.3% • Reduce overflow by 91.7% • Compared to Fengshui • Reduce overflow by 79% • Compared to multicommodity flow-based router • 15.7x faster • 4.2% shorter wirelength

More Related