1 / 15

An Operand Routing Network for an SFQ Reconfigurable Data-Paths Processor

An Operand Routing Network for an SFQ Reconfigurable Data-Paths Processor. I. Kataeva 1 , H. Akaike 1 , A. Fujimaki 1 , N. Yoshikawa 2 , N. Takagi 3 , K. Murakami 4 1 Dept. of Quantum Engineering, Nagoya University 2 Dept. of Electrical and Computer Engineering, Yokohama National University

bill
Download Presentation

An Operand Routing Network for an SFQ Reconfigurable Data-Paths Processor

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. An Operand Routing Network for an SFQ Reconfigurable Data-Paths Processor I. Kataeva1, H. Akaike1, A. Fujimaki1, N. Yoshikawa2, N. Takagi3, K. Murakami4 1Dept. of Quantum Engineering, Nagoya University 2Dept. of Electrical and Computer Engineering, Yokohama National University 3Dept. of Information Engineering, Nagoya University 4Research Institute for Information Technology, Kyushu University

  2. Outline • Architectures of the Operand Routing Network • NDRO-based ORN • Crossbar-based ORN • A crossbar switch with a multicasting function • Experimental results • Crossbar switches • A 1-to-2 ORN prototype • Conclusion

  3. ORN ... FPU ORN : : : : ... ORN ... ORN SB : : : ... : : LM SMAC Reconfigurable Data-path processoran architecture suitable for SFQ implementation • two-dimensional array of Floating Point Units • connected using Operand Routing network • ORN is reconfigured to fit a DFG

  4. Requirements for the Operand Routing Network • each FPU can be connected to one or more FPUs in the next row • connections are established between the FPUs in the immediate vicinity of each other • number of the connections, N, is odd • one FPU’s output can be connected to either or both inputs of an FPU in the next stage FPU FPU FPU FPU FPU FPU FPU FPU FPU FPU

  5. NDRO-based Operand Routing Network FPU FPU FPU FPU NDRO FPU FPU FPU FPU NDRO NDRO FPU FPU FPU FPU NDRO NDRO FPU FPU FPU FPU NDRO FPU FPU FPU FPU • “–”: • irregular non-pipelined structure • with the increase of the complexity becomes cumbersome • “+”: • 2×M×N NDRO cells • small number of Josephson junctions

  6. Crossbar-based Operand Routing Network CB FPU FPU FPU ½CB FPU CB CB FPU FPU FPU FPU CB ½CB CB FPU FPU CB FPU ½CB FPU CB FPU FPU FPU CB FPU ½CB CB FPU FPU CB FPU ½CB FPU CB • “+”: • scalable • pipelined • easily re-designed for any number of N and M • “–”: • large number of Josephson junctions • M - ½ CB and (2×M+1)(N-1)/2 - CB

  7. Comparison of the ORN architectures NDRO-based ORN Number of JJs of NDRO-based ORN in a table is an estimation based on a design of the switch for RDP prototype (N=3, M=4) that consisted of 3420 JJs (Iwasaki, not published yet) Crossbar-based ORN A crossbar switch with broadcasting function: 296 JJs

  8. Crossbar switch with a multicasting function, ver.1 00 01 00 AND 10 11 DFF DFF din0 DFF 01 AND dout0 00 AND • 788 JJs • area – 1.28mm×1mm • bias current – 87 mA • clock frequency – 27 GHz • 4 pipeline stages NOT 10 AND cross0 NDRO bar0 01 AND 11 AND DFF clkout clkin 00 AND 10 AND NOT cross1 NDRO bar1 01 AND 11 AND DFF dout1 10 AND din1 DFF DFF DFF 11 AND

  9. Crossbar switch with a multicasting function, ver.2 din0 din0 DFF DFF AND dout0 cross0 bar0 dout0 NDROC 02 AND NDRO AND bar0 cross0 clkout clkin clkout clkin AND cross1 cross1 dout1 AND NDROC 02 NDRO bar1 bar1 dout1 AND din1 DFF • 141 JJs • 14 mA • 70% reduction of the number of JJs • 2 pipeline stages instead of 4 • 296 JJs • 30 mA • 62% reduction of the number of JJs • 2 pipeline stages instead of 4 10 11 00 01 10 11 00 01

  10. Experimental results data_out clkin_hf clkin_lfout circuit under test ladder clkin_lfin • Circuits tested: • crossbar switch with a multicasting function, ver.1 • ½ crossbar switch with a multicasting function , ver.1 • 1-to-2 ORN prototype data_in

  11. Cross-bar switch with a multicasting function, ver.1 00 01 • 788 JJs • area – 1.28mm×1mm • bias current – 87 mA • clock frequency – 27 GHz Total: • 1484 JJs • bias current – 172 mA clkout1 dout1 dout0 clkout clkin_lfout clkin_hf clkin_lfin bar1 cross0 bar0 din1 din0 10 11 clkout1 dout1 dout0 clkout clkin_lfout clkin_lfin cross1 bar1 bar0 din1 din0

  12. ½ cross-bar switch with a multicasting function, ver.1 00 01 • 455 JJs • area – 1mm×0.52mm • bias current – 50 mA • clock frequency – 27 GHz Total: • 1066 JJs • bias current – 127 mA clkout1 dout1 dout0 clkout clkin_lfout clkin_lfin bar1 cross0 bar0 din0 10 clkout1 dout1 dout0 clkout clkin_lfout Tested at the frequencies: 22÷36 GHz clkin_lfin cross1 bar0 din0

  13. 1-to-2 ORN: low frequency test • completely functional, exhaustive test • bias_kern0 = -14.6/5.3 % does not depend on the pattern • bias_kern1 = -16.1/18.3 % for din0 -> dout11, dout12 • bias_kern2 = -20.7/12.6 % for din0 -> dout11, dout12 minimum! • bias_kern1 = -40.3/17.2% for din1 -> dout01 • bias_kern2 = -38/12.6% for din2 -> dout02, dout12 maximum! • Total 2031 JJs • Total bias current 224.4 mA dout12 dout02 dout11 dout01 clkout2 clkout1 clkout bar02 bar12 cross11 cross01 cross10 bar00 clkin_lfin din0 • bias_kern0 = -14.6/5.3 % • bias_kern1 = -23/17.2% • bias_kern2 = -23/12.6% open466, no. 4 chip F2 example: din0 -> dout01, dout02, dout12 control pattern: CBT0 “10”, CBT1 “01”, CBT2 “10”

  14. 1-to-2 ORN: high frequency test and bias margins frequency dependence dout12 dout02 dout11 dout01 clkout2 clkout1 clkout bar11 bar01 bar10 bar00 clkin_lfout1 clkin_hf clkin_lfin din0 • open466, no. 4 chip F2 example: • din0 -> dout11 • control pattern: CBT0 “00”, CBT1 “00” • bias_kern0 margins were frequency independent • measured at the frequencies up to 23.5 GHz

  15. Conclusion • we have proposed two different architectures of an ORN: • NDRO-based • crossbar-based • complexity comparison has been done and crossbar-based ORN is considered a better option due to its scalability and pipelined structure • a new version of the crossbar switch has been designed • two versions of a crossbar with a multicasting function have been designed for 2.5 kA/cm2 process and successfully tested at the frequencies up to 28 GHz • a 1-to-2 ORN has been designed and successfully tested at the frequencies up to 23.5 GHz

More Related