1 / 14

Architecture of Systola 1024

M. Kunde H.W. Lang M. Schimmler H. Schmeck H. Schröder. Special features of the ISA: fast local communication aggregate functions with constant period fast integer arithmetic. RAM NORTH. RAM WEST. Controller. program memory. host computer bus. ISA. Interface processors.

theodoree
Download Presentation

Architecture of Systola 1024

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. M. Kunde H.W. Lang M. Schimmler H. Schmeck H. Schröder • Special features of the ISA: • fast local communication • aggregate functions with • constant period • fast integer arithmetic RAM NORTH RAM WEST Controller program memory host computer bus ISA Interface processors Architecture of Systola 1024

  2. 1 M bit 1 M bit memory control PIPS architecture • Torus 32x32 • off-the-shelf SRAM as distributed shared memory with prefetch

  3. reconfigurable mesh = mesh + interior connections low cost diameter 1 !! 15 positions

  4. CAN shadow-processors majority voting Fault tolerance through reconfiguration

  5. majority voting Mission-critical tasks can be taken over by any processor PS(k)= probability of survival under the assumption that k processors fail. For the “shadow system” with 16 = 8x2 processors: PS(1)=1, PS(2)=0.93, PS(4)=0.6, PS(6)=0.22, PS(8)=0.02

  6. instrument processor “atomic fault pattern” A simple solution with high fault tolerance (torus) Every fault pattern, that does not contain a 2x2 array of faulty PEs survives. PS(7)=0.7

  7. on the “other side” processor task Torus: 16 task 32 processors PS(5)=1 How about 32 tasks and 16 processors ??

  8. majority voting 6-neighborhood torus PS(5)=1

  9. Distributed shared memory ? Migration of contents ? Memory/voting/error-correction Memory/voting/error-correction Memory/voting/error-correction Memory/controller/error-correction

  10. Assumption: The processors are identical and have at least twice the required capacity. Other devices do not fail. N I W P E S 4x4 torus = 4D hypercube

  11. If less then 9 processors fail, no processor needs to take more than 2 tasks. The processors time-share between the tasks. PS(10)=1 ?

  12. N I 3 3 W 2 2 P E 1 3 3 S 2 2 3 3 PS(7)=1 ?? 1-processor/1-task ?? spares - torus ?? control of switches ??

  13. PS N I 1 W P E S 8 16 # faults Number of switches: 2, 4, 6, 30 Wire area: 0, 2.8, 3.8, 4

  14. Load balancing dynamic allocation of tasks

More Related