
Playstation3 cluster experience

Submitted to: J. Lee. By: DharaKumari Patel.


Presentation Transcript


  1. Playstation3 cluster experience. Submitted to: J. Lee. By: DharaKumari Patel.

  2. INDEX: Introduction, Hardware, Software, Communication, Algorithms, Applications, Benchmarks, Conclusion

  3. INTRODUCTION: The multi-core approach was deemed the next big change in processor technology. Challenges to overcome:
     – Memory Wall
       1. Streaming DMA architecture
       2. 3-level memory model: main storage, local storage, register files
     – Power Wall
       1. High frequency at a low operating voltage, with advanced power management
     – Efficiency Wall
       1. Highly optimized implementation
       2. Large shared register files, SIMD and deeper pipelines

  4. HARDWARE: PS3 Cluster
     – 8 PS3s in a private VLAN (10.0.0.X)
     – GigE between the nodes
     – DHCP server on the front node (IP masquerading, host name setup, etc.)
     – Accessible only through the front node

  5. SOFTWARE
     Installed on each PS3:
     – Fedora Core 5
       1. Linux kernel v2.6.16: no built-in Cell BE support
       2. Recommended by IBM for SDK 2.0
       3. Minimum installation with extra packages
     – SDK 2.0
     Installed for the PS3 cluster:
     – MPICH2 (MPI 2.0 standard)
       1. Compatibility issues with the PowerPC architecture

  6. SOFTWARE: SDK & Kernel Recompilation
     – Two different compilers, one for the PPE and one for the SPE
     – SDK includes compilers and debuggers, SIMD libraries, a full system simulator, etc.
     – Kernel needs to be recompiled for huge TLB pages
       1. Improves performance
       2. Fewer translations from virtual addresses to physical addresses

  7. COMMUNICATION: Element Interconnect Bus (EIB)
     – Heart of the Cell's intra-chip communication
     – Runs at half the processor clock speed, with a peak bandwidth of 204.8 GB/s
     – 4 rings, each 16 bytes wide: 2 clockwise and 2 counter-clockwise
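The 204.8 figure on the EIB slide can be reproduced with a short calculation. The sketch below assumes a 3.2 GHz core clock and one 128-byte transfer granted per bus cycle; neither number appears in the slides, and the function name is hypothetical.

```c
#include <assert.h>

/* Peak EIB bandwidth in GB/s.
 * Assumptions (not stated in the slides): the bus clock is half the
 * core clock, and one transfer of `bytes_per_bus_cycle` bytes can be
 * granted on every bus cycle. */
static double eib_peak_gbytes_per_s(double core_clock_ghz,
                                    int bytes_per_bus_cycle)
{
    assert(core_clock_ghz > 0.0 && bytes_per_bus_cycle > 0);
    double bus_clock_ghz = core_clock_ghz / 2.0; /* EIB runs at half speed */
    return bus_clock_ghz * (double)bytes_per_bus_cycle;
}
```

Under these assumptions, `eib_peak_gbytes_per_s(3.2, 128)` evaluates 1.6 GHz × 128 bytes, which matches the peak quoted on the slide.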

  8. ALGORITHMS
     • Basic algorithm to split data across several SPEs:

         for (i = 0; i < num_spes; i++) {
             /* start of the i-th chunk in the input array */
             offset = size - ((num_spes - i) * (size / num_spes));
             /* copy one chunk of SPU_SPLIT_NUM elements for this SPE */
             for (k = 0; k < SPU_SPLIT_NUM; k++) {
                 spu1_data[k] = data[k + offset];
             }
         }

     • Example: size = 1024 and num_spes = 4 give offsets 0, 256, 512, 768.
     • Basic algorithm to "stitch" the calculated data from several SPEs back together:

         for (i = 0; i < num_spes; i++) {
             /* start of the i-th chunk in the output array */
             offset = size - ((num_spes - i) * (size / num_spes));
             /* copy this SPE's results back into place */
             for (k = 0; k < SPU_SPLIT_NUM; k++) {
                 result[k + offset] = spu1_result[k];
             }
         }
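The split and stitch loops from the Algorithms slide can be written as small plain-C helpers that run on any host. This is a minimal sketch, not the presenter's code: the function names and the `float` element type are assumptions, the chunk length is taken to be `size / num_spes` (the slide calls it `SPU_SPLIT_NUM`), and `size` is assumed to divide evenly by `num_spes`.

```c
#include <assert.h>
#include <stddef.h>

/* Offset of the chunk owned by SPE i, using the formula from the slide. */
static size_t chunk_offset(size_t size, size_t num_spes, size_t i)
{
    assert(num_spes > 0 && i < num_spes && size % num_spes == 0);
    return size - (num_spes - i) * (size / num_spes);
}

/* Copy the i-th chunk of `data` into a per-SPE buffer (hypothetical helper). */
static void split_chunk(const float *data, size_t size, size_t num_spes,
                        size_t i, float *spu_data)
{
    size_t chunk = size / num_spes;
    size_t offset = chunk_offset(size, num_spes, i);
    for (size_t k = 0; k < chunk; k++) {
        spu_data[k] = data[k + offset];
    }
}

/* Copy one SPE's results back into the i-th chunk of `result`. */
static void stitch_chunk(float *result, size_t size, size_t num_spes,
                         size_t i, const float *spu_result)
{
    size_t chunk = size / num_spes;
    size_t offset = chunk_offset(size, num_spes, i);
    for (size_t k = 0; k < chunk; k++) {
        result[k + offset] = spu_result[k];
    }
}
```

A caller would loop over `i` from 0 to `num_spes - 1`, call `split_chunk`, hand the buffer to the SPE, and call `stitch_chunk` on the returned data. On real hardware the copies would normally be DMA transfers into each SPE's local storage; the sketch only shows the indexing.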

  9. BENCHMARKS
     – Cluster (optimized), with variable node counts and matrix sizes:
       - Normal (just PPE)
       - Addition of two arrays
       - Multiplication of two arrays
       - Copy of two arrays
       - Triad (add, multiply) + copy
     – PS3 node scaling
     – Comparison to a desktop processor
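The four benchmark kernels listed above can be sketched as scalar C loops, roughly what the "Normal (just PPE)" baseline would run. The function names and the `float` type are assumptions, and the triad is written in the common STREAM form `dst = a + q * b`, which the slides do not spell out.

```c
#include <assert.h>
#include <stddef.h>

/* Element-wise addition: dst[i] = a[i] + b[i]. */
static void bench_add(float *dst, const float *a, const float *b, size_t n)
{
    assert(dst && a && b);
    for (size_t i = 0; i < n; i++) dst[i] = a[i] + b[i];
}

/* Element-wise multiplication: dst[i] = a[i] * b[i]. */
static void bench_mul(float *dst, const float *a, const float *b, size_t n)
{
    assert(dst && a && b);
    for (size_t i = 0; i < n; i++) dst[i] = a[i] * b[i];
}

/* Copy: dst[i] = src[i]. */
static void bench_copy(float *dst, const float *src, size_t n)
{
    assert(dst && src);
    for (size_t i = 0; i < n; i++) dst[i] = src[i];
}

/* Triad (assumed STREAM form): dst[i] = a[i] + q * b[i]. */
static void bench_triad(float *dst, const float *a, const float *b,
                        float q, size_t n)
{
    assert(dst && a && b);
    for (size_t i = 0; i < n; i++) dst[i] = a[i] + q * b[i];
}
```

Each kernel touches every element once, so the loops are limited by memory bandwidth rather than arithmetic. On the cluster, each node would run these on its own slice of the arrays.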

  10. Multiplication – Cluster

  11. Triad – Cluster

  12. COPY - CLUSTER

  13. Opteron 275 vs PS3 Cluster

  14. CONCLUSION
     – Setting up the cluster is similar to setting up any other generic cluster
     – A Cell cluster can provide good, scalable performance
     – The learning curve for programming the Cell is very steep
     – Algorithms and incoming data have to be restructured to take full advantage of the Cell

