Platform Disaggregation

Platform Disaggregation. Lightning talk, Openlab Major Review, 16th October 2014. Background: the CERN *aaS catalog. Indico. Twiki. CATIA. Puppet. Drupal. …. Foreman. Ansys. CDS. JIRA. Interactive. OpenStack. LSF. Ceph. CVMFS. LQCD. Kibana. s/w build. AFS. CASTOR. BOINC.

cole-obrien

Presentation Transcript


  1. Platform Disaggregation. Lightning talk, Openlab Major Review, 16th October 2014

  2. Background: the CERN *aaS catalog
     Indico, Twiki, CATIA, Puppet, Drupal, …, Foreman, Ansys, CDS, JIRA, Interactive, OpenStack, LSF, Ceph, CVMFS, LQCD, Kibana, s/w build, AFS, CASTOR, BOINC, Git, …, EOS, Hadoop, Oracle, Exchange, …, TS, Grid Services, NetApp, …, AD, DFS, Lync, ElasticSearch, TSM, CERNBOX

  3. Background: CERN IT assets

  4. Background: Platform customizations
     • Platforms
       • Base node (Cloud, batch worker, web services, …)
         • 2x CPU (e.g. Intel E5-26xx v2, v3)
         • 64 GB RAM
         • 2x 2 TB HDD
         • On-board 1GbE + dedicated IPMI
       • Disk storage (JBOD) front-end
         • Base node + SAS HBA (LSI-9207-8e) + 10GbE (SFP+)
       • Tape server
         • Base node + 10GbE + 8 Gb/s FC
       • TSM front-end
         • Base node + SAS HBA + 10GbE + 8 Gb/s FC (tape)
       • Oracle DB server
         • Base node + 64 GB RAM + 2x dual-port 10GbE (SFP+) + RHEL certification
       • HPC
         • Base node + 64 GB (or 192 GB) + low-latency 10GbE
       • Windows server
         • Base node + RAID (int or int/ext)
       • “Fat” cloud node
         • Base node + 64 GB RAM + 10GbE (SFP+ or RJ45) + SSD
       • …
     • Challenge: achieve all those customizations starting from monolithic base platforms

  5. Could Open Compute help? http://www.opencompute.org/
     • Open Compute Project (OCP) is an interesting new direction with
       • Potential far-reaching impact for industry and data centres
       • A constantly growing provider community and private customer space
     • Encouraging results from our small-scale tests with two twin systems
       • Sufficiently interesting to motivate launching a project for larger deployment
     • Platform is still monolithic
       • Except rack level power distribution
     Open Compute at CERN

  6. Breaking Up the Monolith?
     Intriguing paragraph from an Open Compute announcement (OCP summit, January 2013): http://www.opencompute.org/blog/ocp-summit-iv-breaking-up-the-monolith/
     … “But most exciting of all are a series of new developments that will enable us to take some big steps forward toward better utilization of these technologies. One of the challenges we face as an industry is that much of the hardware we build and consume is highly monolithic -- our processors are inextricably linked to our motherboards, which are in turn linked to specific networking technology, and so on. This leads to poorly configured systems that can't keep up with rapidly evolving software and waste lots of energy and material. To fix this, we need to break up some of these monolithic designs -- to disaggregate some of the components of these technologies from each other so we can build systems that truly fit the workloads they run and whose components can be replaced or updated independently of each other. Several members of the Open Compute Project have come together today to take the first steps toward this kind of disaggregation:” … (*)
     (*) More technical details in “Design guide for photonic architecture”, contributed by Intel to the Open Compute Project in 2013: http://www.opencompute.org/assets/Uploads/Open_Compute_Project_Open_Rack_Optical_Interconnect_Design_Guide_v0.5.pdf

  7. Openlab-V scope
     Disaggregation for enabling provisioning of customized hardware platforms?
     Vision: Software Defined Platform
     • Flexible provisioning / sustainment
       • Commissioning
       • Repurpose
       • Connectivity and routing domains
     • Flexible component lifecycle management
       • Trays of network components (NICs, switches, fabrics, …)
       • Trays of storage (HDD, SSD, NVMe, …)
       • Trays of memory (RAM, NVRAM, …)
       • Trays of processor sockets (Xeon, Xeon Phi, Atom, ARM, …)
     • Scalable, Open and Affordable
       • Scalable performance and manageability
       • Open competitive manufacturer / supplier ecosystem
       • Affordable also underneath the hyper-scale pedestal
     Long-term

  8. Objectives and timescale
     • Phase 1, ~PM12: Disaggregated ToR model
       • OCP rack enhanced with a disaggregated Top of Rack (ToR) switch
     • Phase 2, ~PM24: Disaggregated storage
       • Prototype enhanced with SSD storage sleds added to the switch fabric
     • Phase 3, ~PM36 or beyond: Disaggregated system memory
       • Add a second-level memory hierarchy on NVRAM sleds?
     • Would other research organisations want to participate?
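
The platform customizations on slide 4 all follow the pattern "base node + add-ons", and slide 7 pictures the add-ons coming from shared trays of components. The following is a minimal Python sketch of that idea, not anything shown in the talk. The component labels (`nic_10gbe`, `fc_8g`, and so on), the function names, and the fleet mix are hypothetical names chosen for illustration. Where slide 4 offers alternatives (64 GB or 192 GB for HPC, SFP+ or RJ45), the sketch arbitrarily takes the first one, and it leaves out the RHEL certification because that is not a hardware component.

```python
from collections import Counter

# Base node from slide 4. Keys are illustrative labels, not a real inventory schema.
BASE_NODE = Counter({"cpu": 2, "ram_64gb": 1, "hdd_2tb": 2, "nic_1gbe": 1, "ipmi": 1})

# Add-ons per platform, transcribed from slide 4 (first alternative taken where
# the slide lists several; RHEL certification omitted as it is not hardware).
ADDONS = {
    "base": Counter(),
    "disk_frontend": Counter({"sas_hba": 1, "nic_10gbe": 1}),
    "tape_server": Counter({"nic_10gbe": 1, "fc_8g": 1}),
    "tsm_frontend": Counter({"sas_hba": 1, "nic_10gbe": 1, "fc_8g": 1}),
    "oracle_db": Counter({"ram_64gb": 1, "nic_10gbe_dual": 2}),
    "hpc": Counter({"ram_64gb": 1, "nic_10gbe_lowlat": 1}),
    "windows": Counter({"raid": 1}),
    "fat_cloud": Counter({"ram_64gb": 1, "nic_10gbe": 1, "ssd": 1}),
}


def platform(name):
    """Return the full component count for a platform: base node plus add-ons."""
    return BASE_NODE + ADDONS[name]


def pool_demand(fleet):
    """Sum component demand over a fleet given as {platform name: node count}."""
    total = Counter()
    for name, count in fleet.items():
        for component, qty in platform(name).items():
            total[component] += qty * count
    return total


if __name__ == "__main__":
    # Hypothetical fleet mix, for illustration only.
    fleet = {"base": 10, "fat_cloud": 4, "tape_server": 2}
    for component, qty in sorted(pool_demand(fleet).items()):
        print(component, qty)
```

With monolithic platforms each entry in `ADDONS` means a separately procured and configured server model. In the disaggregated vision, the output of `pool_demand` would instead describe how many units to draw from each shared tray, which is why the sketch keeps components as plain counts rather than per-server objects.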
