
Configuration Management for High-Performance Clusters in Real-Time Computing for ALICE HLT

This research focuses on configuration management for the high-performance computing cluster used in the ALICE High Level Trigger (HLT) system at CERN. The HLT reduces the data stream from 25 GB/s to a more manageable 1.2 GB/s by finding interesting events, selecting event regions of interest, and compressing data. Optimizing the cluster configuration increases performance by minimizing node communication, minimizing node inactivity, and eliminating processing bottlenecks. The study aims to develop a simulation tool that models the cluster precisely enough to compare configurations, with the goal of higher-quality experiment data for the physics community.

Presentation Transcript


  1. Configuration Management for High-Performance Cluster for Real-Time Computing (ALICE High Level Trigger) Lars Christian Raae Supervisor: Håvard Helstrup

  2. The ALICE High Level Trigger (HLT) • Trigger: A mechanism to determine if the ”record” button should be pushed. • HLT objective: Cut data stream from 25 GB/s to more manageable 1.2 GB/s by • finding ”interesting” events • selecting event regions of interest • data compression • HLT computing cluster: On-site, COTS machines, about 1 000 CPUs
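As a quick check of the figures on this slide, the short Python sketch below computes the reduction the HLT has to achieve. The two rates come from the slide; the function names are illustrative only.

```python
# Rates quoted on the slide, in GB/s.
INPUT_RATE_GB_S = 25.0
OUTPUT_RATE_GB_S = 1.2


def reduction_factor(input_rate: float, output_rate: float) -> float:
    """Return how many times smaller the output stream is than the input."""
    return input_rate / output_rate


def fraction_kept(input_rate: float, output_rate: float) -> float:
    """Return the share of the input data volume that leaves the HLT."""
    return output_rate / input_rate


factor = reduction_factor(INPUT_RATE_GB_S, OUTPUT_RATE_GB_S)
kept = fraction_kept(INPUT_RATE_GB_S, OUTPUT_RATE_GB_S)

print(f"reduction factor: {factor:.1f}x")  # about 20.8x
print(f"fraction kept: {kept:.1%}")        # about 4.8%
```

In other words, event selection, region-of-interest selection, and compression together have to shrink the stream by roughly a factor of 21.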

  3. HLT Configuration Example T. Thingnæs. Generering av konfigurasjonsfiler for TaskManager i HLT-systemet for ALICE-eksperimentet på CERN. Master’s thesis, University of Bergen, Norway, 2007.

  4. HLT Architecture M. Richter, Development and Integration of on-line Data Analysis for the ALICE Experiment. PhD thesis, University of Bergen, Norway, 2009. [Online] https://bora.uib.no/bitstream/1956/3555/1/Dr.thesis_Matthias%20Richter.pdf

  5. HLT Configuration Optimization • Unique and complex experiment with unpredictable data stream • Initial configuration a ”qualified guess” • Configuration will need optimization for a long time • Increase performance by: • Minimizing node communication • Minimizing node inactivity • Eliminating processing bottlenecks • Prerequisite: Test bench

  6. Research Project • Research project: Simulation of the HLT computing cluster • Cannot experiment on production cluster • Equivalent test cluster too costly, must use different hardware configuration • Develop software solution that lets us model precisely enough to compare configurations • Open question: How, exactly, is this going to be done?

  7. Evaluation • What features of a real computing cluster is the solution able to model? • How accurately can the solution answer which is better of two cluster configurations, given applicable input, and how much they differ in performance? • How portable is the solution? How much manual work is required to test a different cluster setup than the one given in the case?
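The slides leave open how the simulation will actually be built, so the following is only a toy sketch of what "comparing two cluster configurations" could mean, not the project's method. It assumes a deliberately simple model: a configuration is a chain of processing stages, each stage's capacity is its node count divided by the per-event compute plus communication time, and the slowest stage limits the whole chain. All names, stages, and numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """One hypothetical processing stage in a chain of stages."""
    name: str
    nodes: int         # number of nodes assigned to this stage
    compute_s: float   # assumed compute time per event per node (seconds)
    transfer_s: float  # assumed communication time per event (seconds)

    def capacity(self) -> float:
        """Events per second this stage could handle on its own."""
        return self.nodes / (self.compute_s + self.transfer_s)


def bottleneck(config: list[Stage]) -> Stage:
    """Return the stage with the lowest capacity."""
    return min(config, key=Stage.capacity)


def throughput(config: list[Stage]) -> float:
    """The chain runs no faster than its slowest stage."""
    return bottleneck(config).capacity()


def idle_fraction(config: list[Stage]) -> dict[str, float]:
    """Share of each stage's capacity left unused at chain throughput."""
    rate = throughput(config)
    return {s.name: 1.0 - rate / s.capacity() for s in config}


def relative_gain(old: list[Stage], new: list[Stage]) -> float:
    """Relative throughput change when moving from old to new."""
    return (throughput(new) - throughput(old)) / throughput(old)


# Two made-up configurations that distribute six nodes differently.
config_a = [Stage("first_stage", 4, 0.002, 0.001),
            Stage("second_stage", 2, 0.004, 0.001)]
config_b = [Stage("first_stage", 2, 0.002, 0.001),
            Stage("second_stage", 4, 0.004, 0.001)]

print("bottleneck in A:", bottleneck(config_a).name)
print("idle fractions in A:", idle_fraction(config_a))
print(f"gain of B over A: {relative_gain(config_a, config_b):.0%}")
```

Even this small model yields the two answers the evaluation asks for, namely which configuration is better and by how much, and it exposes node inactivity and the bottleneck stage. A realistic simulation would also need to account for the unpredictable data stream and the actual hardware, which this sketch ignores.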

  8. Possible Results • For HLT: Test bench to perform experiments and develop improved HLT configurations • For physics: Higher quality experiment data • For cluster computing community: Perhaps a new simulation tool
