120 likes | 263 Views
This research focuses on the configuration management of high-performance computing clusters used in the ALICE High Level Trigger (HLT) system at CERN. The HLT is responsible for filtering a large data stream from 25 GB/s to a manageable 1.2 GB/s by identifying interesting events and regions of interest, utilizing data compression techniques. Optimization of the cluster configuration is crucial for enhancing performance, minimizing node communication and inactivity, and eliminating bottlenecks. The study aims to develop a simulation tool that models various configurations effectively, enhancing data quality for the physics community.
E N D
Configuration Management for High-PerformanceCluster for Real-Time Computing(ALICE HighLevel Trigger) Lars Christian Raae Supervisor: Håvard Helstrup
The ALICEHighLevel Trigger (HLT) • Trigger: A mechanism to determineifthe ”record” buttonshould be pushed. • HLT objective: Cutdatastream from 25GB/s to more manageable 1.2 GB/s by • finding ”interesting” events • selectingevent regions ofinterest • data compression • HLT computingcluster: On-site, COTS machines, about 1 000 CPUs
HLT ConfigurationExample T. Thingnæs. Generering av konfigurasjonsfiler for TaskManager i HLT-systemet for ALICE-eksperimentet på CERN. Master’sthesis, University of Bergen, Norway, 2007.
HLT Architecture M. Richter, Development and Integration of on-line Data Analysis for the ALICE Experiment. PhD thesis, University of Bergen, Norway, 2009. [Online] https://bora.uib.no/bitstream/1956/3555/1/Dr.thesis_Matthias%20Richter.pdf
HLT ConfigurationOptimization • Unique and complex experiment with unpredictable data stream • Initial configuration a ”qualified guess” • Configuration will need optimization for a long time • Increaseperformance by: • Minimizing node communication • Minimizing node inactivity • Eliminatingprocessingbottlenecks • Prerequisite: Test bench
Research Project • Research project: Simulationofthe HLT computingcluster • Cannotexperimentonproductioncluster • Equivalent test clustertoocostly, must usedifferent hardware configuration • Develop software solutionthat lets usmodelpreciselyenough to compareconfigurations • Openquestion: How, exactly, is thisgoing to be done?
Evaluation • What features of a real computingcluster is thesolutionable to model? • Howaccuratelycanthesolutionanswerwhich is betteroftwoclusterconfigurations, given applicable input, and howmuchtheydiffer in performance? • How portable is thesolution? Howmuch manual work is required to test a differentclustersetupthantheone given in the case?
PossibleResults • For HLT: Testbench to performexperiments and developimproved HLT configurations • For physics: Higherqualityexperiment data • For clustercomputingcommunity: Perhaps a newsimulationtool