190 likes | 315 Views
GRID Applications for the LHC ALICE Experiment. Máté Ferenc Nagy Eötvös University Applied-Physics student Supervisor : Gergely Barnaföldi MTA KFKI RMKI Hungarian ALICE Group. Overview. Part I.: The CERN GRID. The GRID GRID implementation : the middleware AliEn , the ALICE GRID
E N D
GRID Applicationsforthe LHC ALICE Experiment Máté Ferenc Nagy Eötvös University Applied-Physicsstudent Supervisor : Gergely Barnaföldi MTA KFKI RMKI Hungarian ALICE Group
Overview Part I.: The CERN GRID • The GRID • GRID implementation: themiddleware • AliEn, the ALICE GRID • RMKI resources • A job’s life • ROOT and AliROOT Part II.: GP-GPU ApplicationsinPhysics • GP-GPU • RMKI test cluster • Architecture • Summary • Animations / Questions
GRID • What is a GRID? A distributedsystem, agroup of interconnectedcomputersthatshare a commoninterfacetoshareresources. • Whywouldweneedsuch a thing? „Anythingthatcan be calculatedonpaper has alreadybeendone.” Great demandforcomputingpower. Tooexpensivecentralized.
GRID, the CERN way • The solution? Amidstthe LCG-EGEE (LHC ComputingGrid – EnablingGRIDsfor E-Science) program, thegLitemiddleware has beendeveloped. (see 7thslide) • What’sconsideredto be a resource? CPU, HDD
Where is themiddleware? software middleware middleware software middleware software middleware Operatingsystem Operatingsystem Operatingsystem hardware hardware hardware
The role of themiddleware • Itservesasthedistributor of resourcesavailableto a machine. • Advertisestheseresources (free vs. used) • Itschedulesincomingjobsforexecution, reportstheir status. • Itservesthejobswithwhateverpackageorlibrarydependenciestheymighthave.
Middleware – á la ALICE • gLite: The background service. Allhigherlayersgivecommandstothislayer. • AliEn: (AliceEnvironment) a moduleextendingtheregulargLiteservices.(faster, stronger, better, and nontheless: userfriendly) • VOBOX: (VirtualOrganisation)It is a generalpurposeelementinthemiddleware. • - Itisn’tmandatorytoinstall a voboxontoa site. Allexperimentsuseittodifferentends. • -At ALICE, AliEncommandsaregivenfrom here. (eg. Job submission, resultdownloading, file operations, ...)
Storage – á la ALICE • The handling of storagedoesnot go accordingto standard procedure; gLitesupportsmanystoragetypes and topologies, butnotthe ALICE standard xrootd. • Therefore file operationscannot be invokedbytheunderlyinggLitemiddleware, it has to be maintainedbytheVOBOX. (Whereforeseengeneralitycomesinhandy)
Storage and Computingelementsat RMKI • SE: (Storage Element) Twomachineswith 20TB diskspaceeach. • 1 machine: 4X RAID 5+1, all 1TB HDD • CE: (ComputingElement) Vastly Intel Xeonprocessorswiththe standard 2GB RAM/core. • 13 DualCore és 100 QuadCoremachines arebroughttowork. • Theyareusedintimedivisionwiththe CMSexperiment.
A job’s life CERN BDII database xrootd globalredirector JA Job Agent: WHAT werun (containstheexecutable) ON what (containsthenecessary input files) WHERE werun (optionalrestrictionsto site properties) VOBOX gLite VOBOX gLite VOBOX gLite „A” site „B” site „C” site redirector redirector redirector CE SE CE SE CE SE CE SE CE SE CE SE CE SE CE SE CE SE
ROOT and AliROOT • ROOT: - C++ basedframework. - Meantfor 3 dimensional designing withcapablemathematicalsupport. • AliROOT: - Every version containstheup-to-dateschematics of the ALICE detector. - The physicalbehavior of materialsaresimulatedbyGeant.
PART II. GP-GPU applicationsinthe CERN GRID
GP-GPU • Expansion: General Purpose-GraphicsProcessing Unit • Advantages: - 10-100X speedincrease is achieveable -Better performance vscost, wattageratios. • Backdraws: • - Longerdevelopmenttime. • - Notallproblemscan be paralellized. • Recentevents: - OpenCL (Open ComputingLanguage) - DirectX 11 (ComputeShader 5.0)
New CardsUptheSleeve • Both major GPU manufacturersintroducedtheirnewgenerationgraphicscardsthatdelivermassivecomputingpowerforparalellizedalgorithms.
Fermi (GT300) Architecture • The C2070 Tesla cardswillfeature 512 CUDA cores, eachabletorun a singlethread of execution. • Peakcomputationalpower of 1600 GFLOPS SP, 650 GFLOPS DP. 1 FLOPS = 1 FLoating-pointOperation Per Second High-end Intel Xeonprocessorspossess ~ 130 GFLOPS SP/DP computingpower.
Cypress (RV870) Architecture • The AMD ATiRadeon 5870 features 1600 shaderprocessors, eachabletorun a singlethread of execution. • Peakcomputationalpower of 2700 GFLOPS SP, 500 GFLOPS DP.
RMKI GPU test cluster • Onemachinewith 3 GTX295 (dualcore) GPUs. • Acquisition of 2 newhigh-enddesktopconfigsinrackmounts: • Intel Core-i7 quad-core 2.66 GHzprocessors, 12GB RAM, 3 high-speed (x16 PCI-E) slotsforGPUs. • Onemachinewil be equippedwith AMD ATiRadeon 5970 (dualcore) GPUsforOpenCLcomputinginthenearfuture. Secondwillcontain Tesla C2070 cardsfor CUDA/OpenCL.
Summary • The RMKI GRID has beenfunctioningsince 2002 • ALICE computationsrunningsince 2006, fromthisyearatevenlargerscales. • The xrootdbased ALICE dedicatedstorage has beenworkingsincethisyear Q1. • Fullyaccomplishingthedirectives of a T2 site. • Atthe end of Scientific Linux 5 migration, onestepfromdatataking (thxto Szabolcs Hernáth). • Scientificactivity: TDK work has juststarted.
Animations • NVIDIA CUDA Fluid Dynamics, Navier-Stokesequationsolveronrealisticsizegrid. • NVIDIA CS 5.0 BruteForceN-BodySimulation • NVIDIA CS 5.0 OceanRenderingusing FFT calculationson a highresolutiongrid. • NVIDIA OpenCL O(N*logN) N-Bodysimultaionusingcut-offdistanceoptimization. • AMD Havokclothingsimulation, simple Newton equationsolverwithhighnumberinteractions.