1 / 20

Krzysztof Korcyl Institue of Nuclear Physics Polish Academy of Sciences &

Modeling event building architecture for the triggerless data acquisition system for PANDA experiment at the HESR facility at FAIR/GSI. Krzysztof Korcyl Institue of Nuclear Physics Polish Academy of Sciences & Cracow University of Technology. Agenda.

Download Presentation

Krzysztof Korcyl Institue of Nuclear Physics Polish Academy of Sciences &

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Modeling event building architecture for the triggerless data acquisition system for PANDA experiment at the HESR facility at FAIR/GSI Krzysztof Korcyl Institue of NuclearPhysicsPolishAcademy of Sciences & CracowUniversity of Technology

  2. Agenda • PANDA experiment – detectors and requirements for DAQ system • thepush-onlyarchitecture • ComputeNodein ATCA standard • data flowinthearchitecture • shortintroduction on discreteeventmodelling • modelingresults • latency • queuesdynamics • load monitoring • summary

  3. PANDA detectors Experimentat HESR (High Energy Storage Ring) in FAIR (Facility for Antiproton and Ion Research ) complexat GSI, Darmstadt. • Trackingdetectors: • Micro VertexDetector • Central Tracker • Gas ElectronMultiplierStations • ForwardTracker • Particlesidentification: • DIRC (Detection of InternallyReflectedCherenkov) • Time of Flight System • MuonDetection System • Ring ImagingCherenkovDetector • Calorimetry: • Electromagneticcalorimeter

  4. PANDA DAQrequirements • interactionrate: up to 20 MHz (luminosity 2* 1032 cm-2s-1) • typicaleventsize: ~4 kB • expectedthroughput: 80 GB/s (100 GB/s) • richphysics program requires a high flexibilityineventselection • front end electronics workingincontinuoussamplingmode • lack of hardware triggersignal

  5. Thepush-onlyarchitecture SODA Time Distribution System Passivepoin-to-multipointbidirectionalfibernetworkproviding time referencewithprecissionbetterthan 20 ps and performssynchronization of data taking

  6. ATCA crate and backplane ATCA – AdvancedTelecommunicationsComputingArchitecture Backplane: one of possibleconfigurationisfullmesh (connectseachpair of moduleswithdedicatedpoint-to-pointbidirectionalllink

  7. ComputeNode Each board is equipped with 5 Virtex4 FX60 FPGAs. High bandwidth connectivity is provided by 8 Gbitopticallinksconnected to RocketIOports (6.5 Gb/s). In additiontheboardisequippedwithfiveGbit Ethernet links

  8. Inter-cratewiring 416 Module inslot N attheFEElevelconnects, with 2 linkstrunks, to modulesatslots N attheCPUlevel. 416 Theoddeventspacketsatthe FEE levelare first routed via thebackplane and thenoutbound to the CPU level via a propertrunk. Theeveneventspacketsoutbound to the CPU level first and thenusebackplaneto changethe slot.

  9. Inter-crateroutinganimation 205 366

  10. OnboardVirtexconnections Virtex0 – handlesallcommunications to/fromthebackplane Virtex1-4 - manage 2 input and 2 outputportsatthe front panel

  11. Discreteeventmodeling • Model – computer program simulating system dynamicsintime (supportfromSystemClibrary) • Fixed time step • Discreteevents t events t State of the system remainsconstantbetweenevents Processing system ina state maylead to scheduling a neweventinthefuture

  12. Parameterization of ports SendFifo – occupationcangrowifmultiplewritesareallowed OR the link speedissmallerthanthespeed of write OR therecentpacketsaresmallerthantheformerones. ReceiveFifo – occupationcangrowifthequeueheadcan not be transferreddue to destinationbeing busy withanother transfer OR therecentpacketsaresmallerthantheformerones. The transfer speedis a parameter – duringthesimulationsit was set to 6.5 Gb/s (RocketIO)

  13. Models of source and sink of data • Data source – Data Concentrator: • generates data packetwith a sizeproportional to the sum of number of inter-interactionscalculatedfrom Poisson distributionwithaverage of 20 MHz • Burst: 2 µs of interactions + 400 ns of silence gap • Super-burst: 10 bursts • Simulatesthe 8b/10b conversion, tagspacketswithdestination CPU number and pushesintothearchitecture. • Data sink – eventbuilding CPU: • Simulateseventbuilding of 416 fragmentswiththe same tag • size of burst: ~300 kB • size of super-burst: ~3 MB

  14. Eventbuildinglatency Constantlatencyin time indicatesability of thearchitecture to switch not only 100 GB/s but also 173 GB/s

  15. LoaddistributionbetweenCPUs The CPU for nexteventiscalculatedwiththeformula: Nt+1 = mod (Nt + 79) , 416)

  16. Monitoring queues’ evolution Averagedmaximalqueuelengthininputportsatthe FEE level. Theaveraging was overtheportswiththe same indexinall FEE modules.

  17. Monitoring links’ load Load on fiberlinksconnectingoutputportsfromthe FEE levelwithinputportsatthe CPU level. Homogeniousloadindicatesproperroutingscheme – also for trunking.

  18. Monitoring queues Average of maximallength of inputqueuesatthe CPU level. Theaverage was madewithportswiththe same index on allmodules.

  19. Monitoring Virtexlink’soccupation Atthe CPU level, thepacketsheading for odd-numberedCPUs go via thebackplane to the slot withdestination CPU.

  20. Summary • We proposetheeventbuildingarchitecture for thetriggerless DAQ system for the PANDA experiment. • ThearchitectureusesComputeNodemodulesin ATCA standard. • We builtsimplifiedmodels of thecomponentsusingSystemClibrary and run thefullscalesimulations to demonstraterequired performance and to analysedynamics of the system. • Thepush-onlymodeoffers 100 GB/s throughputwhichallows to performburst/super-burstbuilding and to run selectionalgorithms on fullyassembled data. • withtheinputlinksloadedup to 70% of theirnominalcapacitythearchitecturecan handle 173 GB/s

More Related