1 / 19

Development of the distributed monitoring system for the NICA cluster

Development of the distributed monitoring system for the NICA cluster. Ivan Slepov (LHEP, JINR). Mathematical Modeling and Computational Physics Dubna , Russia, July 8, 2013. The MultiPurpose Detector – MPD to study Heavy Ion Collisions at NICA.

sorcha
Download Presentation

Development of the distributed monitoring system for the NICA cluster

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Development of the distributed monitoring system for the NICA cluster Ivan Slepov (LHEP, JINR) Mathematical Modeling and Computational Physics Dubna, Russia, July 8, 2013

  2. The MultiPurpose Detector – MPDto study Heavy Ion Collisions at NICA

  3. Software for MultiPurposeDetector ROOT + FairRoot (FairBase + FairSoft software packages) = Detectors simulation MpdRootFramework components: Data reconstruction Event analysis

  4. Software for MultiPurposeDetector ROOT + FairRoot (FairBase + FairSoft software packages) = Detectors simulation MpdRootFramework components: Data reconstruction Event analysis

  5. Software for MultiPurposeDetector ROOT + FairRoot (FairBase + FairSoft software packages) = Detectors simulation MpdRootFramework components: Data reconstruction Event analysis

  6. Software for MultiPurposeDetector ROOT + FairRoot (FairBase + FairSoft software packages) = Detectors simulation MpdRootFramework components: Data reconstruction Event analysis

  7. Computing resources for MPDdata processing CPU:128 XEON cores GPU: ~1500 TESLA cores

  8. Computing resources for MPDdata processing CPU:128 XEON cores => in future ~10000XEON cores GPU: ~1500 TESLA cores

  9. Motivation to develop monitoring system MPD users need more information about all own cluster nodes and public computers! • Computing resources information (free space, memory, cpu, etc) • System load (load average, processes) • MPD software information (FairSoft version) • Cluster software information (SGE, xrootd, proof) • User tasks monitoring (batch processing and interactive jobs)

  10. Monitoring system schemes Scheme 1 – for collect general information DSH Software BASH Scripts Cron run job MySQL DB WEB Interface MySQL DB PHP Scripts

  11. Monitoring system schemes Scheme 1 – for collect general information DSH Software BASH Scripts Cron run job MySQL DB WEB Interface MySQL DB PHP Scripts WEB Interface PHP Scripts DSH Software BASH Scripts MySQL DB Scheme 2 – for collect information about user tasks and provide data management

  12. Web-interface for Monitoring system MPD software information Computing resources information System load User tasks monitoring

  13. Monitoring system web-interfaceUser tasks

  14. Monitoring system web-interfaceInteractive nodes

  15. Access to the monitoring system on websitempd.jinr.ru

  16. Thank you for your attention!

  17. Motivation to develop system monitoring MPD users need more information about all own cluster nodes and public computers! Why? If, for example, the concept of grid uses a layer of abstraction from the resources. Because MPD software now still under development and needs testing and debugging.

More Related