1 / 13

Key Activities. MND sections

Key Activities. MND sections. Group meeting 13.08.08. Overview. Yesterday section meeting showed substantial progress in many directions: Central repository of monitoring metrics for LHC experiments (Elisa and Dashboard team, intersection development  )

mnutting
Download Presentation

Key Activities. MND sections

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Key Activities. MND sections Group meeting 13.08.08

  2. Overview • Yesterday section meeting showed substantial progress in many directions: • Central repository of monitoring metrics for LHC experiments (Elisa and Dashboard team, intersection development  ) • GridMap view for site administrators (Elisa, Max, Dashboard team) • Testing of LB notification system (Olga Kodolova and LB Team) • Adapting CMS SAM availability applications for other LHC VOs (William) • Development of the Grid Messaging System and support of pilot users and applications (Daniel) • New features of ATLAS DDM and ProdSys monitoring. (Ricardo, Benjamin, Lu, Markus) • Creating of ATLAS Grid Information System (AGIS) and integration of Dashboard applications with it (Raquel, Ricardo, Benjamin) • A lot of improvements to the CMS Site Status Board (Pablo) • Improvements of the CMS Interactive Job Monitoring application, following the requests of the CMS community (Anastasia, Julia) • Improvements to the CMS schema and queries, fixing problems causing high DB load during CCRC08 (Irina, Julia) • Use of GMS for propagating of job status reports from the worker node to the Dashboard DB (Bernd) - Improvements for Data Transfer Reliability for ALICE experiments (Spyridon)

  3. Summer students • We have four summer students in the MND section this year: Lu feng, Bernd Schodel, Markus Huber, Spyridon Koulozis • Very good contribution to the sections work! • See details about student presentation at yesterday section meeting: http://indico.cern.ch/conferenceDisplay.py?confId=39049

  4. Central repository for common metrics and site view • Objective of this activity • ● Provide a comprehensive monitoring tool which, from one unique console, • gives an overview of the overall status of the services in their site. This • should be a tool easy to use, also for persons external to the VO, and which • does not require a particular knowledge of each experiment. • ● This tool will extract information from the experiment specific tools • (Dashboard, MonALISA, Dirac, Phedex) and will display it in a consistent • way. • ● It will display the information using the gridmap technology and will provide links to the source of the information • ● An additional requirement from sites is to have a clear definition of the • targets from the experiments: this information should be displayed by this • tool (but it has to be clearly defined and provided by the experiments)

  5. Testing of LB notification • We are first pilot users • In case of success LB notification should replace RGMA and IC XML sources for job status changes information for jobs submitted via LCG WMS.

  6. Adapting of CMS SAM availability application for other VOs • UI to follow the results of SAM tests is already in production for ATLAS • Is coming for LHCb and ALICE • Creating the collectors for topology info for LHC experiments to be introduced to SAM DB (William for ATLAS, Julia for CMS)

  7. ATLAS Data Management • Integration with AGIS • Collection of OSG scheduled downtimes • Notification messages for cloud activity / performance • Delivery using the dashboard messaging API • Part of maintenance now given to ATLAS shifters and experts • Activities for dataset subscriptions (in development) • Delivery by the end of August • New mandatory ‘activity’ field when issuing subscriptions • Transition to a single instance of the DDM dashboard – new filters by activity instead of multiple instances • New statistics in the main display (Lu Feng)

  8. ATLAS Production System • Integration with AGIS • RSS feed exposing problematic sites / tasks • Cloud / site admin display (Lu Feng) • Grouping error messages by similarity • Better than proddb error codes which are not always useful • Improved shifter’s display • Reporting of pilot job details via python API (issuing HTTP requests)

  9. AGIS • ATLAS Grid Information System • Built using the dashboard framework • Collection of site information • From GOCDB for EGEE and OIM for OSG • Collection of service information • From the LCG and OSG BDIIs • Collection of ATLAS cloud and tier information • From the TiersOfATLAS and Panda database • Stopped after a transition period when update of info directly in AGIS is fully tested • Querying available via a python API, command line tools, direct HTTP with multiple output formats (including ToA) • Updating via secure HTTP calls • Prototype: http://agis.cern.ch/dashboard/request.py/agis

  10. ATLAS Tier0 Dashboard • Developed in collaboration with the ATLAS T0 team (Guido Negri) • T0 data exposed via a dashboard application • Prototype display built using HTML canvas elements

  11. Grid Messaging System

  12. CMS Site Status Board

More Related