1 / 8

MUPJ - gLExec update

MUPJ - gLExec update. MB, 2013-05-14 Maarten Litmaath CERN. Nagios tests for “ops”. Each NGI/ROC needs to submit its own tests Register at least 1 DN with pilot role in “ops” VO Configure SAM-Nagios to submit the tests Thanks to EGI essentially done

thanos
Download Presentation

MUPJ - gLExec update

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. MUPJ - gLExec update MB, 2013-05-14 Maarten Litmaath CERN

  2. Nagios tests for “ops” • Each NGI/ROC needs to submit its own tests • Register at least 1 DN with pilot role in “ops” VO • Configure SAM-Nagios to submit the tests • Thanks to EGI essentially done • Each EGI site supporting glexec needs to update the GOCDB • A “gLExec” flag for each CE supporting glexec on the WN • A lot of sites and CEs still to be done • EGI added alarm to operations profile in Dec (not yet A/R) • Automatic tickets for sites whose “gLExec” tests fail • Results are available in MyWLCG • http://cern.ch/go/fgz8 • Currently only 117 CEs present, including ~all T1 with CREAM • Was 104 on Jan 15 Maarten Litmaath (CERN)

  3. gLExec deployment campaign • A few more sites claim glexec support in the BDII • 88 = 64 EGI + 24 OSG on May 13 • Was 82 = 59 + 23 on Jan 14 • As usual some sites now appear, others disappeared … • Downtime? • Reconfiguration? More likely! • Some T2 consistently OK in MyWLCG, but most are absent • https://twiki.cern.ch/twiki/bin/view/LCG/GlexecDeployment • Updated Apr 22 • How to implement gLExec on the WN • Includes example script for Argus configuration • Provided by Antonio Delgado of CIEMAT  OK site in MyWLCG! Maarten Litmaath (CERN)

  4. Maarten Litmaath (CERN)

  5. Nagios tests for LHCb • https://sam-lhcb-prod.cern.ch/nagios/cgi-bin/status.cgi? servicegroup=SERVICE_CREAM-CE&style=detail Maarten Litmaath (CERN)

  6. Test results for CMS https://sam-cms-prod.cern.ch/nagios/cgi-bin/status.cgi? servicegroup=SERVICE_CREAM-CE&style=detail https://sam-cms-prod.cern.ch/nagios/cgi-bin/status.cgi? servicegroup=SERVICE_OSG-CE&style=detail Maarten Litmaath (CERN)

  7. Test results for ATLAS https://sam-atlas-prod.cern.ch/nagios/cgi-bin/status.cgi? servicegroup=SERVICE_CREAM-CE&style=detail https://sam-atlas-prod.cern.ch/nagios/cgi-bin/status.cgi? servicegroup=SERVICE_OSG-CE&style=detail Maarten Litmaath (CERN)

  8. Experiment plans • CMS • Opening tickets against sites where glexec does not work • Enabling glexec in GlideinWMS per site where it works • ATLAS • Re-implementing glexec usage by pilot • ALICE • Implementing Security TEG proposal may be viable • Use of specially crafted proxies with critical extension that is only understood by glexec • LHCb • Re-enabled and tested glexec support in DIRAC • Ran into gLExec infrastructure issues at many sites Maarten Litmaath (CERN)

More Related