Grid Operations Centre Update

Trevor Daniels, LCG Grid Deployment Board, 10th November 2003

Presentation Transcript


  1. Grid Operations Centre Update Trevor Daniels LCG Grid Deployment Board 10th November 2003

  2. Outline • New Staff • Steering Group Meeting • Work in Progress

  3. New Staff • one moved p/t to GOC staff in Oct: Glen Johnson (p/t); background in EDG, Unix; will develop accounting system • two started f/t 1 Nov 2003: David Kant (background in EDG, Unix; GOC sysadmin) and Matt Thorpe (background in user support; application (monitors, etc) maintenance) • now up to proposed strength

  4. Steering Group Meeting • Phone Conference 20th October • Monitors for GOC Phase 2 • Operational Procedures • Accounting • Actions on various people

  5. Monitors for GOC Phase 2 • All monitors found to be useful • gppmon (quick high-level state) • MapCenter (detailed tests; history) • GridICE (user-level information) • SLA tests (moving to MapCenter) • So in Phase 2: • continue to develop present monitors, plus • MonALISA • network monitors • MapCenter

  6. MapCenter • Collaborating well with Franck Bonnassieux • Debugging problems with firewalls • Good vehicle for adding GOC-specific tests • Testing SLA tests • ce-auth installed • rb-joblm next

  7. Operational Procedures • SLA Guide • Site Self-Audit • Procedures for Resource Admins

  8. Accounting Plan • define accounting schema, • develop filters to transform required data from sites to the schema for one or two batch systems, • develop mechanisms for collecting data from sites and transmitting it to the GOC, • develop mechanism for matching up data from batch and CE, • develop and install suitable DB to hold accounting data, • develop suitable web-based static and interactive reports.
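A minimal sketch of the first few steps of this plan (schema, filter, and batch/CE matching) might look as follows. Everything in it is an assumption made for illustration: the `UsageRecord` fields, the `key=value` batch log format, the site name, and the CE mapping from local job ID to user DN are hypothetical and are not taken from the GOC design or from any real batch system.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class UsageRecord:
    """Hypothetical common accounting schema (field names are assumptions)."""
    site: str
    local_job_id: str
    local_user: str
    cpu_seconds: int
    wall_seconds: int
    grid_user_dn: Optional[str] = None  # filled in only when matched with CE data


def hms_to_seconds(text: str) -> int:
    """Convert an 'HH:MM:SS' duration into seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def parse_batch_line(site: str, line: str) -> UsageRecord:
    """Filter: map one line of a made-up 'key=value' batch log onto the schema."""
    fields = dict(item.split("=", 1) for item in line.split())
    return UsageRecord(
        site=site,
        local_job_id=fields["jobid"],
        local_user=fields["user"],
        cpu_seconds=hms_to_seconds(fields["cput"]),
        wall_seconds=hms_to_seconds(fields["walltime"]),
    )


def match_with_ce(records: List[UsageRecord], ce_jobs: Dict[str, str]) -> List[UsageRecord]:
    """Attach the user DN recorded by the CE, keyed on the local job ID."""
    for record in records:
        record.grid_user_dn = ce_jobs.get(record.local_job_id)
    return records


# Invented sample data: two batch jobs, only one of which came through the CE.
batch_log = [
    "jobid=1001 user=atlas001 cput=00:10:00 walltime=00:12:30",
    "jobid=1002 user=localuser cput=01:00:05 walltime=01:30:00",
]
ce_jobs = {"1001": "/O=Example/CN=Some User"}

records = match_with_ce(
    [parse_batch_line("EXAMPLE-SITE", line) for line in batch_log], ce_jobs
)
grid_cpu = sum(r.cpu_seconds for r in records if r.grid_user_dn is not None)
total_cpu = sum(r.cpu_seconds for r in records)
print(grid_cpu, total_cpu)
```

Keeping the filter per batch system and the schema shared means a second batch system would only need its own `parse_batch_line` equivalent. Records with no CE match keep `grid_user_dn` as `None`, so reports can separate Grid-submitted work from all work at a site.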

  9. Work in Progress • drafting Operational Procedures • moving to production GOC system • developing SLA-specific tests within MapCenter • developing gppmon • accounting • collaboration with GGUS

  10. Operational Procedures • Supplements to Security Policy • SLA Guide • Site Self-Audit Procedures • Procedures for Resource Admins • draft to Steering Group • will then be put to wider forum of local sysadmins • Meeting to be arranged at CERN, probably early in new year

  11. Production GOC System • Will be tailor-made for the job • Dedicated to GOC work only • MapCenter, gppmon, website initially • Other monitors after system is in production • MonALISA; network monitors • (not GridICE - remains at CERN) • Adding new tier2 sites to various monitors • four in Spain recently

  12. SLA-specific tests in MapCenter • high-level; close to user activity • but as specific to service being tested as possible • ce-auth (authentication test): done • rb-joblm (job-list-match): running; not yet in MapCenter • mds-ldap (LDAP query): running; not yet in MapCenter

  13. Developing gppmon • to show state of RBs • to add history • after migration to production GOC

  14. Accounting Issue • Should we be accounting all work? • or only that submitted via the Grid? • RRB is considering all LHC work

  15. Collaboration with GGUS • Will share Remedy system at Karlsruhe • GOC now has access • Will shortly begin entering problems and resolutions

  16. Summary • Making Progress • Established Steering Committee • Accounting a priority • Start direct contacts with sysadmins