site validation session report n.
Skip this Video
Loading SlideShow in 5 Seconds..
Site Validation Session Report PowerPoint Presentation
Download Presentation
Site Validation Session Report

Loading in 2 Seconds...

play fullscreen
1 / 9

Site Validation Session Report - PowerPoint PPT Presentation

  • Uploaded on

Site Validation Session Report. Co-Chairs: Piotr Nyczyk, CERN IT/GD Leigh Grundhoefer, IU / OSG Notes from Judy Novak WLCG-OSG-EGEE Workshop CERN, June 19-20th 2006. Service Availability Monitoring (SAM) - “extension” of SFT:.

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
Download Presentation

PowerPoint Slideshow about 'Site Validation Session Report' - cilicia-romeo

Download Now An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
site validation session report

Site Validation Session Report


Piotr Nyczyk, CERN IT/GD

Leigh Grundhoefer, IU / OSG

Notes from Judy Novak


CERN, June 19-20th 2006

service availability monitoring sam extension of sft
Service Availability Monitoring (SAM) - “extension” of SFT:
  • generalized framework to monitor all LCG/EGEE services and not only CE: BDII, RB, LFC, FTS, etc.
  • most of the sensors run remotely (from central machine)
  • no installation needed on service machines
  • moved from MySQL to Oracle, optimized data schema

Available at:

SAM sensors:
    • currently: BDII (Taiwan), RB (RAL), CE, SRM, LFC, FTS, SE (CERN)
  • release updates + SAM (SFT)
    • certifying current tests with each new release
    • Create update tests as necessary
    • CA cert. releases are special
  • Availability views
    • current, daily, weekly, monthly
    • For CE, SE, SRM, siteBDII
    • displayed with GridView

osg validation services
OSG Validation services
  • CE/SE Validation aggregation : VORS - site scanner, BDII info
  • OSG VO’s VOMS validation
  • GridEX - application validation ( pilot job submissions )
  • Site Policy template and publication
  • GIP Validation
  • Monitoring validation : MonALisa Client status (VO Jobs I/O)
  • GridCat and the MIS-CI client
    • - Production instance
    • Client software:
  • It seems to be impossible to avoid cross-monitoring (OSG monitoring doesn't include LCG-specific services, and the other way around)
  • We should synchronize on VO level, but LCG/EGEE is also using regional structuring
osg and egee validation interoperability
OSG and EGEE Validation Interoperability
  • Site discovery - using discovered sites using BDII
    • Ops VO - supported only on OSG sites which are interoperable. (fully deployed in July)
    • How can we determine if EGEE site is interoperable? Review certain BDII informations
  • Cross installation of necessary tools and libraries for site validation
    • LCG tools - added as optionally installed package for OSG sites
    • OSG environment variables - ? (GIP)
osg and egee validation interoperability cont
OSG and EGEE Validation Interoperability (cont)
  • Use of existing GGUS- OSG GOC ticket exchange for error reporting
    • SAM database to use contact information for OSG GOC
  • Issue of coordinating scheduled downtime
    • OSG GOC will maintain a web page with downtimes
  • Propose review of effort to add OSG specific validations to SAM framework.
  • Testing and iterative development will be accomplished using Pre-Production sites and OSG ITB
db monitoring in sam for tier 1 s dirk duellmann
DB monitoring in SAM for Tier 1’s (Dirk Duellmann)
  • Jobs are connecting to the DB with either http (VO lib) or direct Oracle (instant client)
  • Should be completed by October when experiments will start using DBs
  • CMS + Alice don't need them, but only 'squid’
  • existing DB monitoring is too detailed for SAM/SFT, but SAM could provide highlevel monitoring of DB service
  • some DB services (like LFC) are already tested by SAM, BUT only the functionality is tested, not the DB! The test could be:
    • threshold for connection between T0 -> T1
    • user access (squid)
    • client latency (?)
  • Oracle client will be installed on the Worker Nodes