1 / 20

Middleware Selection

Middleware Selection GridPP13 – Durham – July’05 Robin Middleton Introduction LCG Baseline Service Group Report http://lcg.web.cern.ch/LCG/peb/bs/BSReport-v1.0.pdf Services:

niveditha
Download Presentation

Middleware Selection

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Middleware Selection GridPP13 – Durham – July’05 Robin Middleton

  2. Introduction • LCG Baseline Service Group Report • http://lcg.web.cern.ch/LCG/peb/bs/BSReport-v1.0.pdf • Services: • Storage Element; Basic File Transfer; Reliable File Transfer; Catalogue Services; Data Management Tools; Compute Element; Workload Management; VO Agents; VO Membership Services; Database Services; Posix-like I/O; Application software Installation Tools; Job Monitoring; Reliable Messaging; Information system • Experiment priorities for Services • A quick look at OMII • Conclusions GridPP Middleware Roadmap

  3. Storage Element • Characteristics: • Mass storage, disk pool, disk cache front-end • gridFTP service transfers in/out • local POSIX-like I/O • Authentication/authorisation & audit/accounting • SRM (version x.y) Interface • fine detail required evolves through service challenges • Implementations: • CASTOR (SRM v1.1); dCache (SRM v1.1++); LCG-DPM (SRM v1.1 & v2.1); DRM (SRM v1.1) • Goal: functionality in place by end 2005 GridPP Middleware Roadmap

  4. Basic File Transfer • Characteristics: • Critical • Absolutely reliable • fault tolerant against machine failures • load-balancing at sites • Transfer bandwidths at all sites need careful planning • Implementations: • gridFTP - from GT2; eventually migrate to GT4 version when proven • srmCopy - layered on top of gridFTP and preferred interface GridPP Middleware Roadmap

  5. Reliable File Transfer • Characteristics: • Layered on Basic Data Transfer • Request scheduling (relative VO priorities) • 3rd party transfers • Interaction with grid catalogues • Implementations: • gLite FTS (used to define base interfaces & functionality for any other future implementations) • Globus RFT (currently does not layer on srmCopy, though transfer restarts are possible) • also FTD (AliEn), Don Quixote (Atlas), PhedEx (CMS) & LHCb system GridPP Middleware Roadmap

  6. Catalogue Services • Characteristics: • Vary significantly between experiments dependent information (bookkeeping, metadata, etc.) • Implements “collections” – datasets, fileblocks, … • More than data files – e.g. log files, … • Mappings : LFN, GUID, SURL, … • Hierarchical namespace • Access control • Bulk operations • Interfaces to POOL, WMS, Posix-like I/O • Centralised catalogue • Implementations: • LCG File Catalogue - fulfils requirements • FireMan (gLite) – fulfils requirements • Globus RLS (does not implement all required interfaces) • Experiments plans: • AliEn FC (Alice, LHCb ?) • Atlas based on LFC, Fireman & Globus RLS (in US) • CMS possibly LFC, Fireman, other ? GridPP Middleware Roadmap

  7. Data Management Tools • Implementations: • LCG-2 provides complete set of tools for replica management, catalogue interaction and manipulation • gLite has similar toolset • POOL provides back-end catalogue manipulation • Future: • wish to see convergence on single toolset, but exact composition depends catalogue choices ! GridPP Middleware Roadmap

  8. Compute Element • Characteristics: • Job submission to local batch • Provide resource availability information • Availability of accounting information • Job status queries • Authentication, authorisation based on VOMS • Implementations: • Globus gatekeeper (LCG-2 & OSG); ARC (NorduGrid) • Info : standardised on GLUE schema • Accounting : standardised on GGF accounting schema • Job status : R-GMA (LCG-2.4) • new CE (gLite – based on Condor-C) will be evaluated to replace Globus GRAM-based CEs GridPP Middleware Roadmap

  9. Workload Management • Characteristics: • Express resource requirements • Service matches against resource availability & submits to best match • Interfaces to Data Location Interface, Storage Index • Implementations: • LCG-2 Resource Broker • Condor-G • gLite RB (push & pull) • Future: • expect WM systems to evolve and mature GridPP Middleware Roadmap

  10. VO Agents • Characteristics • perform activities for an experiment • job submission / monitoring • file transfer scheduling • database update scheduling • Implementations: • currently jobs running in batch – not ideal • generic solution needed GridPP Middleware Roadmap

  11. VO Membership Service • Characteristics: • register users • generate extended proxy certificates • handle authorisation for use of resources • Implementations: • all providers participating in LCG have/will have a VOMS service • mechanisms for mapping users/groups and to provide access control vary depending on local policies & requirements GridPP Middleware Roadmap

  12. Database Services • Characteristics: • Provide back-ends for file catalogues, metadata, conditions, etc. • Write access limited to experiment software managers • Reliable, scalable services based on reliable hardware • Required at Tier-0, Tier-1 and some Tier-2 • Some replication to remote sites of centralised databases • Implementations: • Oracle • MySQL (at some smaller sites) GridPP Middleware Roadmap

  13. Posix-like I/O • Characteristics: • Support intermediate libraries : POOL, ROOT, … • Support direct from application code • Communicate with Grid File Catalogues (allows LFN / GUID access) • Implementations: • LCG GFAL (Grid File Access Library) • gLiteIO (access control via Fireman catalogue) • xrootd • Comment: • remote (from other sites) I/O not expected, files should be replicated locally GridPP Middleware Roadmap

  14. Application Software Installation Tools • Characteristics: • VO specific installation of software • Validation of installed software • Write access limited to experiment software managers • Publish installed software in information system • Implementations: • toolset in LCG-2 • experiment specific solutions GridPP Middleware Roadmap

  15. Job Monitoring • Characteristics: • monitor & trace submitted grid jobs • Instrumentation of job wrapper scripts • VO-level monitoring of resource usage of the VO • Implementations: • partial solution in LCG-2 workload management system • LCG-2 publish Resource Broker info of every job in R-GMA (CPU time, wall clock time, memory usage, etc.) • ARC Middleware (NorduGrid) GridPP Middleware Roadmap

  16. Reliable Messaging • Characteristics: • messaging between applications, services & users • reliable, asynchronous • Implementations: • some experiment specific solutions – a common trustworthy service would be of value; making use of existing open source or public domain tools GridPP Middleware Roadmap

  17. Information System • Characteristics: • (not seen as a baseline service. but still crucial) • Info published through application interactions with other services • Schema must be adequate to describe services and their parameters • Implementations: • GLUE schema exists and proposed as standard for LCG (update expected in Q4 2005); common between EGEE, OSG & NorduGrid GridPP Middleware Roadmap

  18. Baseline Services Report- Experiment Priorities - A: High priority,mandatory B: Standard solutions required but expts could select different implementations C: Desirable common solution, but not essential GridPP Middleware Roadmap

  19. OMII • Evaluated within EGEE - reported at Athens meeting • based on web service standards; fully decentralised • might fulfil LCG requirements (superficially at least) in • user account management • job submission • file transfer • X.509 based security infrastructure • Users need accounts at all resource sites; sites queried for resources for each job; manual site selection; applications pre-installed at all participating sites • No services for data management (other than file transfer) • No support for catalogues, databases, mass storage, VO management • A major undertaking to use in meeting LHC requirements and would need significant additions of other tools… not clear what advantages would result. Possible use to leverage local resources (e.g. shared with non-HEP), but don’t under estimate work in forging interoperability and ongoing maintenance. GridPP Middleware Roadmap

  20. Conclusion • LCG brings together components from a number of sources and packages them in a common framework • No single project (EDG, gLite, VDT, OMII, …) provides all the answers • Baseline services report is an end-user view • Are there holes in the analysis ? • What about ? • Security infrastructure • Information system infrastructure • Grid monitoring & Operations toolsets • LCG/EGEE releases are best placed to meet LHC needs in the UK GridPP Middleware Roadmap

More Related