1 / 38

Experiences of the Grid…

Experiences of the Grid…. Gavin McCance University of Glasgow NeSC Meeting, 24 October 2001. Background. Experimental Particle Physics background Analysing the structure of matter …Fortran (19)77 ! Working in ‘Grid’-like areas since January this year. GridPP. 20+ institutes…. …GridPP.

Download Presentation

Experiences of the Grid…

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Experiences of the Grid… Gavin McCance University of Glasgow NeSC Meeting, 24 October 2001

  2. Background • Experimental Particle Physics background • Analysing the structure of matter • …Fortran (19)77 ! • Working in ‘Grid’-like areas since January this year Gavin McCance, University of Glasgow

  3. GridPP 20+ institutes… Gavin McCance, University of Glasgow

  4. …GridPP • £17M 3-year project • Working in collaboration with EU DataGrid project • Middleware production • Integration of middleware technologies into HEP experiments • Validation of Grid Software Gavin McCance, University of Glasgow

  5. …GridPP • Initial GridPP testbed underway • A personal snapshot of activities on the grid… • Middleware activities we’re involved in • Some examples • Technologies we’re using • Issues with integration of ‘Grid’ with particle physics experiments Gavin McCance, University of Glasgow

  6. Middleware • What is middleware…??? Application programs – local gridopen() Layered API’s. Transparent security. Transparent data access. Intelligent use of distributed resources. Grid middleware Data access specifics – HPSS, Castor Job submission specifics – PBS, LSF Specific security procedures Gavin McCance, University of Glasgow

  7. Middleware Activities • GridPP ~mirrors EU DataGrid: • Workload Management • What jobs go where? • Data Management (*) • Where’s the (best) data? • Information Services (*) • What’s the state of everything? Gavin McCance, University of Glasgow

  8. …Middleware Activities • Fabric Management • Interfaces to underlying systems • Mass Storage Management • How to get the data to/from the fabric e.g. Implementing ‘file-save()’ APIs for different mass storage systems • Security • Crops up everywhere … transparent to applications Gavin McCance, University of Glasgow

  9. Data Management • Data Replication • Transparent and Secure Data Access • Meta Data Storage • Query Optimisation Gavin McCance, University of Glasgow

  10. Example problem: Data Replication • Problems if data exist only in one place • Multiple accesses to the same data overload network! Petabytes! • Funding constraints! e.g. CERN can’t store all of the data required • Make Replica! But need to keep track of all the files and their various replica! • Need replica catalogue! Gavin McCance, University of Glasgow

  11. …Catalogues • Examples solutions: • Have a globally unique Logical File Name (LFN) mapping to multiple physical instances of the file (PFNs). • Replica selection required • Choose the ‘best’ / ‘nearest’ / ‘fastest’ • Cost modelling… how time expensive to transfer files X’ from A to B File-1 LFN Paris File-1 Glasgow File-1 Chicago File-1 Gavin McCance, University of Glasgow

  12. …Data Replication • Grid Data Mirroring Package • C, C++, JAVA, command-line APIs • Replication issues: • File transfer… • Synchronisation / consistency models • Basic middleware doesn’t enforce any policy • Scalable architectures Gavin McCance, University of Glasgow

  13. …GDMP • File transfer uses GridFTP • Existing IETF-approved (?RFC?) ftp additions • + the standard grid security (GSI) • Registers new files in replica catalogue • E.g. interfaced to the existing Globus Replica Catalogue • Basic replica manager functionality to maintain consistency of replica sets Gavin McCance, University of Glasgow

  14. CERN Root RC INFN RC UK RC CERN RC …Implementation issues • Structure not imposed by the middleware software itself… • But … must think about scalable implementations • E.g. a RC may exist on each storage element  responsible for its own files Queries will propagate down until replica information is found… Gavin McCance, University of Glasgow

  15. …Longer term problems • Query / Replica Optimisation • Grid can make / delete replica • Eg. Many people in Glasgow & Edinburgh access the ATLAS Higgs dataset ‘A1’… • Autonomously make new replica in / near Scotland based on historical information • Grid might re-cluster data Paris Glasgow Paris Glasgow A1 B3 A1 B3 A2 B1 A2 B2 A3 A3 B2 B1 Gavin McCance, University of Glasgow

  16. MONARC simulation tool real Grid... …simulated Grid provides testing arena for more adventurous ideas! …longer term Gavin McCance, University of Glasgow

  17. …Integration of middleware • Many iterations of requirements and use-cases with end-users… meetings… • Middleware solutions must be scalable and useable by a variety of end users • HEP, Biological, Earth sciences, Astro • Always looking for common elements • E.g. replica / meta-data catalogues… data transport… security… Gavin McCance, University of Glasgow

  18. …examples of common interfaces:generic meta-data catalogue tools • SQL Database Service: • Problem: many relational databases, diverse security, diverse wire protocols • …Solution: • Build on existing wire protocols: XML transported over HTTP(S) • Grid standard security framework (GSI) Gavin McCance, University of Glasgow

  19. ..examples • Leverage open-source technology • JAVA servlet based (Apache Tomcat engine) • JDBC drivers • Utilises Oracle’s XSQL servlet (open source) • Security over HTTPS with Grid-standard GSI mechanism Gavin McCance, University of Glasgow

  20. …examples • Allows any HTTP compliant system e.g. Web-browsers / standard C++ HTTP libraries to access any relational database… + + Oracle PostgreSQL PKI Security Standard communication protocols (XML over HTTPS) = SQL Database Service (Spitfire) Gavin McCance, University of Glasgow

  21. Global Grid Forum • Global Grid Forum meetings • GGF1: Amsterdam meeting in April 2001 • Helps define aspects common to all Grid-like projects. • E.g. architectures, ‘grid’ protocols • As example… Grid Monitoring Architecture (GMA) Gavin McCance, University of Glasgow

  22. Information Services - GMA • One Implementation of the GMA •  Globus MDS, currently based on (Open)LDAP • Hierarchical directory like structure • Very fast for information retrieval if you already know the query  designed into structure. • Bad for complex or ranged queries Gavin McCance, University of Glasgow

  23. ..complementary implementation Register, re-register, publish Producer Servlet Registry Servlet Producer Producer API Schema Servlet subscribe Query stream Consumer Querying API Implementation of GMA Relational queries in SQL format Relational Database Gavin McCance, University of Glasgow

  24. …relational GMA • Information is transferred in generic SQL format… • ‘Producers’ of information register themselves… • ‘Consumers’ construct (possibly complex) SQL query and are streamed query results directly from Producers. Gavin McCance, University of Glasgow

  25. …implementation • Again, uses JAVA servlets • Tomcat servlet engine • Again, communication with servlet is over standard HTTP. • All the internal parts communicate via HTTP and XML modular design, easily replaceable… Gavin McCance, University of Glasgow

  26. Useful Tools… • JAVA… nicely platform independent • UML Universal(?) Modelling Language • Architecture and API’s ‘should be’ defined in this…! • CASE tools • Together Control Centre Gavin McCance, University of Glasgow

  27. …useful tools • Globus toolkit • Both the original and its java implementation (CoG) • My experience of CoG so far is generally good…! • Easy GSI authentication, Globus file transfer, Globus job submission, MDS interface Gavin McCance, University of Glasgow

  28. Testbeds • For GridPP, primary testbeds are the HEP experiment ones • CERN LHC (EU DataGrid WP8) • US experiments, e.g. Fermilab, SLAC • First software release now!! • Integration team ‘show-and-tell’ at CERN end of this month… Gavin McCance, University of Glasgow

  29. ...testbed work Grid software packaged for release to experiments! Primarily packaged using RPM For end of October release, supported platforms are: Linux (and Solaris on a best effort basis) Gavin McCance, University of Glasgow

  30. ..Globus installation • Generally found the Globus software installation OK! • Successfully deployed on a number of batch systems in UK • Experience fed back into eScience Centres Difficulties were setting up and recognising each countries’ Certificate Authorities (CAs)  Tricky legal implications to resolve! Gavin McCance, University of Glasgow

  31. Testbed work so far… • UK Certificate Authority set-up… • Many institutes already on testbed Grid Status and Network monitoring demonstrator available soon Networking status information provided by GridPP and DataGrid networking groups! Gavin McCance, University of Glasgow

  32. …testbed work so far • Successful tests within ATLAS (and others) of some middleware products • E.g. Large file transfers between UK, Italy, US and CERN • Further tests planned with new release! Gavin McCance, University of Glasgow

  33. …experimental integration • Work to do… • Taking the kit and trying to integrate it into the experiments’ software frameworks ATLAS/LHCb software framework (GAUDI) Make Grid Services transparently available to ATLAS and LHCb programs GANGA framework Grid middleware Gavin McCance, University of Glasgow

  34. Grid validation • Preliminary tests of basic middleware has been successful • Now we have opportunity to see how it performs and scales with real datasets and real experimental users Gavin McCance, University of Glasgow

  35. Overall experience • Middleware development is fun! • Several good products have already been delivered • Re-using industry standard components and protocols where they exist • LDAP, SQL, HTTP(S), XML, SOAP • PKI security • Open Source…! Gavin McCance, University of Glasgow

  36. …overall • Middleware being built using a variety of languages… JAVA, C++, C, Python • APIs should be available for all JAVA, C++, C and command line… web access(?) Gavin McCance, University of Glasgow

  37. …overall • Coordination very important • Forums for discussion: • Vital to ensure middleware is useful to a wide range of applications • Prevent divergent technology Gavin McCance, University of Glasgow

  38. …finally • Preliminary grid software architectures have been defined • Basic middleware has been delivered • Large scale validation underway NOW!! • A excellent base to build on! • Plenty still to do! Gavin McCance, University of Glasgow

More Related