1 / 188

Grid Computing: Concepts, Appplications, and Technologies

Grid Computing: Concepts, Appplications, and Technologies. Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer Science The University of Chicago http://www.mcs.anl.gov/~foster.

trella
Download Presentation

Grid Computing: Concepts, Appplications, and Technologies

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Grid Computing:Concepts, Appplications, and Technologies Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer Science The University of Chicago http://www.mcs.anl.gov/~foster Grid Computing in Canada Workshop, University of Alberta, May 1, 2002

  2. Outline • The technology landscape • Grid computing • The Globus Toolkit • Applications and technologies • Data-intensive; distributed computing; collaborative; remote access to facilities • Grid infrastructure • Open Grid Services Architecture • Global Grid Forum • Summary and conclusions

  3. Outline • The technology landscape • Grid computing • The Globus Toolkit • Applications and technologies • Data-intensive; distributed computing; collaborative; remote access to facilities • Grid infrastructure • Open Grid Services Architecture • Global Grid Forum • Summary and conclusions

  4. Living in an Exponential World(1) Computing & Sensors Moore’s Law: transistor count doubles each 18 months Magnetohydro- dynamics star formation

  5. Living in an Exponential World:(2) Storage • Storage density doubles every 12 months • Dramatic growth in online data (1 petabyte = 1000 terabyte = 1,000,000 gigabyte) • 2000 ~0.5 petabyte • 2005 ~10 petabytes • 2010 ~100 petabytes • 2015 ~1000 petabytes? • Transforming entire disciplines in physical and, increasingly, biological sciences; humanities next?

  6. Data Intensive Physical Sciences • High energy & nuclear physics • Including new experiments at CERN • Gravity wave searches • LIGO, GEO, VIRGO • Time-dependent 3-D systems (simulation, data) • Earth Observation, climate modeling • Geophysics, earthquake modeling • Fluids, aerodynamic design • Pollutant dispersal scenarios • Astronomy: Digital sky surveys

  7. Ongoing Astronomical Mega-Surveys • Large number of new surveys • Multi-TB in size, 100M objects or larger • In databases • Individual archives planned and under way • Multi-wavelength view of the sky • > 13 wavelength coverage within 5 years • Impressive early discoveries • Finding exotic objects by unusual colors • L,T dwarfs, high redshift quasars • Finding objects by time variability • Gravitational micro-lensing MACHO 2MASS SDSS DPOSS GSC-II COBE MAP NVSS FIRST GALEX ROSAT OGLE ...

  8. Crab Nebula in 4 Spectral Regions X-ray Optical Infrared Radio

  9. Coming Floods of Astronomy Data • The planned Large Synoptic Survey Telescope will produce over 10 petabytes per year by 2008! • All-sky survey every few days, so will have fine-grain time series for the first time

  10. Data Intensive Biology and Medicine • Medical data • X-Ray, mammography data, etc. (many petabytes) • Digitizing patient records (ditto) • X-ray crystallography • Molecular genomics and related disciplines • Human Genome, other genome databases • Proteomics (protein structure, activities, …) • Protein interactions, drug delivery • Virtual Population Laboratory (proposed) • Simulate likely spread of disease outbreaks • Brain scans (3-D, time dependent)

  11. A Brainis a Lotof Data!(Mark Ellisman, UCSD) And comparisons must be made among many We need to get to one micron to know location of every cell. We’re just now starting to get to 10 microns – Grids will help get us there and further

  12. An Exponential World: (3) Networks(Or, Coefficients Matter …) • Network vs. computer performance • Computer speed doubles every 18 months • Network speed doubles every 9 months • Difference = order of magnitude per 5 years • 1986 to 2000 • Computers: x 500 • Networks: x 340,000 • 2001 to 2010 • Computers: x 60 • Networks: x 4000 Moore’s Law vs. storage improvements vs. optical improvements. Graph from Scientific American (Jan-2001) by Cleo Vilett, source Vined Khoslan, Kleiner, Caufield and Perkins.

  13. Outline • The technology landscape • Grid computing • The Globus Toolkit • Applications and technologies • Data-intensive; distributed computing; collaborative; remote access to facilities • Grid infrastructure • Open Grid Services Architecture • Global Grid Forum • Summary and conclusions

  14. Evolution of the Scientific Process • Pre-electronic • Theorize &/or experiment, alone or in small teams; publish paper • Post-electronic • Construct and mine very large databases of observational or simulation data • Develop computer simulations & analyses • Exchange information quasi-instantaneously within large, distributed, multidisciplinary teams

  15. Evolution of Business • Pre-Internet • Central corporate data processing facility • Business processes not compute-oriented • Post-Internet • Enterprise computing is highly distributed, heterogeneous, inter-enterprise (B2B) • Outsourcing becomes feasible => service providers of various sorts • Business processes increasingly computing- and data-rich

  16. The Grid “Resource sharing & coordinated problem solving in dynamic, multi-institutional virtual organizations”

  17. An Example Virtual Organization: CERN’s Large Hadron Collider 1800 Physicists, 150 Institutes, 32 Countries 100 PB of data by 2010; 50,000 CPUs?

  18. ~PBytes/sec ~100 MBytes/sec Offline Processor Farm ~20 TIPS There is a “bunch crossing” every 25 nsecs. There are 100 “triggers” per second Each triggered event is ~1 MByte in size ~100 MBytes/sec Online System Tier 0 CERN Computer Centre ~622 Mbits/sec or Air Freight (deprecated) Tier 1 FermiLab ~4 TIPS France Regional Centre Germany Regional Centre Italy Regional Centre ~622 Mbits/sec Tier 2 Tier2 Centre ~1 TIPS Caltech ~1 TIPS Tier2 Centre ~1 TIPS Tier2 Centre ~1 TIPS Tier2 Centre ~1 TIPS HPSS HPSS HPSS HPSS HPSS ~622 Mbits/sec Institute ~0.25TIPS Institute Institute Institute Physics data cache ~1 MBytes/sec 1 TIPS is approximately 25,000 SpecInt95 equivalents Physicists work on analysis “channels”. Each institute will have ~10 physicists working on one or more channels; data for these channels should be cached by the institute server Pentium II 300 MHz Pentium II 300 MHz Pentium II 300 MHz Pentium II 300 MHz Tier 4 Physicist workstations Grid Communities & Applications:Data Grids for High Energy Physics www.griphyn.org www.ppdg.net www.eu-datagrid.org

  19. Data Integration and Mining: (credit Sara Graves) From Global Information to Local Knowledge Emergency Response Precision Agriculture Urban Environments Weather Prediction

  20. Intelligent Infrastructure:Distributed Servers and Services

  21. Grid Computing

  22. The Grid:A Brief History • Early 90s • Gigabit testbeds, metacomputing • Mid to late 90s • Early experiments (e.g., I-WAY), academic software projects (e.g., Globus, Legion), application experiments • 2002 • Dozens of application communities & projects • Major infrastructure deployments • Significant technology base (esp. Globus ToolkitTM) • Growing industrial interest • Global Grid Forum: ~500 people, 20+ countries

  23. The Grid World: Current Status • Dozens of major Grid projects in scientific & technical computing/research & education • www.mcs.anl.gov/~foster/grid-projects • Considerable consensus on key concepts and technologies • Open source Globus Toolkit™ a de facto standard for major protocols & services • Industrial interest emerging rapidly • IBM, Platform, Microsoft, Sun, Compaq, … • Opportunity: convergence of eScience and eBusiness requirements & technologies

  24. Outline • The technology landscape • Grid computing • The Globus Toolkit • Applications and technologies • Data-intensive; distributed computing; collaborative; remote access to facilities • Grid infrastructure • Open Grid Services Architecture • Global Grid Forum • Summary and conclusions

  25. Grid Technologies:Resource Sharing Mechanisms That … • Address security and policy concerns of resource owners and users • Are flexible enough to deal with many resource types and sharing modalities • Scale to large number of resources, many participants, many program components • Operate efficiently when dealing with large amounts of data & computation

  26. Aspects of the Problem • Need for interoperability when different groups want to share resources • Diverse components, policies, mechanisms • E.g., standard notions of identity, means of communication, resource descriptions • Need for shared infrastructure services to avoid repeated development, installation • E.g., one port/service/protocol for remote access to computing, not one per tool/appln • E.g., Certificate Authorities: expensive to run • A common need for protocols & services

  27. The Hourglass Model • Focus on architecture issues • Propose set of core services as basic infrastructure • Use to construct high-level, domain-specific solutions • Design principles • Keep participation cost low • Enable local control • Support for adaptation • “IP hourglass” model A p p l i c a t i o n s Diverse global services Core services Local OS

  28. Application Application Internet Protocol Architecture “Coordinating multiple resources”: ubiquitous infrastructure services, app-specific distributed services Collective “Sharing single resources”: negotiating access, controlling use Resource “Talking to things”: communication (Internet protocols) & security Connectivity Transport Internet “Controlling things locally”: Access to, & control of, resources Fabric Link Layered Grid Architecture(By Analogy to Internet Architecture)

  29. Globus Toolkit™ • A software toolkit addressing key technical problems in the development of Grid-enabled tools, services, and applications • Offer a modular set of orthogonal services • Enable incremental development of grid-enabled tools and applications • Implement standard Grid protocols and APIs • Available under liberal open source license • Large community of developers & users • Commercial support

  30. General Approach • Define Grid protocols & APIs • Protocol-mediated access to remote resources • Integrate and extend existing standards • “On the Grid” = speak “Intergrid” protocols • Develop a reference implementation • Open source Globus Toolkit • Client and server SDKs, services, tools, etc. • Grid-enable wide variety of tools • Globus Toolkit, FTP, SSH, Condor, SRB, MPI, … • Learn through deployment and applications

  31. Key Protocols • The Globus Toolkit™ centers around four key protocols • Connectivity layer: • Security: Grid Security Infrastructure (GSI) • Resource layer: • Resource Management: Grid Resource Allocation Management (GRAM) • Information Services: Grid Resource Information Protocol (GRIP) and Index Information Protocol (GIIP) • Data Transfer: Grid File Transfer Protocol (GridFTP) • Also key collective layer protocols • Info Services, Replica Management, etc.

  32. Job manager Job manager Globus Toolkit Structure Service naming Soft state management Reliable invocation GRAM MDS GridFTP MDS ??? Notification GSI GSI GSI Other Service or Application Compute Resource Data Resource

  33. Connectivity LayerProtocols & Services • Communication • Internet protocols: IP, DNS, routing, etc. • Security: Grid Security Infrastructure (GSI) • Uniform authentication, authorization, and message protection mechanisms in multi-institutional setting • Single sign-on, delegation, identity mapping • Public key technology, SSL, X.509, GSS-API • Supporting infrastructure: Certificate Authorities, certificate & key management, … GSI: www.gridforum.org/security/gsi

  34. Why Grid Security is Hard • Resources being used may be extremely valuable & the problems being solved extremely sensitive • Resources are often located in distinct administrative domains • Each resource may have own policies & procedures • The set of resources used by a single computation may be large, dynamic, and/or unpredictable • Not just client/server • It must be broadly available & applicable • Standard, well-tested, well-understood protocols • Integration with wide variety of tools

  35. Grid Security Requirements User View Resource Owner View 1) Specify local access control 2) Auditing, accounting, etc. 3) Integration w/ local systemKerberos, AFS, license mgr. 4) Protection from compromisedresources 1) Easy to use 2) Single sign-on 3) Run applicationsftp,ssh,MPI,Condor,Web,… 4) User based trust model 5) Proxies/agents (delegation) Developer View API/SDK with authentication, flexible message protection, flexible communication, delegation, ...Direct calls to various security functions (e.g. GSS-API)Or security integrated into higher-level SDKs: E.g. GlobusIO, Condor-G, MPICH-G2, HDF5, etc.

  36. Grid Security Infrastructure (GSI) • Extensions to existing standard protocols & APIs • Standards: SSL/TLS, X.509 & CA, GSS-API • Extensions for single sign-on and delegation • Globus Toolkit reference implementation of GSI • SSLeay/OpenSSL + GSS-API + delegation • Tools and services to interface to local security • Simple ACLs; SSLK5 & PKINIT for access to K5, AFS, etc. • Tools for credential management • Login, logout, etc. • Smartcards • MyProxy: Web portal login and delegation • K5cert: Automatic X.509 certificate creation

  37. Single sign-on via “grid-id” & generation of proxy cred. Or: retrieval of proxy cred. from online repository Remote process creation requests* GSI-enabled GRAM server Authorize Map to local id Create process Generate credentials Ditto GSI-enabled GRAM server Process Process Communication* Local id Local id Kerberos ticket Restricted proxy Remote file access request* Restricted proxy User Proxy GSI-enabled FTP server Proxy credential Authorize Map to local id Access file * With mutual authentication GSI in Action: “Create Processes at A and B that Communicate & Access Files at C” User Site B (Unix) Site A (Kerberos) Computer Computer Site C (Kerberos) Storage system

  38. GSI Working Group Documents • Grid Security Infrastructure (GSI) Roadmap • Informational draft overview of working group activities and documents • Grid Security Protocols & Syntax • X.509 Proxy Certificates • X.509 Proxy Delegation Protocol • The GSI GSS-API Mechanism • Grid Security APIs • GSS-API Extensions for the Grid • GSI Shell API

  39. GSI Futures • Scalability in numbers of users & resources • Credential management • Online credential repositories (“MyProxy”) • Account management • Authorization • Policy languages • Community authorization • Protection against compromised resources • Restricted delegation, smartcards

  40. 1. CAS request, with user/group CAS resource names membership Does the and operations collective policy resource/collective authorize this 2. CAS reply, with membership request for this capability and resource CA info user? collective policy information Resource 3. Resource request, authenticated with Is this request capability authorized by the local policy capability? information 4. Resource reply Is this request authorized for the CAS? Community Authorization User Laura Pearlman, Steve Tuecke, Von Welch, others

  41. Resource LayerProtocols & Services • Grid Resource Allocation Management (GRAM) • Remote allocation, reservation, monitoring, control of compute resources • GridFTP protocol (FTP extensions) • High-performance data access & transport • Grid Resource Information Service (GRIS) • Access to structure & state information • Others emerging: Catalog access, code repository access, accounting, etc. • All built on connectivity layer: GSI & IP GRAM, GridFTP, GRIS: www.globus.org

  42. Resource Management • The Grid Resource Allocation Management (GRAM) protocol and client API allows programs to be started and managed on remote resources, despite local heterogeneity • Resource Specification Language (RSL) is used to communicate requirements • A layered architecture allows application-specific resource brokers and co-allocators to be defined in terms of GRAM services • Integrated with Condor, PBS, MPICH-G2, …

  43. Broker Co-allocator Resource Management Architecture RSL specialization RSL Application Information Service Queries & Info Ground RSL Simple ground RSL Local resource managers GRAM GRAM GRAM LSF Condor NQE

  44. Data Access & Transfer • GridFTP: extended version of popular FTP protocol for Grid data access and transfer • Secure, efficient, reliable, flexible, extensible, parallel, concurrent, e.g.: • Third-party data transfers, partial file transfers • Parallelism, striping (e.g., on PVFS) • Reliable, recoverable data transfers • Reference implementations • Existing clients and servers: wuftpd, ncftp • Flexible, extensible libraries in Globus Toolkit

  45. The Grid Information Problem • Large numbers of distributed “sensors” with different properties • Need for different “views” of this information, depending on community membership, security constraints, intended purpose, sensor type

  46. The Globus Toolkit Solution: MDS-2 Registration & enquiry protocols, information models, query languages • Provides standard interfaces to sensors • Supports different “directory” structures supporting various discovery/access strategies

  47. Globus Applications and Deployments • Application projects include • GriPhyN, PPDG, NEES, EU DataGrid, ESG, Fusion Collaboratory, etc., etc. • Infrastructure deployments include • DISCOM, NASA IPG, NSF TeraGrid, DOE Science Grid, EU DataGrid, etc., etc. • UK Grid Center, U.S. GRIDS Center • Technology projects include • Data Grids, Access Grid, Portals, CORBA, MPICH-G2, Condor-G, GrADS, etc., etc.

  48. Globus Futures • Numerous large projects are pushing hard on production deployment & application • Much will be learned in next 2 years! • Active R&D program, focused for example on • Security & policy for resource sharing • Flexible, high-perf., scalable data sharing • Integration with Web Services etc. • Programming models and tools • Community code development producing a true Open Grid Architecture

  49. Outline • The technology landscape • Grid computing • The Globus Toolkit • Applications and technologies • Data-intensive; distributed computing; collaborative; remote access to facilities • Grid infrastructure • Open Grid Services Architecture • Global Grid Forum • Summary and conclusions

  50. Important Grid Applications • Data-intensive • Distributed computing (metacomputing) • Collaborative • Remote access to, and computer enhancement of, experimental facilities

More Related