1 / 7

Planning and Managing Virtual Data Requests In a Grid Environment

Planning and Managing Virtual Data Requests In a Grid Environment. Miron Livny University of Wisconsin-Madison. Driving Concepts. Virtual Data – Service Requests are in the form “ place y = F(x) at location L”. A common reference Data Grid Architecture.

lucyharold
Download Presentation

Planning and Managing Virtual Data Requests In a Grid Environment

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Planning and Managing Virtual Data Requests In a Grid Environment Miron Livny University of Wisconsin-Madison LCGP 13.3.2002

  2. Driving Concepts • Virtual Data – Service Requests are in the form “ place y = F(x) at location L”. • A common reference Data Grid Architecture. • Integrated research efforts in the areas of planning algorithms and scheduling policies. • A framework of networked services connected by reliable, recoverable and bi-directional interfaces. • Uniform view of processing and data placement activities. • Job flow management based on Directed Acyclic Graphs (DAGs) of jobs. • Evaluation of Grid technology via end-to-end implementations of “real-life” services. • Interoperability with “external” technologies and deployed infrastructure. LCGP 13.3.2002

  3. MCAT; GriPhyN catalogs MDS MDS GDMP Condor-G, DAGMan, Kangaroo GSI, CAS Globus GRAM GridFTP; GRAM; SRM GriPhyN/PPDGData Grid Architecture Application = initial solution is operational DAG (logical) Catalog Services Monitoring Planner Info Services DAG (physical) Repl. Mgmt. Executor Policy/Security Reliable Transfer Service Compute Resource Storage Resource LCGP 13.3.2002

  4. From Virtual to Real • To do or not to do? (does y exist? what is “cheaper”, to retrieve or to materialize?) • From a request to a logical DAG (Virtual Data Language - VDL) • From a logical DAG to a physical DAG – • Select an execution site, a source for x and a destination for y • Build a “job” to allocate space for storing x and y at execution “location” (eager or lazy?) • Build a “job” to move x to execution location • Build a “job” to run F on x and generate y • Build a “job” to move y to destination • Build a “job” to free allocated space • Execute physical DAG LCGP 13.3.2002

  5. Services Needed • Collection and retrieval of information – what is where, who can do what, … • Planning of a physical DAG • All jobs at DAG submission time (eager) • A job at a time when job is ready to go (lazy) or when resource is available (very lazy) • Any combination of the above … • Execution of a DAG – A DAG Manager dispatches a job when the conditions it waits for are satisfied. • Scheduling of Data Placement (DaP) and Processing jobs • Allocation of storage space – Give me a “lot” of X MB for Y days at location Z and call it L • Transfer files – Move file F (or part of) from A to B and place it in L • Run job J on machine M LCGP 13.3.2002

  6. Services (soon) Available • The Globus Tool Kit – Inter-domain information security and job submission services. • The Condor system – Intra-domain information, security, job management and resource allocation services. • Condor-G – Job management services for Globus jobs • RLS – Logical to physical mapping of file names • DRM – Data staging services • VDS – Virtual Data Language, Derivation and Transformation Catalogs • GridFTP and RFT – file/data movers • DAGMan – Job flow services LCGP 13.3.2002

  7. Challenges (near term) • Integrate existing (generic) services with experiment (specific) software to form a vertically integrated end-to-end Grid enabled “production quality” capability. • Effectively leverage available technology sources and deployed capabilities. • Co-manage storage and processing resources. • Provide a “basic” set of exception handling and error recovery services. • Avoid “stupid” planning and/or scheduling decisions. • Interplay of authorization, planning and scheduling. • Packaging and support. • Have fun. LCGP 13.3.2002

More Related