
Discussion Points



Presentation Transcript


  1. Discussion Points
     • Maintaining the right balance between research and development
     • Maintaining focus vs. accepting broader scope
       • E.g., international collaboration
       • E.g., GriPhyN in the large (GriPhyN-2)
       • E.g., Terascale
     • Creating a national cyberinfrastructure
       • What is our appropriate role

  2. Discussion Points
     • Outreach to other disciplines
       • Biology, NEES, …
     • Virtual data toolkit
       • Inclusive or focused?
     • Resource issue, again
       • Achieving critical mass of resources to deliver on the complete promise

  3. Planning
     • Review of Year 1 milestones
     • Top 10 research challenges
     • Demonstration projects
     • Research projects + goals
     • Workshops

  4. Year 1 Milestones: Virtual Data
     • Develop a basic information model to represent data elements, relationships between different data types, and characteristics of data elements
     • Develop protocols for storing, discovering, and retrieving these models
     • Design and develop tools for creating, accessing, and manipulating these models, for use by interactive tools and by planning and scheduling tools
     • Deploy centralized metadata and replica catalog services. Develop tools for managing catalogs
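As a rough illustration of the replica catalog milestone above, such a catalog can be thought of as a mapping from a logical file name to the physical locations that hold copies of it. The sketch below is a minimal in-memory version written for this transcript. The class name, method names, and example URLs are all hypothetical; this is not the Globus replica catalog API.

```python
from collections import defaultdict


class ReplicaCatalog:
    """Toy in-memory catalog: logical file name -> set of physical URLs.

    Hypothetical illustration only; not the interface of any real service.
    """

    def __init__(self):
        self._replicas = defaultdict(set)

    def register(self, logical_name, physical_url):
        """Record that a copy of logical_name exists at physical_url."""
        self._replicas[logical_name].add(physical_url)

    def unregister(self, logical_name, physical_url):
        """Forget one copy; drop the logical name once no copies remain."""
        self._replicas[logical_name].discard(physical_url)
        if not self._replicas[logical_name]:
            del self._replicas[logical_name]

    def lookup(self, logical_name):
        """Return all known locations, sorted; empty list if unknown."""
        return sorted(self._replicas.get(logical_name, ()))


# Example usage with made-up names and hosts.
catalog = ReplicaCatalog()
catalog.register("lfn:run42.dat", "gsiftp://site-b.example.org/data/run42.dat")
catalog.register("lfn:run42.dat", "gsiftp://site-a.example.org/data/run42.dat")
locations = catalog.lookup("lfn:run42.dat")
```

A planner could then choose among `locations` according to whatever cost model it uses; that choice is outside this sketch.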

  5. Year 1 Milestones: Request Planning
     • Develop generic models for representing execution plans. Define API and tools for constructing, traversing, and manipulating plan data structures. Develop protocols and formats for storing and exchanging execution plans.
     • Develop uniform policy representation for code, data, and resource access. Develop a set of global and local policy scenarios that reflect the requirements of the user communities of the four physics experiments.
     • Develop simple optimization heuristics. The initial thrust will be on data movement only, focusing on the use of alternative, or branching, plans to compensate for both resource failure and changes in resource performance. Implement planning heuristics in a prototype planning module. Evaluate the performance of alternatives with simulation and model-based studies, as well as execution on the GriPhyN testbed.
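The idea of alternative, or branching, plans for data movement can be sketched as an ordered list of alternatives that are tried in turn until one succeeds. This is only a minimal illustration under that assumption. The function names and the two example transfer actions are hypothetical, and the slides do not specify how GriPhyN plans were to be represented.

```python
def run_with_alternatives(alternatives):
    """Try each (name, action) pair in order.

    Returns (name, result) for the first action that completes without
    raising. Raises RuntimeError if every alternative fails.
    """
    errors = []
    for name, action in alternatives:
        try:
            return name, action()
        except Exception as exc:  # a failed branch: remember it, try the next
            errors.append((name, exc))
    raise RuntimeError(f"all alternatives failed: {errors}")


# Hypothetical transfer actions standing in for real data movement.
def copy_from_primary():
    raise ConnectionError("primary site unreachable")


def copy_from_mirror():
    return "copied from mirror"


chosen, outcome = run_with_alternatives(
    [("primary", copy_from_primary), ("mirror", copy_from_mirror)]
)
```

This version reacts only to outright failure. Compensating for changes in resource performance, as the milestone also mentions, would need some measure of progress, which is left out here.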

  6. Year 1 Milestones: Request Execution
     • Develop and evaluate a task control language capable of capturing the requirements, preferences, and dependencies of a PVDG request. Implement a prototype interpreter for a basic subset of the language
     • Enhance the "Gang Matching" capabilities of the ClassAd language and add these enhancements to the run-time support library
     • Explore ways to enhance the ClassAd language to support events and triggers
     • Develop a protocol for information exchange between the execution and planning agents

  7. Year 1 Milestones: Virtual Data Toolkit
     • VDT-1 (Basic Grid Services) provides an initial set of grid-enabling services and tools, including security, information, metadata, CPU scheduling, and data transport. VDT-1 will support efficient operation on O(10 TB) datasets, O(100) CPUs, and O(100 MB/s) wide area networks and will build extensively on existing technology.
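To give a feel for the VDT-1 scale targets, the short calculation below estimates how long it would take to move a 10 TB dataset over a 100 MB/s link. It is back-of-the-envelope arithmetic only: the slide gives order-of-magnitude figures, and the sketch assumes decimal units, a fully utilized link, and no protocol overhead.

```python
def transfer_time_hours(dataset_bytes, bandwidth_bytes_per_s):
    """Ideal transfer time in hours, ignoring overhead and contention."""
    return dataset_bytes / bandwidth_bytes_per_s / 3600


TB = 10**12  # decimal terabyte (assumption)
MB = 10**6   # decimal megabyte (assumption)

hours = transfer_time_hours(10 * TB, 100 * MB)
```

Under these assumptions the transfer takes 100,000 seconds, or roughly 28 hours, so moving a full dataset at this scale is a matter of about a day rather than minutes.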

  8. Year 1 Milestones: CMS & LIGO
     • CMS
       • Build basic services and 1-2 prototype Tier 2 centers
       • Complete High Level Trigger milestones and perform studies with ORCA, the CMS object-oriented reconstruction & analysis software
     • LIGO
       • Develop a cataloging approach for data access methods & data location (metadata definition, design)
       • Develop an access and use model for LIGO data across the GriPhyN system

  9. Year 1 Milestones: ATLAS and SDSS
     • SDSS
       • Build a prototype distributed analysis system
     • ATLAS
       • Connect the Athena analysis framework to a set of prototype virtual data services
       • Start with Globus replica catalog service
       • Athena EventSelector service to a replica catalog (reading)
       • Athena Replica catalog update service
       • Testing of basic file replication and transport using 500 GB testbeam data sets.
       • Develop Condor interface to the ATLAS testbed
       • Build basic services and 1-2 prototype Tier 2 centers

  10. Schedule

  11. Schedule

  12. Breakouts/Workshops?
     • Virtual data representations
       • Naming, etc.
     • Simulation strategies and tools
       • UC, CIT, UCB, others?
     • Architecture
       • What are the essential (and missing) pieces
     • Failure models

  13. Workloads
     • Three types: queries, objects, files
     • Koen Holtman’s modeling work
     • LIGO workloads (says Valerie)
