
TeraGrid Program Year 5 Overview



Presentation Transcript


  1. TeraGrid Program Year 5 Overview • John Towns • Chair, TeraGrid Forum • Director, Persistent Infrastructure, National Center for Supercomputing Applications, University of Illinois

  2. Our Vision of TeraGrid • Three-part mission: • support the most advanced computational science in multiple domains • empower new communities of users • provide resources and services that can be extended to a broader cyberinfrastructure • TeraGrid is… • an advanced, nationally distributed, open cyberinfrastructure composed of supercomputing, storage, and visualization systems, data collections, and science gateways, integrated by software services and high-bandwidth networks, coordinated through common policies and operations, and supported by computing and technology experts, that enables and supports leading-edge scientific discovery and promotes science and technology education • a complex collaboration of over a dozen organizations and NSF awards working together to provide collective services that go beyond what can be provided by individual institutions

  3. Strategic Objectives • Objectives determined from considering numerous inputs • user input via various mechanisms • surveys, user contacts, advisory bodies, review panels, etc. • technical input from TG staff • Planning for PY5 started by identifying five high-level project strategic objectives • Enable science that could not be done without TeraGrid • Broaden the user base • Simplify users' lives • Improve operations • Enable connections to external resources

  4. Advanced User Support – PY5 Objectives • Continue to work collaboratively across TG areas to provide advanced support • activities selected based on user input, TRAC recommendations, ad hoc requests, needs identified by staff, etc. • Advanced Support for TeraGrid Applications (ASTA) • maintain ~25 ASTA projects throughout the year • contributions to Science Highlights • Advanced Support for Projects (ASP) • “foundation work” to allow use of scientific codes and applications on the HPC resources • projects with potential to benefit a number (10+) of users in a domain science or TeraGrid users in general • Advanced Support for EOT (ASEOT) • content development and presentation; workshop, panel and BOF participation • outreach to new CI programs: DataNet, iPlant, etc.

  5. User Services – PY5 Objectives • Further improve: • the responsiveness and quality of problem resolution for our users • the feedback mechanisms by which our users become active partners in enhancing the ability of the TeraGrid • 2010 user satisfaction survey will show promptness and quality of user support are rated at least 85%

  6. User Facing Projects and Core Services – PY5 Objectives • Resource Description Repository • further consolidate, integrate resource information in central environment, IIS • simplify tasks for RPs • improve consistency of info for users • Migration to integrated backend for Portal, Web, and Wiki • more monitoring, measurement for users • Work toward user-created portal logins • integration with Shibboleth authentication • IIS-based software catalog • combine CTSS/3rd-party software information • New access tools • job submission, metascheduling in TGUP • Customization & Personalization • personalized, dynamic TGUP Home page • domain views • User Discussion Forums • $PORTAL_HOME file space for TG Users

  7. Science Gateways – PY5 Objectives • Support Services • ability to routinely count end gateway users • create a gateway software listing capability • step-by-step integration instructions for gateways • straightforward process for making use of and securing community accounts • Targeted Support Projects • GridChem • PolarGrid • OSG cloud on TeraGrid via NIMBUS • SIDgrid • SimpleGrid • Earth Systems Science Gateway • Computational Infrastructure for Geodynamics • SCEC and NEES • Cyberinfrastructure for End-to-End Environmental Exploration

  8. Networking, Operations, and Security – PY5 Objectives • Single Sign On • complete PSC deployment of backup MyProxy service • complete integration of Shibboleth support into Internet Framework/TGUP • bridging authorization with OSG and EGEE to support other activities • Maintain infrastructure support for Core Services • TGCDB, POPS, account and allocations management, and the TeraGrid Ticketing System • Inca improvements • “views” based on user needs and output of QA and CUE • interface for RP administrators to execute tests on-demand • Knowledgebase for errors, causes and solutions

  9. RP Operations – PY5 Objectives • New Resources • NICS’ Kraken system upgrade • NCAR’s Sun Ultra 40 system dedicated to data analysis and visualization • Track 2c @ PSC; Track 2d resources • XD Visualization and Data Analysis services/resources • Retiring Resources and TeraGrid Extension • all non-Track 2 resources slated to discontinue as TeraGrid resources after March 31, 2010 • TeraGrid Extension proposal pending • requests continued support for several resources • 12 months: Abe, QueenBee, Lonestar, Steele, Quarry, Lincoln • 3 months: Pople • orderly transition of users to resources with ongoing support

  10. Data and Visualization – PY5 Objectives • Implement Data Architecture recommendations • User Portal integration; Data Collections infrastructure; archival replication services; continued investigation of new location-independent access mechanisms (PetaShare, REDDnet) • Complete production deployments of Lustre-WAN • develop plans for next-generation Lustre-WAN and pNFS technologies • Develop a TeraGrid-wide petascale visualization and data analysis plan • establish policies, documentation and training that provide a comprehensive petascale visualization and data analysis picture to the user community • Expand resources and capabilities in the TG Visualization Gateway via integration of RP tools into TG Vis Gateway • XD Visualization and Data Analysis services/resources

  11. Software Integration – PY5 Objectives • Primary focus still on operational support • Some investment in high-impact software integration activities • expand and streamline Integrated Information Service • more data for meta-schedulers, data for/from SGWs, descriptions of data collections, hardware capabilities, etc. • standard VM, application, and service hosting capabilities • convert prototypes to a standardized capability • finalize SDCI/STCI/CISE support services • support software build and test quality on TeraGrid • improved (CTSS) capability user documentation • reorganize documentation around user-focused use cases

  12. Quality Assurance/Common User Environment – PY5 Objectives • Quality Assurance Working Group • expediting problem resolution when service failures detected • improving the use of the Inca monitoring framework • validate common user environment (with CUE WG) • develop/propose a more formal process for CTSS software deployment (with Software WG) • Common User Environment Working Group • remove barriers to user movement between TeraGrid resources • working with RP staff and TG WG to implement: • CUE Documentation (CUED) • CUE Management System (CUEMS) • CUE Build Environment (CUBE) • CUE Testing Platform (CUETP) • CUE Variable Collection (CUEVC)

  13. Training – PY5 Objectives • Provide live (synchronous) and asynchronous offerings covering all TeraGrid resources and services • expansion of HPC University offerings • conduct at least 50 training sessions, of which 10 will include new content • at least 30 sessions offered synchronously; at least 8 new async topics • Establish mechanism for users to provide TeraGrid with feedback on their evolving training needs • Work with other regional, national and international groups (e.g. OSG, NCSI, DOE, NASA) • avoid duplication of effort

  14. Education and Outreach – PY5 Objectives • Education - continue K-12 and undergrad professional development and curriculum development efforts • SC09 Education Program - 10 summer workshops and Nov event • plan to conduct over 100 workshops, institutes and tutorials • Outreach - expand outreach, especially with under-served communities • expand Campus Champions to > 60 campuses; focus on under-represented campuses (e.g. EPSCoR sites) • TeraGrid Pathways program • conduct at least two petascale workshops with Blue Waters; outreach to at least 20 Professional Society Meetings • Student Engagement - expand opportunities • Student competitions @ TG09, SC09; computational science problem of the week • Graduate Research Fellows seminars and allocations; student internships, REUs and workshops • support at least 300 K-12 students and 500 college students • EOT-wide • EOT highlights • EOT Highlights produced in time for SC09; EOT monthly newsletter launched and being distributed • Evaluation • pilot instruments through April; move into production in May • create database for longitudinal impact studies; conduct first 6-month analysis by December • continue to review and improve metrics of long-term impact

  15. External Relations – PY5 Objectives • Provide easily accessible information • revise the public web site to improve the content, navigation and ease of access to information • Convey the impact of TeraGrid on research, education and society • develop science impact stories • Science Highlights • Work with other regional, national and international groups • e.g. iSGTW and other professional societies • focus on under-represented groups

  16. Project Management – PY5 Objectives • Tracking and reporting • track progress against the IPP • report progress against the IPP quarterly and annually • provide updates as otherwise requested • Change Management • manage change management process for IPP • Planning • anticipate coordinating development of IPP for proposed TeraGrid Extension • timing issue

  17. TeraGrid → TeraGrid Extension (?) → XD Transition Planning • All activity areas have effort reserved for TeraGrid → XD transition effort as appropriate • transition issues exist for nearly all areas • effort earmarked to support transition issues • more was learned about timing after PY5 IPP was developed • Start of XD for CMS/AUSS/TEOS deferred for one year (1 April 2011) • induced TeraGrid Extension Proposal • 12-month funding to support most GIG functions and some non-Track 2 RP resources • uncertainty in sequence of events • still need to address many changes in TeraGrid going into presumed Extension Period • many resources exit TeraGrid • PY5 planning process will likely need to address: • TeraGrid Extension following PY5 • necessarily include transition to XD in extension period

  18. PY5 Total TeraGrid Budget Distribution • TeraGrid budget most closely resembles the RP budgets • RP expenditures ~4x those of the GIG • Bulk of TeraGrid funds go directly to • providing facilities to scientific researchers • assuring they have necessary support to make productive use of them

  19. PY5 Total TeraGrid Budget Detail
