a new era for computational science
Download
Skip this Video
Download Presentation
A New Era for Computational Science

Loading in 2 Seconds...

play fullscreen
1 / 36

A New Era for Computational Science - PowerPoint PPT Presentation


  • 109 Views
  • Uploaded on

A New Era for Computational Science. NPACI Parallel Computing Institute August 28, 2000 Sid Karin Director, NPACI/SDSC [email protected] SDSC. A National Laboratory for Computational Science and Engineering. Leading-Edge Site for NPACI. NPACI. Continuing Evolution. NPACI. SDSC.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'A New Era for Computational Science' - sheila-harris


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
a new era for computational science

A New Era for Computational Science

NPACI Parallel Computing Institute

August 28, 2000

Sid Karin

Director, NPACI/SDSC [email protected]

slide2
SDSC

A National Laboratory for Computational Science and Engineering

Leading-Edge Site for NPACI

continuing evolution
NPACI

Continuing Evolution

NPACI

SDSC

Resources

Resources

Education

Outreach & Training

Enabling technologies

Technology & applications thrusts

Applications

Individuals

Partners

1985

2000

npaci is a highly leveraged national partnership of partnerships
NPACI is a Highly Leveraged National Partnership of Partnerships

46 institutions

20 states

4 countries

5 national labs

Many projects

Vendors and industry

Government agencies

accelerate scientific discovery
MissionAccelerate Scientific Discovery

Through the development and implementationof computationaland computerscience techniques

By creating a ubiquitous, continuous, and pervasive national infrastructure: the grid

changing how science is done
VisionChanging How Science is Done
  • Collect data from digital libraries, laboratories, and observation
  • Analyze the data with models run on the grid
  • Visualize and share data over the Web
  • Publish results in a digital library
embracing the scientific community
Goals: Fulfilling the MissionEmbracing the Scientific Community
  • Capability Computing
    • Provide compute and information resources of exceptional capability
  • Discovery Environments
    • Develop and deploy novel, integrated, easy-to-use computational environments
  • Computational Literacy
    • Extend the excitement, benefits, and opportunities of computational science
partnership organizing principle thrusts
Partnership Organizing Principle: “Thrusts”

Computational Literacy

EOT

Discovery Environments

TECHNOLOGIES

Discovery Environments

APPLICATIONS

Metasystems

Programming Tools & Environments

Data-intensive Computing

Interaction Environments

Molecular Science

Neuroscience

Earth Systems Science

Engineering

Capability Computing

RESOURCES

projects meld applications and technology
Projects Meld Applications and Technology

Brain databases

Data-IntensiveComputing+Neuroscience

Metasystems andParallel Tools

+

Engineering

leadership team
Leadership Team

Sid Karin, [email protected]

Peter Arzberger, SDSC Executive [email protected]

Paul Messina, CaltechChief Architect(on leave)

Susan Graham, UC Berkeley Chief Computer [email protected]

Peter Taylor, SDSCChief Applications [email protected]

Wayne Pfeiffer, SDSCDeputy [email protected]

Greg Moses, U WisconsinEducation, Outreach, and Training [email protected]

npaci executive committee
NPACI Executive Committee

Andrew Grimshaw, U VirginiaMetasystems

Joel Saltz, U MarylandProgramming Tools and Environments

Reagan Moore, SDSCData-Intensive Computing

Arthur Olson, TSRIInteraction Environments

William Martin, U MichiganResource Representative

Russ Altman, Stanford UMolecular Science

Mark Ellisman, UCSDNeuroscience

Bernard Minster, UCSD (SIO)Earth Systems Science

Tinsley Oden, U Texas (TICAM)Engineering

James Pool, CaltechResource Representative

Leadership Team plus:

Aron Kuppermann, Caltech

User Representative

npaci oversight
NPACI Oversight

Institutional Oversight Board

External Visiting Committee

Director’s Advisory Committee

Users’ Advisory Committee

EVC

UAC

IOB

DAC

ExecutiveCommittee

Leadership Team

Resource Partner Representatives

Technologies

Thrust Leaders

Applications

Thrust Leaders

budget balance
EOT

Applications

Technologies

Resources

Budget Balance

SDSC

Partners

complementary roles of five compute resource sites
Complementary rolesof five compute resource sites
  • Leading-edge site (SDSC)
    • Very high-performance resources
    • IBM SP teraflops system
  • Mid-range sites (U Texas & U Michigan)
    • Smaller systems compatible with LES
    • Support for applications with limited scalability, large-memory jobs, application development, OS testing, and education
  • Alternate architecture & research systems
    • Caltech, UC Berkeley, SDSC
    • Support for leading-edge applications, thrusts, and evaluation
leading edge site supercomputer roadmap
14

10

13

10

World's fastest

12

10

supers

Peak speed (flops)

11

10

SDSC's

vector

supers

10

10

9

10

8

10

1980

1985

1990

1995

2000

2005

Year installed

Leading-Edge Site Supercomputer Roadmap

1 TFLOPSIBM SP1999

npaci s balanced complement of high end resources for 2000
NPACI’s balanced complement ofhigh-end resources for 2000
  • Compute resources (SDSC & 4 partners)
    • IBM SP Teraflops system at SDSC
    • Complementary systems at partner sites
  • Data resources (SDSC & 10 partners)
    • >180 TB mass store at SDSC
    • >100 GB data sets at partner sites
  • Network resources (SDSC & all partners)
    • >100 Mbps access to compute & data resources
    • Communications backbone for metacomputing
ibm selected as first npaci teraflops vendor
IBM Selected as First NPACI Teraflops Vendor
  • Strong commitment to high end by IBM
  • Technology being developed through ASCI
  • SDSC has largest system in US academia
  • Growing partnership with IBM
1 st teraflops system for us academia
1st Teraflops System for US Academia

Nov 1999

  • 1 TFLOPs IBM SP
    • 144 8-processor compute nodes
    • 12 2-processor service nodes
    • 1,176 Power3 processors at 222 MHz
    • > 640 GB memory (4 GB/node), upgrade to > 1 TB later
    • 6.8 TB switch-attached disk storage
  • Largest SP with 8-way nodes
  • High-performance access to HPSS
  • Trailblazer switch interconnect with subsequent upgrade
current large sp allocations
Fundamental Physics

T. Kinoshita, Cornell University

R. Sugar, UC Santa Barbara

Ab initio Biochemistry

H. Scheraga, Cornell University

A. McCammon, UC San Diego

M. Klein, Univ. of Pennsylvania

M. Gordon, Iowa State University

Biomedicine

A. Garfinkel, UCLA

B. Pettitt, University of Houston

Materials Science

F. Abraham, IBM Almaden

J. Kim, Ohio State University

Fluid Dynamics

K. Gubbins, Cornell University

J. Kim, UCLA

G. Karniadakis, Brown University

Astrophysics

P. Hauschildt, Univ. of Georgia

J. Raeder, UCLA

M. Ashour-Abdalla, UCLA

Current Large SP Allocations
npaci alpha projects
NPACI “alpha” projects
  • Bioinformatics Infrastructure for Large-Scale Analyses
  • Protein Folding in a Distributed Computing Environment
  • Telescience for Advanced Tomography Applications
  • Multi-Component Models for Energy and the Environment
  • Scalable Visualization Toolkits for Bays to Brains
bioinformatics infrastructure for large scale analyses
Bioinformatics Infrastructure for Large-Scale Analyses
  • Next-generation tools for accessing, manipulating, and analyzing biological data
    • Russ Altman, Stanford University
    • Reagan Moore, SDSC
  • Analysis of Protein Data Bank, GenBank and other databases
  • Accelerate key discoveries for health and medicine
protein folding in a distributed computing environment
Protein Folding in a Distributed Computing Environment
  • Simulating protein movement governing reactions within cells
    • Andrew Grimshaw, U Virginia
    • Charles Brooks, The Scripps Research Institute
    • Bernard Pailthorpe, UCSD/SDSC
  • Computationally intensive
  • Distributed computing power from Legion
telescience for advanced tomography applications
Telescience for Advanced Tomography Applications
  • Integrates remote instrumentation, distributed computing, federated databases, image archives, and visualization tools.
    • Mark Ellisman, UCSD
    • Fran Berman, UCSD
    • Carl Kesselman, USC
  • 3-D tomographic reconstruction of biological specimens
multi component modeling for energy and the environment
Multi-Component Modeling for Energy and the Environment
  • Simulating contaminant movement through ecosystems
    • Leaders: Joel Saltz, U Maryland and Johns Hopkins U; Mary Wheeler, U Texas
  • Will assist environmental cleanup efforts and strategies
  • Engineering and environmental models linked through metasystems and data manipulation tools
scalable visualization toolkits
Scalable Visualization Toolkits
  • Vast data collections and large-scale simulations require scalable visualization tools
    • Art Olson, The Scripps Research Institute
    • Bernard Pailthorpe, SDSC/UCSD
    • Art Toga, UCLA
    • Carl Wunsch, MIT
  • 3-D reconstruction, time-dependent modeling
examples of additional projects
Examples of Additional Projects
  • NPACI and SDSC activities
mice transparent supercomputing
MICE: Transparent Supercomputing
  • Molecular Interactive Collaborative Environment
  • Gallery allows researchers, students to search for, visualize, and manipulate molecular structures
  • Integrates key SDSC technological strengths
    • Biological databases
    • Transparent supercomputing
    • Web-based Virtual Reality Modeling Language
the protein data bank
The Protein Data Bank
  • World’s single scientific resource for depositing and searching protein structures
  • Protein structure data growing exponentially
    • 10,500 structures in PDB today
    • 20,000 by the year 2001
  • Vital to the advancement of biological sciences
  • Working towards a digital continuum from primary data to final scientific publication
  • Capture of primary data from high-energy synchrotrons (e.g. Stanford Linear Accelerator Center) requires 50Mbps network bandwidth

1CD3: The PDB’s 10,000th structure.

new mode of visualization
New Mode of Visualization
  • Network-accessible “TeleManufacturing”
    • 3-D hardcopy for visualization
  • Used by many disciplines
    • Molecules to Hurricanes
    • Death Valley to Venus
    • Reimann Zeta Function to Ozone Hole
digital galaxy
Collaboration with Hayden Planetarium

American Museum of Natural History

Support from NASA

Linking SDSC’s mass storage to Hayden Planetarium requires 155 Mbps

MPIRE Galaxy Renderer

Scalable volume visualization

Linked to database of astronomical objects

Produces translucent, filament-like objects

An artificial nebula, modeled after a planetary nebula

Digital Galaxy
the digital sky
The Digital Sky
  • Billions of objects can be detected with optical, infrared, and radio telescopes
    • Tens of terabytes of image and catalog data
  • Digital Sky federating four sky surveys to allow multi-wavelength studies across the data sets
    • DPOSS, 2MASS, NVSS, FIRST
    • Tom Prince, Caltech, leading federation effort
    • Uses MIX, SDSC SRB, and NPACI mass storage systems

A globular cluster from the DPOSS archive. Such clusters provide a minimum age for the universe. Image by Thomas Handley, Caltech.

looking out for san diego s regional ecology
Looking out for San Diego’s Regional Ecology
  • Unique partnership
    • 31 federal, state, regional,and local agencies
    • John Helly, et al., SDSC
  • Combines technologies and multi-agency data
    • Sensing, analysis, VRML
    • Physical, chemical, and biological data
  • Web-based tool for science and public policy
amico the art of managing art
AMICO: The Art of Managing Art
  • Art Museum Image Consortium (AMICO)
    • 28 art museums working toward educational use of digital multimedia
  • Launch of the AMICO Library includes more than 50,000 works of art
    • AMICO, CDL, SDSC
    • XML information mediation
    • SDSC SRB data management
    • Links between images, scholarly research, educational material
mapping the net s terra incognita
Mapping the Net’s Terra Incognita

Nature: Web Matters, 1/7/99. Science 10/16/98

ad