grid deployments and cyberinfrastructure n.
Skip this Video
Loading SlideShow in 5 Seconds..
Grid Deployments and Cyberinfrastructure PowerPoint Presentation
Download Presentation
Grid Deployments and Cyberinfrastructure

Loading in 2 Seconds...

play fullscreen
1 / 23

Grid Deployments and Cyberinfrastructure - PowerPoint PPT Presentation

  • Uploaded on

Grid Deployments and Cyberinfrastructure. Andrew J. Younge 102 Lomb Memorial Drive Rochester, NY 14623 How Do we Make Use of These Tools?. Grid Hierarchy . Cluster Systems. All batch queuing systems!. PBS – Portable Batch System.

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
Download Presentation

Grid Deployments and Cyberinfrastructure

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
grid deployments and cyberinfrastructure

Grid Deployments and Cyberinfrastructure

Andrew J. Younge

102 Lomb Memorial Drive

Rochester, NY 14623

cluster systems
Cluster Systems

All batch queuing systems!

pbs portable batch system
PBS – Portable Batch System
  • Used for dedicated Cluster resources
    • Homogeneous clusters with MPI
  • Manages thousands of CPUs in near real time
  • Schedules large numbers of jobs quickly and efficiently
  • Many different implementations
    • PBS Pro (not free but advanced)
    • Open PBS (free but old)
    • Torque & Maui (free, stable, advanced)
  • Deployments
    • Dedicated clusters in academic and corporate settings
    • Playstation3 Clusters
  • Used for dedicated and non-dedicated resources
    • Typically used to “scavenge” CPUs in places where a lot of workstations are available
    • Heterogeneous environments
  • Separate Condor tasks – Resource Management and Job Management
  • User interface is simple; commands that use small config files
  • Not good for MPI jobs
  • Deployments
    • Campus workstations and desktops
    • Corporate servers
grid tools in condor
Grid tools in Condor
  • Condor-G
    • Replicates the Job Management functionality
    • Submission to a grid resource using the Globus Toolkit
    • NOT a grid service, just a way to submit to a grid
  • Flocking
    • Allows for queued jobs in one Condor cluster to be executed on another Condor cluster
    • Directional flocking (A => B but not B => A)
    • Unidirectional flocking (A <=> B)
  • Glidein
    • Dynamically adds machines to a Condor cluster
    • Can be used to create your own personal Condor cluster on the Teragrid!
  • Desktop based Grid Computing - “Volunteer Computing”
    • Centralized Grid system
    • Users encouraged by gaining credits for their computations
    • Can partake in one or many different projects
  • Open access for contributing resources, closed access for using grid
  • Allows organizations to gain enormous amounts of computational power with very little cost.
  • BOINC is really a cluster and a grid system in one!
boinc projects 2
BOINC Projects (2)

Full List of Projects:

the open science grid osg
The Open Science Grid - OSG
  • Large national-scale grid computing infrastructure
      • 5 DOE Labs, 65 Universities, 5 regional/campus grids
      • 43,000 CPUs, 6 Petabytes of disk space
  • Uses the Globus Toolkit
    • GT4, however uses pre-WS services (GT2)
    • Typically connects to Condor pools
  • Virtual Data Toolkit (VDT) & OSG Release Tools
    • NMI + VOMS, CEMon, MonaLisa, AuthZ, VO management tools, etc
    • VORS – Resource Selector:
the teragrid
The TeraGrid
  • NSF-funded national-scale Grid Infrastructure
    • 11 Locations – LONI, NCAR, NCSA, NICS, ORNL, PSC, IU, PU, SDSC, TACC, UC/ANL
    • 1.1Petaflops, 161 thousand CPUs, 60 Petabytes disk space
    • Dedicated 10G fiber lines to each location
    • Specialized visualization servers
  • Uses Globus Toolkit 4’s basic WS services and security protocols
  • Grid Infrastructure Group (GIG) at U. Chicago
    • Commity for Teragrid planning, management, and coordination
  • Science Gateways
    • Independent services for specialized groups and organizations
    • “TeraGrid Inside” capabilities
    • Web Portals, desktop apps, coordinated access points
    • Not Virtual Organizations (VOs)
teragrid overview
TeraGrid Overview

Grid Infrastructure Group (UChicago)















Resource Provider (RP)

Software Integration Partner

  • European Commision funded International Grid system
    • 250 resource locations, 40,000 CPUs, 20 Petabytes of storage
    • Originally European grid, but expanded to US and Asia
  • Uses the gLite Middleware system
    • Uses Globus’ Grid Security Infrastructure (GSI)
    • Specialized elements to utilize underlying hardware
    • Groups organized as Virtual Organizations (VOs) and uses VOMS membership services to enable user privileges
  • Originally based on the old LHC Grid
    • EGEE-I Ended April, 2006. Continued on as EGEE-II
    • Now part of WLCG
worldwide lhc computing grid
Worldwide LHC Computing Grid
  • Large grid to support the massive computational needs of the Large Hadron Collider at CERN
    • Project produces >15 Petaflops per year!
  • WLCG is really a mashup of other grids
    • EGEE, OSG, GridPP, INFN Grid, NorduGrid
    • Uses specialized upperware to manage these grids
  • Multi-tier system for efficiently distributing data to scientists and researchers around the world
  • Used mostly for ATLAS, ALICE, CMS, LHCb, LHCf and TOTEM experiments
cyberaide shell
Cyberaide Shell
  • There are many different cyberinfrastructure deployments today.
    • How do we make sence of them?
    • How do we use them for our benefit?
  • Our idea for Cyberaide Gridshell will be to link to these grid deployments
    • Provide an easy, all-in-one interface for many different grids
    • Automate scheduling and resource management
    • Leverage Web 2.0 technologies