
Overview of Research Computing, ITS Research Computing, Mark Reed



Presentation Transcript



Overview of Research Computing ITS Research Computing

Mark Reed



Overview – Research Computing

  • Resources

  • Services

  • Projects



ReCo Resources

  • Computational Resources

    • compute clusters: Killdevil, Kure

    • Special purpose servers:

      • galaxy, bioapps, sapientia, ICISS, eruditio

  • Software

    • licensed

    • open source

  • Data Storage

  • Virtual Computing Lab (VCL)

  • Access to National Resources



ReCo Services

  • Technical Support

  • Training and Development

  • Engagement and Collaboration

  • Research Database Support

  • Secure Data Exchange

  • Data Grids – iRODS

  • Desktop Support - THL



ReCo Projects

  • EFRC

  • HTS and Seqware

  • Digital Humanities



Resources



Compute Cluster Advantages

  • fast interconnect, tightly coupled

  • aggregated resources

    • compute cores

    • memory

  • installed software base

  • high availability

  • large (scratch) file spaces

  • scheduling and job management

  • data backup



Multi-Purpose Killdevil Cluster

  • High Performance Computing

    • Large parallel jobs, high speed interconnect

  • High Throughput Computing (HTC)

    • high volume serial jobs

  • Large memory jobs

    • special nodes for extreme memory

  • GPGPU computing

    • computing on Nvidia processors



Killdevil Nodes

  • Three types of nodes:

    • compute nodes

    • large memory nodes

    • GPGPU nodes



Killdevil Compute Cluster

  • InfiniBand 4x QDR interconnect

  • Priority usage for patrons

  • Buy-in is cheap

  • Storage

    • large Lustre scratch file system, IB connected

    • /netscr

  • Heterogeneous Research Cluster

  • Dell Blades

  • 700+ compute nodes, mostly

    • Xeon 5670 2.93 GHz

    • 9600 cores

    • Nehalem Microarchitecture

    • Dual socket, hex-core and oct-core

    • 48 GB memory

    • some higher memory nodes

  • GPGPU Nodes

    • 64 Nvidia Tesla M2070

  • Extreme Memory Nodes

    • two 1 TB nodes, 32 cores



Kure

  • An HPC/HTC research compute cluster in RC

  • Named after the beach in North Carolina

  • Pronounced like the name of the Nobel Prize-winning physicist and chemist Madame Curie



Kure Compute Cluster

  • Priority usage for patrons

  • Buy-in is cheap

  • Storage

    • /netscr, /proj

  • Heterogeneous Research Cluster

  • Hewlett Packard Blades

  • 200+ compute nodes, mostly

    • Xeon 5560 2.8 GHz

    • Nehalem Microarchitecture

    • Dual socket, quad core

    • 48 GB memory

    • over 1800 cores

    • some higher memory nodes

  • InfiniBand 4x QDR
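
As a rough consistency check on the Kure figures above, the core count follows from nodes × sockets × cores per socket. The sketch below assumes every node is dual-socket, quad-core (the slide only says "mostly"), and the helper name is made up for illustration.

```python
import math


def cores_per_node(sockets: int, cores_per_socket: int) -> int:
    """Physical cores in one node."""
    return sockets * cores_per_socket


# Dual socket, quad core, as listed on the slide.
KURE_CORES_PER_NODE = cores_per_node(sockets=2, cores_per_socket=4)

# Lower bound from "200+" nodes, assuming all nodes have this layout.
min_cores = 200 * KURE_CORES_PER_NODE

# Number of such nodes needed to reach the quoted 1800 cores.
nodes_for_1800 = math.ceil(1800 / KURE_CORES_PER_NODE)

print(KURE_CORES_PER_NODE, min_cores, nodes_for_1800)
```

Under that assumption, "over 1800 cores" corresponds to somewhat more than 200 nodes, which is compatible with the "200+" figure.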



Getting an account:

For Kure, Killdevil and Mass Storage

  • http://onyen.unc.edu

  • Subscribe to Services



Resources: Available Software



Licensed Software

  • over 20 licensed software applications (some are site or volume licensed, others restricted)

    • SAS, Matlab, Maple, Mathematica, Gaussian, Accelrys Materials Studio and Discovery Studio modules, Sybyl, Schrodinger, Stata, ArcGIS, NAG, IMSL, Totalview, Envi/IDL, JMP, and JMP Genomics

  • compilers (licensed and otherwise)

    • Intel, PGI, GNU, CUDA compiler



Large Installed Software Base

  • Numerous other packages provided for research and technical computing

    • including BLAST, PyMOL, SOAP, PLINK, NWChem, R, Cambridge Structural Database, Amber, GROMACS, PETSc, ScaLAPACK, NetCDF, Babel, Qt, Ferret, Gnuplot, Grace, iRODS, XCrySDen, and many more.



Mass Storage

  • long-term archival storage

  • easy to access and use

  • “limitless” capacity

  • 2 TB free

  • looks like an ordinary disk file system; data is actually stored on tape

  • data is backed up

Recently Upgraded!

“To infinity … and beyond” - Buzz Lightyear



Virtual Computing Lab (VCL)

  • Collaboration with NC State to establish VCL infrastructure for UNC.

  • VCL provides on-demand access to high-end computing resources, via highly customized, virtual Windows and Linux machines.



Virtual Computing Lab (VCL)

  • Users can log on from anywhere at any time to make a reservation to use a machine

  • Lots of software available!

    • ArcGIS

    • SAS

    • MATLAB

    • Adobe

    • MS Office

    • LaTeX

    • SigmaPlot

    • MUCH MORE!

Go to http://vcl.unc.edu to sign on

For help, see “Getting Started on VCL” webpage http://help.unc.edu/CCM3_007680



Access to National Resources

  • XSEDE – NSF-funded, leadership-class infrastructure at 11 partner sites.

  • Open Science Grid – national shared computing and storage resources in a common grid infrastructure



Services



Services: Training

  • Courses are offered in the following areas:

    • Introductions to HPC resources

    • Research Applications

    • Linux

    • General Computing

    • Parallel Programming

  • Courses are taught throughout the year by Research Computing. For listings and details, go to:

    • http://learnit.unc.edu/workshops

    • http://help.unc.edu/CCM3_008194



Services: Technical Support

  • Technical support in using RC resources is available

    • Support in compiling, porting, using tools, submitting jobs, using software packages, storage and data management, …

  • online web forms

  • email [email protected]

  • 962-HELP (962-4357)

  • personal consultation



Engagement, Support and Collaboration

  • Research scientists with experience in computational chemistry, physics, grid computing, environmental modeling, mathematics, parallel computing and the life sciences are available for consultation and collaboration.

  • Digital Humanities Specialist

  • Extensive technical support for utilizing research computing resources.



Services: Secure Data Exchange

  • Capability to share secure and sensitive data using a secure “drop box” mechanism for anonymous or non-Onyen users, or full FTP access for trusted Onyen accounts

  • Computing - challenges of flexibility needed for research and realities of cyber attacks

  • Networking – maximizing bandwidth for research endeavors vs. IPS/IDS inspection

  • Data – compliance requirements, data sharing, privacy, etc.



Services: Data Grids – iRODS

  • Distributed data storage using the Integrated Rule-Oriented Data System (iRODS). iRODS provides scientists with a secure, scalable system that can support many aspects of research data management.

  • Enables data grids/repositories whose policies are implemented and enforced through rules

Research Computing is experimenting with hosting iRODS collections as a service.

Collaborating with UNC Libraries, Institute for the Environment, and RENCI.

www.irods.org
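
To make "policies implemented and enforced through rules" concrete, here is a toy sketch of a rule-driven policy check. It is not the iRODS rule language or any iRODS API; the class names, the two example rules, the path, and the size threshold are all hypothetical and chosen only to show the pattern of condition-plus-action rules applied to a data object.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class DataObject:
    """A stored file plus its descriptive metadata (hypothetical model)."""
    path: str
    size_bytes: int
    metadata: Dict[str, str] = field(default_factory=dict)


@dataclass
class Rule:
    """A policy rule: if `applies` is true for an object, run `action`."""
    name: str
    applies: Callable[[DataObject], bool]
    action: Callable[[DataObject], None]


def enforce(rules: List[Rule], obj: DataObject) -> List[str]:
    """Apply every matching rule in order; return the names that fired."""
    fired = []
    for rule in rules:
        if rule.applies(obj):
            rule.action(obj)
            fired.append(rule.name)
    return fired


def tag_restricted(obj: DataObject) -> None:
    obj.metadata["access"] = "restricted"


def tag_replicate(obj: DataObject) -> None:
    obj.metadata["replicas"] = "2"


# Example policies (assumptions, not real site policy).
RULES = [
    Rule("restrict-sensitive", lambda o: "/sensitive/" in o.path, tag_restricted),
    Rule("replicate-large", lambda o: o.size_bytes > 1_000_000_000, tag_replicate),
]

obj = DataObject("/projects/sensitive/cohort.csv", 2_000_000_000)
fired = enforce(RULES, obj)
print(fired, obj.metadata)
```

The point of the pattern is that policy lives in the rule list rather than in each client program, so changing a policy means editing rules, not applications.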



Desktop Computing – TarHeel Linux

Linux Image Pull

  • Desktop/Laptop Campus Machines

  • Build desktop machines tailored for the RC environment with additional customization by user.

  • Based on CentOS

  • Security Approved Build

    • nightly updates

  • Onyen

  • OpenAFS

  • Customized Applications

  • Firewall

  • http://tarheellinux.unc.edu

Kickstart Server for Linux Distribution in ITS Manning Machine Room



Services: Research Database Support

  • Full-time DB admin to support UNC research databases

  • over 20 UNC Research Databases for research production, training and development

    • clients include School of Pharmacy, Lineberger Comprehensive Cancer Center (LCCC), Computer Science, SILS, RENCI, Bioinformatics, Institute for the Environment, …



Projects



Energy Frontier Research Centers

http://www.er.doe.gov/bes/EFRC/index.html



Chemical Approaches to Artificial Photosynthesis. Modular Approach

Light absorption, sensitization

Electron transfer quenching

Vectorial electron/proton transfer, redox splitting

Catalysis of water oxidation and reduction

Photosystem II

Meyer, Accounts of Chemical Research 1989, 22, 163.

Meyer et al., Inorg. Chem. 2005, 6802; Acc. Chem. Res. 1989, 163.



High Throughput Sequencing

  • The High Throughput Sequencing Facility (HTSF) provides core services primarily for:

    • Lineberger Comprehensive Cancer Center (LCCC) and the TCGA (The Cancer Genome Atlas) project

    • RENCI – NIDA project (National Institute on Drug Abuse)

    • UNC life sciences



High Throughput Deep Sequencing Infrastructure

  • ~20 NextGen sequencers

    • Illumina HiSeq, Ion Torrent, …

  • RNAseq pipeline

  • DNAseq pipeline

  • Whole Genome pipeline

  • ChIP/FAIREseq pipeline

  • De novo assembly

  • Specialized Workflow Engine, Condor, LSF scheduling



High Throughput Deep Sequencing Infrastructure

[Diagram labels: Data Collection Infrastructure; Isilon (1.7 PB); Aggregation Server; Compute Nodes; MaPSeq meta scheduler running multiple pipelines; Pipeline Manager; Processing Pipeline]



  • TCGA is a project to catalog genetic mutations responsible for cancer. UNC is one of twelve national centers.

  • Processed over 4500 samples in support of TCGA to date

  • Have processed over 700 samples in a week

  • Goal is to process 10,000 unique samples total over five years
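
The sample counts above imply some simple throughput arithmetic, sketched below. It uses only the figures quoted on the slide; treating the best observed week (700 samples) as a sustainable rate and taking a year as 52 weeks are assumptions made for the illustration.

```python
import math

processed = 4500          # samples processed to date
goal = 10_000             # unique samples targeted over five years
peak_per_week = 700       # best observed weekly throughput

# Samples still to process to reach the goal.
remaining = goal - processed

# Weeks needed if the peak weekly rate could be sustained (assumption).
weeks_at_peak = math.ceil(remaining / peak_per_week)

# Average weekly rate needed to hit the goal over five 52-week years.
avg_per_week_for_goal = goal / (5 * 52)

print(remaining, weeks_at_peak, round(avg_per_week_for_goal, 1))
```

In other words, the peak weekly rate is well above the average rate the five-year goal requires.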



Lumbee Familial Political Factions

Malinda Maynor Lowery, History



Brooklyn Renaissance Social Graph

Melissa Bullard, History



Ancient World Mapping Application



Questions and Comments?

  • For assistance with any of our services, please contact Research Computing

    • Email: [email protected]

    • Phone: 919-962-HELP

    • Submit help ticket at http://help.unc.edu

