high performance computing cluster hpcc n.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
High Performance Computing Cluster (HPCC) PowerPoint Presentation
Download Presentation
High Performance Computing Cluster (HPCC)

Loading in 2 Seconds...

play fullscreen
1 / 10

High Performance Computing Cluster (HPCC) - PowerPoint PPT Presentation


  • 68 Views
  • Uploaded on

High Performance Computing Cluster (HPCC). Mary Galvin Managing Principal, American Innovations Consulting http :// www.aicnova.com https:// www.linkedin.com/pub/mary-galvin/15/340/397. Big Data at LexisNexis. History of the HPCC.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'High Performance Computing Cluster (HPCC)' - yeo-diaz


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
high performance computing cluster hpcc

High Performance Computing Cluster (HPCC)

Mary Galvin

Managing Principal, American Innovations Consulting

http://www.aicnova.com

https://www.linkedin.com/pub/mary-galvin/15/340/397

slide3

History of the HPCC

Designed and Developed from the Ground-Up to Meet LexisNexis’ Internal Big Data Needs.

The Idea of Releasing the HPCC to the OSS Community was Presented to LexisNexis Corporate Management.

The Spread of HPCC Users has Gone Global, and as a Result, Innovation Ignites.

Google’s MapReduce Paper is Published.

2004

2007

2001

Late 90s/Early 2000s

2009

2011

2012

United States Government Sought After Getting LexisNexis’ Data Capabilities In-House for their Internal Data Mining Needs.

The HPCC is Officially Released to the Open Source Community!

First Release of Hadoop Available (designed after Map Reduce Papers).

ecl overview
ECL Overview

Task: Produce a set of records wherein a particular field contains a specific set of values

Typical approach for solving this in many programming languages

ecl overview cont d
ECL Overview (cont’d)

Task: Produce a set of records wherein a particular field contains a specific set of values

Approach for solving this problem in ECL

hpcc modules plugins
HPCC Modules & Plugins
  • Other
    • H2H Connector
    • Machine Learning Module
    • R Integration
    • Eclipse IDE
    • JDBC Driver
    • ……..
  • Scalable Automated Linking Technology (SALT)
    • Data Ingest
    • Data Profiling
    • Data Hygiene
    • Clustering
    • Relationship Extraction
  • Exploratory Data Analysis (EDA) Toolkit
hpcc academic program
HPCC Academic Program
  • Audience: Colleges and Universities
  • Benefits:
    • Internship opportunities
    • Invitation-only conferences
    • Free training for qualifying projects
    • Access to an external cluster, as available
additional learning options
Additional Learning Options
  • Online:
    • Includes both prerequisites and tailored courses depending on role type (ie, developers, analysts, and administrators)
    • http://hpccsystems.com/community/training-videos
  • In-Person:
    • http://hpccsystems.com/community/training-events/training