Shivnath Babu

CPS216: Advanced Database Systems (Data-intensive Computing Systems)Introduction to MapReduce and Hadoop Shivnath Babu

Word Count over a Given Set of Web Pages • see 1 • bob 1 • throw 1 • see 1 • spot 1 • run 1 • bob 1 • run 1 • see 2 • spot 1 • throw 1 • see bob throw • see spot run Can we do word count in parallel?

The MapReduce Framework (pioneered by Google)

Automatic Parallel Execution in MapReduce (Google) Handles failures automatically, e.g., restarts tasks if a node fails; runs multiples copies of the same task to avoid a slow task slowing down the whole job

MapReduce in Hadoop (1)

Data Flow in a MapReduce Program in Hadoop  1:many • InputFormat • Map function • Partitioner • Sorting & Merging • Combiner • Shuffling • Merging • Reduce function • OutputFormat

Map function Reduce function Run this program as a MapReduce job Lifecycle of a MapReduce Job

Lifecycle of a MapReduce Job Time Reduce Wave 1 Input Splits Reduce Wave 2 Map Wave 1 Map Wave 2 How are the number of splits, number of map and reduce tasks, memory allocation to tasks, etc., determined?

190+ parameters in Hadoop Set manually or defaults are used Job Configuration Parameters

How to sort data using Hadoop?

Let us look at a complete example MapReduce program in Hadoop

Shivnath Babu

Shivnath Babu

Presentation Transcript

Varudu - Mahesh Babu

Mahesh Babu - Telugu Actor

Babu Banarsi Das Institute of Technology, Duhai , Ghaziabad

Satish Babu

Colored Zee- Babu Model @ LHC & low energy experiments

PFM Architecture: Concept and Processes Babu Ram Subedi

John Ulimwengu Catherine Ragasa Suresh Babu

Srinu Babu . Matta Research Scholar

Adimulam Vinay Babu

Pradeep Kumar Gunda (Thanks to Jigar Doshi and Shivnath Babu for some slides)

Shivnath Babu

Herodotos Herodotou Shivnath Babu Duke University

Mahesh Babu Biography | Biography Of Mahesh Babu

Trending Mahesh Babu Dressing Style

Babu Banarsi Das University: A Center of Excellence

Short Story of Madhu Babu Dandamudi

Astrologer Shivnath Ji

Babu Ram Dawadi

Venkatesan Chakrapani, Priya Babu, Timothy Ebenezer

Shivnath Babu

Christine Babu Live

Shivnath Babu

Shivnath Babu

Presentation Transcript

Varudu - Mahesh Babu

Mahesh Babu - Telugu Actor

Babu Banarsi Das Institute of Technology, Duhai , Ghaziabad

Satish Babu

Colored Zee- Babu Model @ LHC &amp; low energy experiments

PFM Architecture: Concept and Processes Babu Ram Subedi

John Ulimwengu Catherine Ragasa Suresh Babu

Srinu Babu . Matta Research Scholar

Adimulam Vinay Babu

Pradeep Kumar Gunda (Thanks to Jigar Doshi and Shivnath Babu for some slides)

Shivnath Babu

Herodotos Herodotou Shivnath Babu Duke University

Mahesh Babu Biography | Biography Of Mahesh Babu

Trending Mahesh Babu Dressing Style

Babu Banarsi Das University: A Center of Excellence

Short Story of Madhu Babu Dandamudi

Astrologer Shivnath Ji

Babu Ram Dawadi

Venkatesan Chakrapani, Priya Babu, Timothy Ebenezer

Shivnath Babu

Christine Babu Live

Colored Zee- Babu Model @ LHC & low energy experiments