1 / 8

Hadoop And bIg data syllabous

Big Data Hadoop Training in Pune is available with Itelligence in different training formats. In our first format we provide hadoop training in classroom.

Download Presentation

Hadoop And bIg data syllabous

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. SUBJECT Big Data And HadoopSyllabus

  2. Introduction to Bigdata • Need for Bigdata Analytics. • Why Bigdata with hadoop? • History Of Hadoop. • Node, Rack , Cluster. • Architecture of Hadoop. • Characteristics of Namenode. • Significance of JobTracker and Tasktrackers. • hase co-ordinatiaon with JobTracker. • ScondaryNamenode usage and workaround. • Hadoop releases and their significance. • Workaround with datanodes. • YARN architecture. • Significance of scalability of operation. • Use cases where not to use Hadoop. • Use cases where Hadoop Is used. • Facebook,Twitter,Snapdeal, Flipkart.

  3. Hadoop Java API • Mapper Class and its Methods • What is Partitioner and types • Hadoop specific datatypes • Working on unstructured data analytics • What is an iterator and its usage techniques • Types of mappers and reducers • What is output collector and its significance • Workaround with Joining of datasets • Complications with mapreduce • Mapreduce anatomy • Anagram example,TeragenExample,Terasort Example • WordCount Example • Working with multiple mappers • Working with weather data on multiple datanodes in a Fullydistributed architecture • UseCases where mapreduce anatomy fails • Interview questions based on JAVA mapreduce

  4. Hbase & Zookeeper • HbaseArchitechtural point of view • Regionservers and their implementation • Client API's and their features  • How messaging system works • Columns and column families • Configuring hbase-site.xml • Available Client  • Loading Hbase with semi-structured data • Internal data storage in hbase • Timestamps • HBase Architecture  • Creating table with column families • MapReduce Integration. • HBase: Advanced Usage, Schema Design • Load data from pig to hbase • Sqoop architecture

  5. Yarn Architecture • Introduction to YARN and MR2 daemons. • Active and Standby Namenodes • Resource Manager and Application Master • Node Manager • Container Objects and Container • Namenode Federation • Cloudera Manager and Impala • Load balancing in cluster with namenode federation • Architectural differences between Hadoop 1.0 and 2.0

  6. Hadoop on Amazon Cloud • Introduction to cloud infrastructure. • Amazon SaaS, Paas and IaaS. • Creating EC2 instance for processing. • Creating S3 buckets • Deploying data on to the cloud. • Choosing size of our instance. • Configuration of EMR instance • Creating a virtual cluster on amazon • Deploying project and getting stats.

  7. Flume  • Introduction to Flume data integration • Flume installation on single node and multinode cluster • Flume architecture and various components • Data sources types and variants • Data target types and variants • Deploying an agent onto a single node cluster. • Problems associated with flume • Interview questions based on flume

  8. THANK YOU

More Related