1 / 17

Hadoop MapReduce vs Spark | Hadoop Tutorial For Beginners | Hadoop & Spark Tutorial | Edureka

( ** Apache Spark Training - https://www.edureka.co/apache-spark-s... ** ) <br>This Edureka tutorial on MapReduce vs Spark will help you to understand the differences between MapReduce and Spark by comparing them on various parameters like: <br><br>1. Current Market Situation <br>2. Hadoop Map-Reduce vs Apache Spark <br>a. Performance <br>b. Ease of Use <br>c. Cost <br>d. Data Processing <br>e. Security <br>f. Fault Tolerance <br>3. Real-time example of Map-Reduce <br>4. Real-time example of Spark <br><br>Check our complete Apache Spark and Scala playlist here: https://goo.gl/ViRJ2K

EdurekaIN
Download Presentation

Hadoop MapReduce vs Spark | Hadoop Tutorial For Beginners | Hadoop & Spark Tutorial | Edureka

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  2. Parameters to Compare Performance Cost Fault Tolerance Ease of Use Security Data Processing www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  3. Current Market Situation 47% + (2017) 14% + (2017) (2016) (2016) www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  4. Performance Performance Ease of Use Moves data through disk and network Cost Data Processing Security Performs better as data is cached in the memory Fault Tolerance www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  5. Ease of Use Performance Uses Java API’s and doesn’t support real time processing Ease of Use Cost Data Processing Uses Rich API’s and supports Interactive Mode in real time Security Fault Tolerance www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  6. Cost Performance Comparatively less costlier because of hard disk storage Ease of Use Cost Data Processing Security More costlier because of large amounts of RAM Fault Tolerance www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  7. Data Processing Performance Ease of Use Batch Processing Cost Data Processing Real-time as well as Batch Processing Security Fault Tolerance www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  8. Security Performance Ease of Use More secure & supports all security benefits like Knox Gateway Cost Data Processing Less secure & Authentication via Shared Secret Security Fault Tolerance www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  9. Fault Tolerance Performance Ease of Use Uses replication for fault Tolerance Cost Data Processing Security Uses RDD and other storage models Fault Tolerance www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  10. Real Time Use Case of Map-Reduce Copyright © 2018, edureka and/or its affiliates. All rights reserved.

  11. ETL & Data Analytics Data center 2 Data center 1 EXTRACT TRANSFORM LOAD www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  12. Real Time Use Case of Apache Spark Copyright © 2018, edureka and/or its affiliates. All rights reserved.

  13. Credit Card Fraud Detection Credit card data Spark Engine Spark Streaming Input data Batches of input data Batches of processed data Data Ingestion HDFS & HBase Storage Spark Streaming Analytic Interface www.edureka.co/big-data-and-hadoop www.edureka.co/apache-spark-scala-training Hadoop Certification Training Spark Certification Training

  14. www.edureka.co/apache-spark-scala-training Spark Certification Training

More Related