1 / 10

Data Analytics Architectures and Infrastructure

Data Analytics Architectures and Infrastructure. Daniel Silver March, 2014. The KDD Process. Interpretation and Evaluation. Data Mining. Knowledge. Selection and Preprocessing. p(x)=0.02. Data Consolidation. Patterns & Models. Prepared Data. Data Warehouse. Consolidated

chavez
Download Presentation

Data Analytics Architectures and Infrastructure

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Analytics Architectures and Infrastructure Daniel Silver March, 2014

  2. The KDD Process Interpretation and Evaluation Data Mining Knowledge Selection and Preprocessing p(x)=0.02 Data Consolidation Patterns & Models Prepared Data Data Warehouse Consolidated Data Data Sources

  3. The KDD Process The Architecture of a KDD System Graphical User Interface Data Mining Interpretation and Evaluation Selection and Preprocessing Data Consolidation Knowledge Warehouse Data Sources

  4. Big Data Analytics Infrastructure • Requires: • DM: Data Management • DW: Data Warehouse • BI/DS: Business Intelligence / Data Science • DAA: Data Analytics Architecture

  5. Big Data Analytics Infrastructure DW BI/DS DAA DM

  6. Relationship between DW and DM? Strategic Tactical Rationale for data consolidation Analysis Query/Reporting OLAP Data Mining Data Warehousing Source of consolidated data

  7. Data Warehousing OLAP Knowledge Workers “The Ideal Picture” Stats IDT Data Marts & Analytical Pocessors ANN One or more central repositories Data Warehouse Operational feedback from analytics Extraction Transformation Load Operational Data Store (ODS) Source Systems and Operational Users

  8. Top-Down – traditional DA Architecture Bottom-up – Big DA Architecture

  9. Hadoop and MapReduce • http://www.youtube.com/watch?v=9s-vSeWej1U • http://www.youtube.com/watch?v=4DgTLaFNQq0 • http://www.youtube.com/watch?v=RQr0qd8gxW8 • http://hci.stanford.edu/courses/cs448g/a2/files/map_reduce_tutorial.pdf

  10. References • http://www.datasciencecentral.com/profiles/blogs/big-data-analytics-infrastructure • Reference Architecture for Big Data and DW • http://www.youtube.com/watch?v=L_s-x1HAi5k • http://www.youtube.com/watch?v=bETjVsWJsAs

More Related