Data Mining and Predictive Analytics Toolkit - PowerPoint PPT Presentation

sarah-owens
data mining and predictive analytics toolkit n.
Skip this Video
Loading SlideShow in 5 Seconds..
Data Mining and Predictive Analytics Toolkit PowerPoint Presentation
Download Presentation
Data Mining and Predictive Analytics Toolkit

play fullscreen
1 / 8
Download Presentation
Data Mining and Predictive Analytics Toolkit
214 Views
Download Presentation

Data Mining and Predictive Analytics Toolkit

- - - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript

  1. Data Mining and Predictive Analytics Toolkit December 2013, Jakub Miarka, University of Leeds

  2. RapidMiner • Formerly called YALE (Yet Another Language Environment) • Environment for machine learning, data and text mining, predictive and business analytics • Started in 2001 at the Artificial Intelligence Unit of the Dortmund University of Technology, Germany • 2006 – Rapid-I founded • AGPL open source license until November 2013 • Profitable company, growing organically • 3 millions downloads / 200,000 users • One of the leaders in the predictive analytics

  3. Usage • GUI for building data mining/analytics workflows • Highly scalable predictive analytics application • Learning schemes and attribute evaluators from WEKA • Integrates with popular enterprise data sources (60+, incl. SAP) • Supports both structured and unstructured data • Typically used for: • customer segmentation • loyalty and retention analysis • credit ratings • asset maintenance • resource planning

  4. A pie chart showing aggregated information Multiple results displayed simultaneously

  5. Benefits • No programming skills needed and easy to use (GUI, drag & drop…) • 1000+ analytical methods • 120+ models incl. decision trees and dozens of visualisations available • Powerful and scalable • Flexible, scriptable, supports plugins and extensions • Provides a GUI to design an analytical pipeline (the "operator tree") which defines the analytical processes the user wishes to apply to the data • Other applications can use the engine through API

  6. Popularity • Suitable for individuals and large enterprises as well • Some of the customers: • PayPal • PepsiCo • eBay • Volkswagen • Lufthansa • … and many more

  7. November 2013 and future • $5 millions investment • Rebranded from Rapid-I to RapidMiner • Core stays open source but new commercial packages introduced • When a new version is published, previous ones become free • In future, increased focus on Big Data and self-service-style interface for less technical and more business-focused users • A vision to become the industry standard for predictive analytics

  8. References http://rapidminer.com/products/rapidminer-studio/ http://sourceforge.net/p/rapidminer/wiki/Home/ http://en.wikipedia.org/wiki/RapidMiner http://techcrunch.com/2013/11/04/german-predictive-analytics-startup-rapid-i-rebrands-as-rapidminer-takes-5m-from-open-ocean-earlybird-to-tackle-the-u-s-market/ http://www.zdnet.com/rapid-i-gets-funded-re-brands-as-rapidminer-7000022757/