1 / 18

Evaluation of WEKA

Evaluation of WEKA. Waikato Environment for Knowledge Analysis Presented By: Manoj Wartikar & Sameer Sagade. Outline. Introduction to the WEKA System. Features Pros and Cons Enhancements. Introduction. A research project at the University of Waikato, NZ

trixie
Download Presentation

Evaluation of WEKA

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Evaluation of WEKA Waikato Environment for Knowledge Analysis Presented By: Manoj Wartikar & Sameer Sagade

  2. Outline • Introduction to the WEKA System. • Features • Pros and Cons • Enhancements

  3. Introduction • A research project at the University of Waikato, NZ • Weka is a collection of machine learning algorithms for solving real-world data mining problems. • Developed in Java 2

  4. Features • Documented features of WEKA • Attribute Selection • Clustering • Classification • Association Rules • Filters • Estimators

  5. Attribute Selection • A part of the Preprocessing phase in the Knowledge Discovery process. • Useful to specify the attributes and their values on which data can be mined.

  6. Attribute Selection contd…. • Algorithms Implemented • Best First • Forward Selection • Ranked Output First

  7. Clustering • Algorithms Implemented • Cobweb • Estimation Maximization • Clusterer • Distribution Clusterer

  8. Classification • Algorithms Implemented • K Nearest Neighbor • Naïve Bayes • Bagging • Boosting • Multi - Class Classifier

  9. Association Rules • Algorithms Implemented • Apriori

  10. Filters • Algorithms Implemented • Attribute Filter • Discretize Filter • Split Dataset Filter

  11. Estimators • Algorithms Implemented • Discrete Estimator • Kernel Estimator • Normal Estimator • Poisson Estimator

  12. Sample Execution java weka.associations.Apriori -t data/weather.nominal.arff -I yes Apriori ======= Minimum support: 0.2 Minimum confidence: 0.9 Number of cycles performed: 17 Generated sets of large itemsets: Size of set of large itemsets L(1): 12

  13. Sample Execution Best rules found: 1. humidity=normal windy=FALSE 4 ==> play=yes 4 (1) 2. temperature=cool 4 ==> humidity=normal 4 (1) 3. outlook=overcast 4 ==> play=yes 4 (1) 4. temperature=cool play=yes 3 ==> humidity=normal 3 (1) 5. outlook=rainy windy=FALSE 3 ==> play=yes 3 (1) 6. outlook=rainy play=yes 3 ==> windy=FALSE 3 (1) 7. outlook=sunny humidity=high 3 ==> play=no 3 (1) 8. outlook=sunny play=no 3 ==> humidity=high 3 (1)

  14. Boosting • ADA Boost • Logit Boost • Decision Stump

  15. Pros and Cons of WEKA • Covers the Entire Machine Learning Process • Easy to compare the results of the different algorithms implemented • Accepts one of the most widely used data formats as input i.e the ARFF format.

  16. Flexible APIs for programmers Customization possible Pros and Cons for WEKA

  17. Pros and Cons for WEKA • Textual User Interface • Requires the Java Virtual Machine to be installed for execution • Visualization of the mining results not possible

  18. Enhancements • The new version of WEKA 3.1.7 overcomes some of the decripancies of the previous version like • Graphical User Interface • Visualization of Results. • Mining of Non - local data bases

More Related