1 / 11

Experimental Perspectives on Learning from Imbalanced Data

Experimental Perspectives on Learning from Imbalanced Data. Presenter : Ai-Chen Liao Authors : Jason Van Hulse, Taghi M. Khoshgoftaar, Amri Napolitano. 2007 . ACM . Page : 935 - 942. Outline. Motivation Objective Method Experimental Result Conclusion Comments.

siran
Download Presentation

Experimental Perspectives on Learning from Imbalanced Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Experimental Perspectives on Learning from Imbalanced Data Presenter : Ai-Chen Liao Authors : Jason Van Hulse, Taghi M. Khoshgoftaar, Amri Napolitano 2007 . ACM . Page : 935 - 942

  2. Outline • Motivation • Objective • Method • Experimental Result • Conclusion • Comments

  3. Motivation • When classes are imbalanced, many leaning algorithms can suffer from the perspective of reduced performance. • Can data sampling be used to improve the performance of learners built from imbalanced data? • Is the effectiveness of sampling related to the type of learner? • Do the results change if the objective is to optimize different performance metrics?

  4. Objective • The objective of this research is to provide practical guidance to machine learning practitioners when building classifiers from imbalanced data, and to present to researchers some possible directions for future study.

  5. Method • random undersampling (RUS) • random oversampling (ROS) • one-sided selection (OSS) • cluster-based oversampling (CBOS) • Wilson’s editing (WE) • SMOTE (SM) • borderline-SMOTE (BSM).

  6. Experimental Results 214筆資料

  7. Experimental Results

  8. Experimental Results

  9. Experimental Results

  10. Conclusion • We have presented a comprehensive and systematic experimental analysis of learning from imbalanced data, using 11 learning algorithms with 35 real-world benchmark datasets from a variety of application domains. • Sampling is often critical to improving classifier performance, especially optimizing threshold-dependent measures such as the geometric mean or TPR.

  11. Comments • Advantage • A lot of experiments • Drawback • … • Application • Handling imbalanced data

More Related