1 / 15

Strengthening I-ReGEC classifier

Strengthening I-ReGEC classifier. G. Attratto, D. Feminiano, and M.R. Guarracino High Performance Computing and Networking Institute Italian National Research Council. Supervised learning. Supervised learning refers to the capability of a system to learn from a set of

bmillington
Download Presentation

Strengthening I-ReGEC classifier

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Strengthening I-ReGEC classifier G. Attratto, D. Feminiano, and M.R. Guarracino High Performance Computing and Networking Institute Italian National Research Council

  2. Supervised learning • Supervised learning refers • to the capability of a system • to learn from a set of • input/output couples: • Training Set.

  3. Classification • Consists of determining a model that it • allows to group elements according to • determined features • The groups are the classes

  4. Evaluation of classification methods • Accuracy It’s ability’s pointer of prediction model • Speed Some methods employ little time than others • Robustness The defined rules and the accuracy do not change considerable with various set • Scalability Possibility to classify dataset of great dimensions

  5. Goals • To render more efficient the examples’ choice during the training • Delete the redundant examples or insufficient informative contribution • Strengthening the training set, deleting the obsolete knowledge Building an efficient, scalabile and generalizable model

  6. Classification techniques • Decision tree (Optimal Tree) Based on tree • Bayesian Networks (Slow in training) Compute posterior probabilities with Bayes’ theorem • Neurals Networks (Slow in training) Simulate the behavior of the biological systems • Support Vector Machine (SVM) Calculate hyperplanes

  7. SVM: The state of the art • Find an examples set (support vectors) • representatives for classes Support vector Linear case Nonlinear case Separation margin Optimal Hyperplane

  8. Regec • Two Hyperplanes representative for classes (GEPSVM’s family) Based on Genralized Eigenvalue

  9. I-Regec • Select k points for each class with a clustering technique • (K-means) |S| = 2xK • Classify the test-set with the S points • Add misclassified points in incremental • mode to the S set • On proceede until the finish of misclassified points

  10. Strengthening • Apply I-ReGEC in order to obtain the training set • Each iteration delete a point from training set • Apply I-ReGEC in each iteration with new input set S • Strengthening the set (save new S) if accuracy is improved

  11. Microarray and matrix CLASSES FEATURES E X AM P L E S Gene expression

  12. Results

  13. Results and Diagrams Golub 2D I-Regec Strengthening Golub 3D I-Regec Strengthening

  14. Conclusions • The examples choice became more efficient • The reduntants or obsolete examples have been deleted • The training set are “strengthened”

  15. Future work • In order to optimize the execution time, the Strengthening technique would to go integrated into I-Regec.

More Related