1 / 13

Recursive partitioning for tumor classification with gene microarray data

Recursive partitioning for tumor classification with gene microarray data. Heping Zhang, Chang-Yung Yu, Burton Singer, Momian Xiong. What is Recursive Partitioning? Basic Idea:. Technical description of recursive partitioning Example:.

torgny
Download Presentation

Recursive partitioning for tumor classification with gene microarray data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Recursive partitioning for tumor classification with gene microarray data Heping Zhang, Chang-Yung Yu, Burton Singer, Momian Xiong

  2. What is Recursive Partitioning?Basic Idea:

  3. Technical description of recursive partitioningExample:

  4. Technical description of recursive partitioningAlgorithm: • Examine all of the available gene expression levels and all possible thresholds for each of the expression levels • Select the combination of gene expression level and threshold that results in the best separation of cancer and normal tissues on the basis of the node purity function Quality of the tree classification: Error rate based on cross-validation

  5. Technical description of recursive partitioningNodePurity: A little bit of math  One example of entropy function: P log(P) + (1-P) log(1-P), where P is the probability of a tissue being normal within the node • Note: • Maximum purity ( =0 ) When all tissues are of the same type within the node ( P = 0 or 1) • Minimum purity ( = -log2) When all tissues are of the same type within the node ( P = 0.5)

  6. Example from the article Expression profiles of 2,000 genes using an Affimetrix oligonucleotide array in 22 normal and 40 colon cancer tissues(www.sph.uth.tmc.edu/hgc)Results: Using 5-fold cross validation, The error rate is between 6-8%, which is much better than that obtained by exsiting analysis.

  7. Fig1. Classification trees for tissue types by using expression data from three genes ( M26383, R15447, M28214)

  8. Correlation among gene expression profiles

  9. Another Tree Based on A Different Set of Three Genes (Fig.6)

  10. Correlation Matrix among Genes in Fig.1 and Fig. 6

  11. Other clustering classification 1. Hierachical2. K-means3. Self-orgnizing maps4. Coupled two-way clustering

  12. Advantage of recursive partitioning classification methods 1. Efficient with large number of genes2. More than two types of tissues simultaneously3. Automatically selects valuable genes as predictors4. More precise than other classification methods

  13. 1.It is likely that the information contained in a large number of genes can be captured by a small number of genes without significant loss of information.2.The precision of classification of recursive partitioning is important for clinical application. Conclusion:

More Related