1 / 19

Extraction and Evaluation of Transcription Factor Gene-Disease Association

Extraction and Evaluation of Transcription Factor Gene-Disease Association. Warren Cheung Wyeth Wasserman Francis Ouellette. Purpose. Quantitatively Integrate Literature Evidence to Predict Gene-Disease Associations Transcription Factor Genes Brain Diseases. Thesis Outline.

sakina
Download Presentation

Extraction and Evaluation of Transcription Factor Gene-Disease Association

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Extraction and Evaluation of Transcription Factor Gene-Disease Association Warren Cheung Wyeth Wasserman Francis Ouellette

  2. Purpose • Quantitatively Integrate Literature Evidence to Predict Gene-Disease Associations • Transcription Factor Genes • Brain Diseases

  3. Thesis Outline • Gene-Disease Associations • Properties of Gene-Disease Associations • Clusters of Genes Related to Disease

  4. Overview

  5. Existing Methods • Machine Learning on Sequence Data • DGP: Properties of Disease Genes • Annotations • G2D: MeSH and GO links • Text Mining • CAESAR: Key terms from “expert” text • Integrating Multiple Methods • Endeavor

  6. Goals • Gene-Disease Associations • Mechanisms and Processes involved • Integrate diverse data sources • Quantitative manner • Verifiable supporting evidence • Transparent view of supporting data • User verification and further analysis • Validate results

  7. Core Entities and Example Data Sources • Genes • Entrez Gene • Evidence • PubMed • Disease • MeSH terms

  8. Data Model

  9. Example Relationships • Gene-Evidence • GeneRIFs • Evidence-Disease • MeSH annotation • Evidence-Evidence • Related Articles

  10. Other Data Sources • Other Annotations • Protein-Protein Interaction • Pathways • Protein Domains • Homology • Annotation in other organisms • Mouse orthologue

  11. Example Gene PubMed Article Disease GeneRIF MeSH Gene PubMed Article PubMed Article Disease GeneRIF Related Article MeSH Gene Gene PubMed Article Disease GeneRIF Interaction MeSH

  12. Scoring • Overrepresentation of terms • Hypergeometric distribution • “Selected” Articles • Gene+GeneRIF • Gene+GeneRIF+Related Article • Gene+Interaction+Related Article

  13. Integrating Scores • Arbitrary Scoring Methods • Average, Product • Combining P-values • Fisher’s Meta-analysis • Z-transform • Weighting • Confidence

  14. Multiple Testing Correction • Testing gene against all possible diseases • Controlling Type I Error • Bonferroni correction

  15. Validation • OMIM • Known disease-gene associations, with literature references • Predictive Performance • Results when using databases saved on date X • Compare with new gene-disease associations discovered after date X

  16. Sensitivity • Ratio • Number of True Positives Identified • Number of Actual True Positives • Only True Positives are known for certain

  17. Beyond Gene-Disease Associations • Properties involved in Gene-Disease Associations • Pathways • Mechanisms • Cluster genes based on disease association

  18. Conclusion • Extract Gene-Disease Associations • Mechanisms • Processes • Quantitative Analysis • Better Understanding of how Genes affect the human condition

More Related