1 / 59

PATO & Phenotypes: From model organisms to clinical medicine

PATO & Phenotypes: From model organisms to clinical medicine. Suzanna Lewis September 4th, 2008 Signs, Symptoms and Findings Workshop First Steps Toward an Ontology of Clinical Phenotypes.

tokala
Download Presentation

PATO & Phenotypes: From model organisms to clinical medicine

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. PATO & Phenotypes:From model organisms to clinical medicine Suzanna Lewis September 4th, 2008 Signs, Symptoms and Findings Workshop First Steps Toward an Ontology of Clinical Phenotypes

  2. Describing phenotype using ontologies will aid in the identification of models of disease & candidate causative genes • GWAS: Genome Wide Association Studies • Any study of genetic variation across the entire human genome that is designed to identify genetic associations with observable traits (such as blood pressure or weight), or the presence or absence of a disease or condition. • Given an identified gene, then what?

  3. Animal disease models Animal models Mutant Gene Mutant or missing ProteinMutant Phenotype (disease model)

  4. Animal disease models Humans Animal models Mutant Gene Mutant or missing ProteinMutant Phenotype (disease) Mutant Gene Mutant or missing ProteinMutant Phenotype (disease model)

  5. Animal disease models Humans Animal models Mutant Gene Mutant or missing ProteinMutant Phenotype (disease) Mutant Gene Mutant or missing ProteinMutant Phenotype (disease model)

  6. Animal disease models Humans Animal models Mutant Gene Mutant or missing ProteinMutant Phenotype (disease) Mutant Gene Mutant or missing ProteinMutant Phenotype (disease model)

  7. Phenotype data mining = text searching? • Text-based phenotype resources: • OMIM (NCBI) • DECIPHER (Sanger) • HGMD (Cardiff) • Disease-specific databases • MODs • PubMed

  8. Information retrieval from text-based resources (OMIM) is not straightforward: Thanks to: M Ashburner

  9. Even if we can find what we are looking for in one organism, how can we associate that with phenotypes observed in different organisms? Methods to link phenotypic descriptions of human diseases to animal models currently don’t exist.

  10. Goal: Turn text-based phenotypes into ontology-based computable annotations • Define a model for representing phenotypes

  11. SHH-/+ SHH-/- shh-/+ shh-/-

  12. Phenotype (clinical sign) = entity + attribute

  13. Phenotype (clinical sign) = entity + attribute P1 = eye + hypoteloric

  14. Phenotype (clinical sign) = entity + attribute P1 = eye + hypoteloric P2 = midface + hypoplastic

  15. Phenotype (clinical sign) = entity + attribute P1 = eye + hypoteloric P2 = midface + hypoplastic P3 = kidney + hypertrophied

  16. Phenotype (clinical sign) = entity + attribute P1 = eye + hypoteloric P2 = midface + hypoplastic P3 = kidney + hypertrophied PATO: hypoteloric hypoplastic hypertrophied ZFIN: eye midface kidney +

  17. Phenotype (clinical sign) = entity + attribute Anatomical ontology Cell & tissue ontology Developmental ontology Gene ontology biological process molecular function cellular component + PATO (phenotype and trait ontology)

  18. Phenotype (clinical sign) = entity + attribute P1 = eye + hypoteloric P2 = midface + hypoplastic P3 = kidney + hypertrophied Syndrome = P1 + P2 + P3 (disease) (package) = holoprosencephaly

  19. Phenotype annotation model Genetic Environment Evidence Qualifier Assertion Source Entity Quality relationship Attribution Properties Units Who makes the assertion When, what organization

  20. represents Dev Biol 2005 Jul 15;283(2):357-72 “Sonic hedgehog is required for cardiac outflow tract and neural crest cell development” OBD and annotations subj relation obj annotation Absence of aorta investigator read observation publish/ create Information entity bio-entity Experiment/ investigation communicate X Direct annotation query/ meta-analysis Agent (human/computer) Community/expert annotation Shh- Absence Of aorta local db influences local db local db submit/ consume Shh bio-entity Shh+ Heart development Participates in Multiple schemas Computational representation

  21. Goal: Turn text-based phenotypes into ontology-based computable annotations • Define a model for representing phenotypes • Develop and extend requisite ontologies • For the entities being described: anatomies, processes, …

  22. Building a suite of orthogonal interoperable reference (evidence based) ontologies in the biomedical domain. It is critical that ontologies are developed cooperatively so that their classification strategies augment one another. Truth springs from arguments amongst friends. (David Hume)

  23. Requisite ontologies • An ontology of qualities (PATO) • Organism specific anatomies • A controlled vocabulary of homologous and analogous anatomical structures (Uberon) • Gene Ontology • Cell Types

  24. Goal: Turn text-based phenotypes into ontology-based computable annotations • Define a model for representing phenotypes • Develop and extend requisite ontologies • For the entities being described: anatomies, processes, … • Develop an intuitive annotation environment for rigorously capturing phenotypes (“semantic authoring”)

  25. Phenote: Simple software for annotating using ontologies • Provide tool for ontology-based annotation • Standardized model to record annotations for increased compatibility of data between disparate communities. • Simple & intuitive user interface • (especially for users that don’t know/care about what an ontology is) • Easy-to-configure for different user-communities • Pluggable architecture for external applications to interface/embed in application • Provide interfaces with external SOAP and REST services for streamlined workflow (OBD, NCBI, EBI, etc). • www.phenote.org

  26. Ontologies can be utilized from various resources in OWL and OBO format BioPortal External site Local file CVS

  27. Phenote tour

  28. Editor

  29. Post-composition: Join together 2 (or more) terms for specificity: Apoptosis of neuron in skin (GO,CL,FMA) S-phase of colon cancer cell (GO,CL) Aster of human spermatocyte (GO,FMA) Combine terms from different ontologies Increase “information content” of an annotation Pre-composed: Have decomposed definitions of ~2/3rds of MP terms available to incorporate mouse data Refining terms on-the-spot

  30. Term Info Browser

  31. Annotation Table

  32. Retrieve data from NCBI: OMIM, PUBMED, … (SOAP plug-in)

  33. Graphical Viewer

  34. Goal: Turn text-based phenotypes into ontology-based computable annotations • Define a model for representing phenotypes • Develop and extend requisite ontologies • For the entities being described: anatomies, processes, … • Develop an intuitive annotation environment for rigorously capturing phenotypes (“semantic authoring”) • Develop a set of guidelines for biocurators • Annotate mutant phenotypes (OMIM and models)

  35. General Annotation Standards • Remarkable normality • Absence • Relative qualities (what does “small” mean?) • Rates/frequencies • does it inhere in the heart or a process? • Homeotic transformation • Phenotypes specific to a stage or temporal duration

  36. Testing the methodology • Annotated 11 gene-linked human diseases described in OMIM, and their homologs in zebrafish and fruitfly. • ATP2A1, BRODY MYOPATHY • EPB41, ELLIPTOCYTOSIS • EXT2, MULTIPLE EXOSTOSES • EYA1, EYES ABSENT • FECH, PROTOPORPHYRIA • PAX2, RENAL-COLOBOMA SYNDROME • SHH, HOLOPROSENCEPHALY • SOX9, CAMPOMELIC DYSPLASIA • SOX10, PERIPHERAL DEMYELINATING NEUROPATHY • TNNT2, FAMILIAL HYPERTROPHIC CARDIOMYOPATHY • TTN, MUSCULAR DYSTROPHY

  37. An OMIM Record

  38. An OMIM Record

  39. An OMIM Record

  40. An OMIM Record

  41. An OMIM Record

  42. Goal: Turn text-based phenotypes into ontology-based computable annotations • Define a model for representing phenotypes • Develop and extend requisite ontologies • For the entities being described: anatomies, processes, … • Develop an intuitive annotation environment for rigorously capturing phenotypes (“semantic authoring”) • Develop a set of guidelines for biocurators • Annotate mutant phenotypes (OMIM and models) • Collect & store annotations in a common resource (OBD) and make these broadly available

  43. 4355 genes and genotypes in OBD17782 entity-quality annotations in OBD

  44. OBD model: Requirements • Generic • We can’t define a rigid schema for all of biomedicine • Let the domain ontologies do the modeling of the domain • Expressive • Use cases vary from simple ‘tagging’ to complex descriptions of biological phenomena • Formal semantics • Amenable to logical reasoning • First Order Logic and/or OWL1.1 • Standards-compatible • Integratable with semantic web

  45. OBD Model: overview • Graph-based: nodes and links • Nodes: Classes, instances, relations • Links: Relation instances • Connect subject and object via relation plus additional properties • Annotations: Posited links with attribution / evidence • Equivalent expressivity as RDF and OWL • Links aka axioms and facts in OWL • Attributed links: • Named graphs • Reification • N-ary relation pattern • Supports construction of complex descriptions through graph model

  46. OBD Dataflow

  47. Example ofAnnotation in OBD Post-composition of complex anatomical entity descriptions Post-composition of phenotype classes (PATO EQ formalism) key

  48. OBD Architecture • Two stacks • Semantic web stack • Built using Sesame triplestore + OWLIM • Future iterations: Science-commons Virtuoso • OBD-SQL stack • Current focus • Traditional enterprise architecture • Plugs into Semantic Web stack via D2RQ

  49. Alpha version of API implemented Test clients access via SOAP Phenote current accesses via org.obo model & JDBC Wraps org.obo model and OBD schema Share relational abstraction layer Org.obo wraps OWLAPI Phenote currently connects via JDBC connectivity in org.obo OBD-SQL Stack

More Related