1 / 71

The Relation Ontology

The Relation Ontology. Barry Smith. Concepts, Types and Frames. Concepts, Types and Frames. TLR2:MyD88 complex. has_output. TLR2-MyD88 binding. has_disposition. has_participant. TLR2. LTA binding. has_participant. MyD88. regulated_by. preceded_by. has_lower_level_granularity.

gaurav
Download Presentation

The Relation Ontology

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The Relation Ontology Barry Smith

  2. Concepts, Types and Frames

  3. Concepts, Types and Frames

  4. TLR2:MyD88 complex has_output TLR2-MyD88 binding has_disposition has_participant TLR2 LTA binding has_participant MyD88 regulated_by preceded_by has_lower_level_granularity has_part process TLR2-TLR2 ligand binding has_participant TIR-TIR binding TIR domain TLR-2 signalling pathway

  5. how to define relations such as this?

  6. Uses of ‘ontology’ in PubMed abstracts

  7. By far the most successful: The Gene Ontology

  8. How to do biology across the genome? • MKVSDRRKFEKANFDEFESALNNKNDLVHCPSITLFESIPTEVRSFYEDEKSGLIKVVKFRTGAMDRKRSFEKVVISVMVGKNVKKFLTFVEDEPDFQGGPISKYLIPKKINLMVYTLFQVHTLKFNRKDYDTLSLFYLNRGYYNELSFRVLERCHEIASARPNDSSTMRTFTDFVSGAPIVRSLQKSTIRKYGYNLAPYMFLLLHVDELSIFSAYQASLPGEKKVDTERLKRDLCPRKPIEIKYFSQICNDMMNKKDRLGDILHIILRACALNFGAGPRGGAGDEEDRSITNEEPIIPSVDEHGLKVCKLRSPNTPRRLRKTLDAVKALLVSSCACTARDLDIFDDNNGVAMWKWIKILYHEVAQETTLKDSYRITLVPSSDGISLLAFAGPQRNVYVDDTTRRIQLYTDYNKNGSSEPRLKTLDGLTSDYVFYFVTVLRQMQICALGNSYDAFNHDPWMDVVGFEDPNQVTNRDISRIVLYSYMFLNTAKGCLVEYATFRQYMRELPKNAPQKLNFREMRQGLIALGRHCVGSRFETDLYESATSELMANHSVQTGRNIYGVDFSLTSVSGTTATLLQERASERWIQWLGLESDYHCSFSSTRNAEDV

  9. MKVSDRRKFEKANFDEFESALNNKNDLVHCPSITLFESIPTEVRSFYEDEKSGLIKVVKFRTGAMDRKRSFEKVVISVMVGKNVKKFLTFVEDEPDFQGGPIPSKYLIPKKINLMVYTLFQVHTLKFNRKDYDTLSLFYLNRGYYNELSFRVLERCHEIASARPNDSSTMRTFTDFVSGAPIVRSLQKSTIRKYGYNLAPYMFLLLHVDELSIFSAYQASLPGEKKVDTERLKRDLCPRKPIEIKYFSQICNDMMNKKDRLGDILHIILRACALNFGAGPRGGAGDEEDRSITNEEPIIPSVDEHGLKVCKLRSPNTPRRLRKTLDAVKALLVSSCACTARDLDIFDDNNGVAMWKWIKILYHEVAQETTLKDSYRITLVPSSDGISLLAFAGPQRNVYVDDTTRRIQLYTDYNKNGSSEPRLKTLDGLTSDYVFYFVTVLRQMQICALGNSYDAFNHDPWMDVVGFEDPNQVTNRDISRIVLYSYMFLNTAKGCLVEYATFRQYMRELPKNAPQKLNFREMRQGLIALGRHCVGSRFETDLYESATSELMANHSVQTGRNIYGVDSFSLTSVSGTTATLLQERASERWIQWLGLESDYHCSFSSTRNAEDVVAGEAASSNHHQKISRVTRKRPREPKSTNDILVAGQKLFGSSFEFRDLHQLRLCYEIYMADTPSVAVQAPPGYGKTELFHLPLIALASKGDVEYVSFLFVPYTVLLANCMIRLGRRGCLNVAPVRNFIEEGYDGVTDLYVGIYDDLASTNFTDRIAAWENIVECTFRTNNVKLGYLIVDEFHNFETEVYRQSQFGGITNLDFDAFEKAIFLSGTAPEAVADAALQRIGLTGLAKKSMDINELKRSEDLSRGLSSYPTRMFNLIKEKSEVPLGHVHKIRKKVESQPEEALKLLLALFESEPESKAIVVASTTNEVEELACSWRKYFRVVWIHGKLGAAEKVSRTKEFVTDGSMQVLIGTKLVTEGIDIKQLMMVIMLDNRLNIIELIQGVGRLRDGGLCYLLSRKNSWAARNRKGELPPKEGCITEQVREFYGLESKKGKKGQHVGCCGSRTDLSADTVELIERMDRLAEKQATASMSIVALPSSFQESNSSDRYRKYCSSDEDSNTCIHGSANASTNASTNAITTASTNVRTNATTNASTNATTNASTNASTNATTNASTNATTNSSTNATTTASTNVRTSATTTASINVRTSATTTESTNSSTNATTTESTNSSTNATTTESTNSNTSATTTASINVRTSATTTESTNSSTSATTTASINVRTSATTTKSINSSTNATTTESTNSNTNATTTESTNSSTNATTTESTNSSTNATTTESTNSNTSAATTESTNSNTSATTTESTNASAKEDANKDGNAEDNRFHPVTDINKESYKRKGSQMVLLERKKLKAQFPNTSENMNVLQFLGFRSDEIKHLFLYGIDIYFCPEGVFTQYGLCKGCQKMFELCVCWAGQKVSYRRIAWEALAVERMLRNDEEYKEYLEDIEPYHGDPVGYLKYFSVKRREIYSQIQRNYAWYLAITRRRETISVLDSTRGKQGSQVFRMSGRQIKELYFKVWSNLRESKTEVLQYFLNWDEKKCQEEWEAKDDTVVVEALEKGGVFQRLRSMTSAGLQGPQYVKLQFSRHHRQLRSRYELSLGMHLRDQIALGVTPSKVPHWTAFLSMLIGLFYNKTFRQKLEYLLEQISEVWLLPHWLDLANVEVLAADDTRVPLYMLMVAVHKELDSDDVPDGRFDILLCRDSSREVGEMKVSDRRKFEKANFDEFESALNNKNDLVHCPSITLFESIPTEVRSFYEDEKSGLIKVVKFRTGAMDRKRSFEKVVISVMVGKNVKKFLTFVEDEPDFQGGPIPSKYLIPKKINLMVYTLFQVHTLKFNRKDYDTLSLFYLNRGYYNELSFRVLERCHEIASARPNDSSTMRTFTDFVSGAPIVRSLQKSTIRKYGYNLAPYMFLLLHVDELSIFSAYQASLPGEKKVDTERLKRDLCPRKPIEIKYFSQICNDMMNKKDRLGDILHIILRACALNFGAGPRGGAGDEEDRSITNEEPIIPSVDEHGLKVCKLRSPNTPRRLRKTLDAVKALLVSSCACTARDLDIFDDNNGVAMWKWIKILYHEVAQETTLKDSYRITLVPSSDGISLLAFAGPQRNVYVDDTTRRIQLYTDYNKNGSSEPRLKTLDGLTSDYVFYFVTVLRQMQICALGNSYDAFNHDPWMDVVGFEDPNQVTNRDISRIVLYSYMFLNTAKGCLVEYATFRQYMRELPKNAPQKLNFREMRQGLIALGRHCVGSRFETDLYESATSELMANHSVQTGRNIYGVDSFSLTSVSGTTATLLQERASERWIQWLGLESDYHCSFSSTRNAEDVVAGEAASSNHHQKISRVTRKRPREPKSTNDILVAGQKLFGSSFEFRDLHQLRLCYEIYMADTPSVAVQAPPGYGKTELFHLPLIALASKGDVEYVSFLFVPYTVLLANCMIRLGRRGCLNVAPVRNFIEEGYDGVTDLYVGIYDDLASTNFTDRIAAWENIVECTFRTNNVKLGYLIVDEFHNFETEVYRQSQFGGITNLDFDAFEKAIFLSGTAPEAVADAALQRIGLTGLAKKSMDINELKRSEDLSRGLSSYPTRMFNLIKEKSEVPLGHVHKIRKKVESQPEEALKLLLALFESEPESKAIVVASTTNEVEELACSWRKYFRVVWIHGKLGAAEKVSRTKEFVTDGSMQVLIGTKLVTEGIDIKQLMMVIMLDNRLNIIELIQGVGRLRDGGLCYLLSRKNSWAARNRKGELPPKEGCITEQVREFYGLESKKGKKGQHVGCCGSRTDLSADTVELIERMDRLAEKQATASMSIVALPSSFQESNSSDRYRKYCSSDEDSNTCIHGSANASTNASTNAITTASTNVRTNATTNASTNATTNASTNASTNATTNASTNATTNSSTNATTTASTNVRTSATTTASINVRTSATTTESTNSSTNATTTESTNSSTNATTTESTNSNTSATTTASINVRTSATTTESTNSSTSATTTASINVRTSATTTKSINSSTNATTTESTNSNTNATTTESTNSSTNATTTESTNSSTNATTTESTNSNTSAATTESTNSNTSATTTESTNASAKEDANKDGNAEDNRFHPVTDINKESYKRKGSQMVLLERKKLKAQFPNTSENMNVLQFLGFRSDEIKHLFLYGIDIYFCPEGVFTQYGLCKGCQKMFELCVCWAGQKVSYRRIAWEALAVERMLRNDEEYKEYLEDIEPYHGDPVGYLKYFSVKRREIYSQIQRNYAWYLAITRRRETISVLDSTRGKQGSQVFRMSGRQIKELYFKVWSNLRESKTEVLQYFLNWDEKKCQEEWEAKDDTVVVEALEKGGVFQRLRSMTSAGLQGPQYVKLQFSRHHRQLRSRYELSLGMHLRDQIALGVTPSKVPHWTAFLSMLIGLFYNKTFRQKLEYLLEQISEVWLLPHWLDLANVEVLAADDTRVPLYMLMVAVHKELDSDDVPDGRFDILLCRDSSREVGE

  10. what cellular component? what molecular function? what biological process?

  11. GO used in curation of literature what cellular component? what molecular function? what biological process?

  12. and in integration of databases GlyProt MouseEcotope sphingolipid transporter activity DiabetInGene GluChem

  13. The GO Idea GlyProt MouseEcotope Holliday junction helicase complex DiabetInGene GluChem

  14. The GO Idea GlyProt MouseEcotope sphingolipid transporter activity DiabetInGene GluChem

  15. GO used in reasoning Clark et al., 2005 is_a part_of

  16. GO provides a controlled system of representations for use in annotating data • multi-species • multi-disciplinary • multi-granularity, from molecules to population

  17. Gene products involved in cardiac muscle development in humans

  18. $100 mill. invested in literature curation using GO over 11 million annotations relating gene products described in the UniProt, Ensembl and other databases to terms in the GO

  19. GO allows a new kind of biological research based on analysis and comparison of the massive quantities of annotations linking GO terms to the gene products described in scientific literature and in scientific databases

  20. GO is amazingly successful in overcoming data silo problems • but it covers only • cellular components • molecular functions • biological processes

  21. The Open Biomedical Ontologies (OBO) Foundry

  22. The OBO Foundry – to extend the GO to enable intelligent integration of gigantic bodies of heterogeneous data across the entire domain of the life sciences, including clinical medicine – to create an evolving, map-like, computable representation of the entire domain of biological and medical reality

  23. The OBO Foundry Initial Candidate Members • GO Gene Ontology • CL Cell Ontology • SO Sequence Ontology • ChEBI Chemical Ontology • PATO Phenotype (Quality) Ontology • FMA Foundational Model of Anatomy • ChEBI Chemical Entities of Biological Interest • CARO Common Anatomy Reference Ontology • PRO Protein Ontology

  24. The OBO Foundry Under development • Disease Ontology • Infectious Disease Ontology • Mammalian Phenotype Ontology • Plant Trait Ontology • Environment Ontology • Ontology for Biomedical Investigations • Behavior Ontology • RNA Ontology   • RO Relation Ontology

  25. A success story in top-down information integration • Ontologies configured as extensions of a single upper level ontology (BFO) • Used by 100s of researchers to promote interoperability of experimental data in scores of high-throughput domains of biology and medicine via semantic annotation

  26. The linguistic approach • Bottoms-up, focused on linguistic properties manifested by the contents of a large corpus viewed from a cognitive perspective (mapping/modeling meanings or concepts rather than entities in reality)

  27. Automatic mining of “assocations” from MEDLINE FACTA: Finding Associated Concepts with Text Analysis • What diseases are related to a particular chemical? • What proteins are related to a particular disease? http://text0.mib.man.ac.uk/software/facta/

  28. For the linguistic approach • fiction may be no less important than fact • English has no privileged status (the larger the corpus, the better) • consistency (and thus additivity) of annotations is not important, because cognitive perspectives differ • goal is automatic generation of semantic annotations via pattern- matching

  29. For the scientific approach • factual discourse alone important • English is lingua franca • regimentation is allowed • goal of truth: to create a single computer-processable map of reality via painstaking Handarbeit • truth is one  we strive for consistency of annotations

  30. The linguistic approach is concerned with knowledge representation • The scientific approach is concerned with reality representation

  31. OBO Relation Ontology (RO 1.0)

  32. Relation Ontology • supports consistent linkage of OBO Foundry ontologies through a common system of formally defined relations • to enable reasoning both within and across ontologies, and thus also within and between the literature annotated in its terms

  33. Relation Ontology • instance_of • is_a (= is a subtype of) • depends_on • part_of • inheres_in • has_input • has_participant • …. • http://obofoundry.org/ro/

  34. Basic Formal Ontology (BFO) Continuant Occurrent (Process, Event) Independent Continuant Dependent Continuant http://ifomis.uni-saarland.de/bfo/

  35. Fundamental Dichotomy • Continuants preserve their identity through change • Occurrents (aka processes) • have temporal parts • unfold themselves in successive phases • exist only in their phases • have all their parts of necessity

  36. instance_of types Continuant Occurrent process, event Independent Continuant thing Dependent Continuant quality .... ..... ....... instances

  37. types vs. instances • compare OWL: T-box vs. A-box • (terminology vs. assertions)

  38. 3 kinds of (binary) relations • Between types • human is_a mammal • human heart part_ofhuman • Between an instance and a type • this human instance_of the type human • this human allergic_to the type tamiflu • Between instances • Mary’s heart part_of Mary • Mary’s aorta connected_to Mary’s heart

  39. depends_on Continuant Occurrent process, event Independent Continuant thing Dependent Continuant quality quality depends on bearer .... ..... .......

  40. Dependent continuants • the whiteness quality of this cheese • your role as lecturer • the disposition of this peach to ripen

  41. depends_on Continuant Occurrent process Independent Continuant thing Dependent Continuant quality temperature depends on bearer .... ..... .......

  42. depends_on Continuant Occurrent process, event Independent Continuant thing Dependent Continuant quality, … event depends on participant .... ..... .......

  43. Type-level relations presuppose the underlying instance-level relations • A is_a B =def. A and B are types and all instances of A are instances of B • A part_of B =def. All instances of A are instance-level-parts-of some instance of B

  44. The assertions linking terms in ontologies must hold universally Hence all type-level relations in RO are provided with All-Some definitions (For linguists, Some-Some relations are equally important)

  45. Including only All-Some relations means: All relations evaluable as • Transitive • Symmetric • Reflexive • Anti-Symmetric All relations support logical reasoning – as contrasted with: is_related_to, is_associated_with, is_narrower_in_meaning_than …

  46. Reasoning should be able to cascade from one relational assertion (A R1 B) to the next (B R2 C). • Find all DNA binding proteins should also Find all transcription factor proteins because • Transcription factor is_a DNA binding protein Only the All-Some structure guarantees such cascading of relational assertions

  47. Organisms are continuants • they are entities which endure through time through gain and loss of parts • Processes are occurrents • they are entities which unfold through time, and have all their parts as a matter of necessity

  48. human testis part_of adult human being • but not • human being has_part human testis • and not even • male human being has_part human testis

  49. part_of for continuant types • A part_of B =def. • For all x, t if x instance_of A at t then there is some y, y instance_of B at t and x instance_level_part_of y at t • cell membrane part_of cell

  50. part_of for occurrent types • A part_of B =def. • For all x, if x instance_of A then there is some y, y instance_of B and x instance_level_part_of y • EVERY A IS PART OF SOME B

More Related