1 / 36

Olivier Bodenreider

Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA. ISMB 2002 Fifth Annual Bio-Ontologies Meeting August 8, 2002. Experiences in visualizing and navigating biomedical ontologies and knowledge bases. Introduction 1. Biomedical knowledge

mirit
Download Presentation

Olivier Bodenreider

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Olivier Bodenreider Lister Hill National Centerfor Biomedical CommunicationsBethesda, Maryland - USA ISMB 2002 Fifth Annual Bio-Ontologies Meeting August 8, 2002 Experiences in visualizing and navigating biomedical ontologies and knowledge bases

  2. Introduction 1 • Biomedical knowledge • Terminologies (names) • Ontologies (objects) • Knowledge bases (facts) • Common features • Terms / Concepts • Inter-concept relationships • Hierarchical • Associative

  3. knowledge term Introduction 2 • Challenges • Volume of information • 104 -106 concepts • 105 -107 relationships • Orientation • Mapping to concepts • Visualizing concept spaces • Navigating concept spaces

  4. SemNav UMLS browser Entry point: biomedical term Display related concepts Display properties of interconcept relationships Allow navigation among concepts GenNav GO browser Entry point: GO term or gene product name/symbol Display related GO terms and gene products Display properties of term/term and term/gene product relationships Allow navigation between GO terms and gene products Introduction 3

  5. Outline • Background • Unified Medical Language System (UMLS) • Gene Ontology • Overview of the browsers • SemNav • GenNav • Common features • Differences

  6. UMLS and GO

  7. Unified Medical Language System • Developed at NLM since 1990 • 13th edition in 2002 • Integrates some 60 terminological resources • Clinical vocabularies (including specialties) • Core terminologies (anatomy, drugs, med. devices) • Administrative terminologies, standards • Integration • Synonymous terms are clustered in a concept • Hierarchies (trees) are combined in a graph structure

  8. MeSH, SNOMED CTV3, Jablonski, CRISP, DxPlain, MedDRA, LOINC Duchenne muscular dystrophy Duchenne’s muscular dystrophy COSTAR Duchenne de Boulogne muscular dystrophy Jablonski Duchenne type progressive muscular dystrophy SNOMED MeSH, CTV3 SNOMED pseudohypertrophic muscular dystrophy X-liked recessive muscular dystrophy Jablonski severe generalized familial muscular dystrophy SNOMED Terminology integration Terms

  9. Adrenal Gland Diseases Adrenal Cortex Diseases SNOMED MeSH AOD Read Codes Hypoadrenalism Adrenal Gland Hypofunction UMLS Adrenal cortical hypofunction Addison’s Disease Terminology integration Relationships

  10. Semantic Network Semantic Type categorization Concept Metathesaurus UMLS • Two-level structure • Semantic Network • 134 Semantic Types (STs) • 54 types of relationshipsamong STs • Metathesaurus • 800,000 concepts • ~10 M inter-conceptrelationships • Link = categorization

  11. Semantic Types Anatomical Structure Fully Formed Anatomical Structure Embryonic Structure Disease or Syndrome Body Part, Organ or Organ Component Semantic Network Pharmacologic Substance Population Group Metathesaurus Medias-tinum Saccular Viscus 4 Angina Pectoris 97 Esophagus 12 Heart Cardiotonic Agents 225 Left PhrenicNerve Tissue Donors Heart Valves Fetal Heart 22 9 31 Concepts

  12. Gene Ontology • Developed by the GO Consortium • Several components • Ontology (~11,000 concepts) • Molecular functions • Cellular components • Biological processes • Gene products (~125,000) • Associations between Gene products and GO concepts (~357,000)

  13. SemNav

  14. MeSH Browser

  15. SemNav Visualization options

  16. Semantic Types Biologically ActiveSubstance Disease orSyndrome Amino Acid,Peptide or Protein MuscularDystrophy,Duchenne Dystrophin 55 Concepts SemNav Relationships

  17. GenNav

  18. Material and Methods

  19. Common features and differences

  20. Mapping query terms • Mapping terms to concepts • Matching criteria (exact, approximate) • Normalization techniques • work well on clinical terms • less applicable to gene names • Query disambiguation • With semantic type in SemNav • With species in GenNav

  21. Visualization • Graph vs. Trees (Forest) • Multiple inheritance is better visualized by graphs than by trees • Off-the-shelf, freely available graph visualization packages are available (GraphViz) • Need to reduce complexity • Transitive reduction on complex graphs • Feature selection • e.g., a given vocabulary in SemNav • e.g., a given species in GenNav

  22. Navigation • Tool for exploration • Navigation among concepts(SemNav and GenNav) • Navigation between two poles(Gene products and GO concepts in GenNav) • Self-contained (SemNav)or opened to external resources (GenNav)

  23. Conclusions

  24. Conclusions • Most of the lessons learned while developing SemNav (for browsing general biomedical knowledge) were applicable to GenNav (for browsing molecular biology knowledge) • The lexical techniques suitable for mapping text to clinical terminologies require adaptation to the specificity of molecular biology terminologies

  25. Olivier Bodenreider Lister Hill National Centerfor Biomedical CommunicationsBethesda, Maryland - USA Contact: olivier@nlm.nih.gov SemNav http://umlsks.nlm.nih.gov* ► Resources ► Semantic Navigator (* free UMLS registration required) GenNav http://etbsun2.nlm.nih.gov:8000/perl/gennav.pl

More Related