1 / 66

Terminologies, Ontologies, & SNOMED What are they for? What would Quality Assurance mean?

Terminologies, Ontologies, & SNOMED What are they for? What would Quality Assurance mean?. Alan Rector School of Computer Science / Northwest Institute of Bio-Health Informatics rector@cs.man.ac.uk with special acknowledgement to Jeremy Rogers

chelsey
Download Presentation

Terminologies, Ontologies, & SNOMED What are they for? What would Quality Assurance mean?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Terminologies, Ontologies, & SNOMEDWhat are they for? What would Quality Assurance mean? Alan RectorSchool of Computer Science / Northwest Institute of Bio-Health Informaticsrector@cs.man.ac.uk with special acknowledgement to Jeremy Rogers www.co-ode.orgwww.clinical-escience.orgwww.opengalen.org

  2. Terminology and ontologies in Healthcare:What for? What is meant by Quality? • A Talk in three parts • Part 1 • A review of a bit of history of clinical terminology and ontologies • Some fundamental issues • Part 2 • Focus on Quality Assurance • Quality for what? • Dimensions of quality • Part 3 • Quality issues in SNOMED • What might be done to address them • Summary

  3. Medical Terminology:A bit of history • It all started with public health, vital statistics and epidemiology…

  4. London Bills of Mortalityevery Thursday from 1603 until the 1830s

  5. Origins of modern terminologies100 years of epidemiology • ICD - Farr in 1860s to ICD9 in 1979 • International reporting of morbidity/mortality • ICPC - 1980s • Clinically validated epidemiology in primary care • Now expanded for use in Dutch GP software

  6. … then took on new tasks .Organising Care • Librarianship • MeSH - NLM from around 1900 - Index Medicus & Medline • EMTree - from Elsevier in 1950s - EMBase • Remuneration • ICD9-CM (Clinical Modification) 1980 • 10 x larger than ICD; aimed at US insurance reimbursement • CPT, … • Pathology indexing • SNOMED 1970s to 1990 (SNOMED International) • First faceted or combinatorial system • Topology, morphology, aetiology, function • Specialty Systems • Mostly similar hierarchical systems • ACRNEMA/SDM - Radiology • NANDA, ICNP… - Nursing

  7. Early computer systems Aimed at saving space on early computers 1-5 Mbyte / 10,000 patients Read (1987 - 1995) Hierarchical modelled on ICD9 Detailed signs and symptoms for primary care Purchased by UK government in 1990 Single use Medical Entities Dictionary (MED) Jim Cimino, Hospital support, Columbia, USA OXMIS READ competitor Flat list of codes Derived from empirical data Defunct circa 1999 ICPC Epidemiologically tested, Dutch LOINC For laboratory data DICOM (sdm) For images MEDDRA Adverse Reactions … and then with computersDocumenting/Reporting Care

  8. Unified Medical Language System • US National Library of Medicine • De facto common registry for vocabularies • Metathesaurus • 1.8 million concepts • categorised by semantic net types • Semantic Net • 135 Types • 54 Links • Specialist Lexicon & other language tools

  9. Code Code SUI LUI SUI Code LUI SUI CUI Code LUI SUI Code Unified Medical Language System • Concept Unique Identifiers (CUIs) • Lexical Unique Identifiers (LUIs) • String Unique Identifiers (SUIs)

  10. Term CTV3 Bounty bar Crème egg Kit Kat Mars Bar Milky Way Smarties Twix Snicker UbOVv UbOW2 UbOW3 UbOW4 UbOW5 UbOW6 UbOW7 Ub1pT ?   …but …The Coding of ChocolateAn international conversion guide C-F0811 C-F0816 C-F0817 C-F0819 C-F081A C-F081B C-F081C C-F0058 SNOMED-CT 

  11. 1990s: a Paradigm Shift • Human-Human and Human-Machine to Machine-Machine • From paper to software • From single use to multiple re-use • From coding clerks to direct entry by clinicians • From pre-defined reporting to decision support • Machine mediated human communication vsMachine to machine communication From Books to Software

  12. Clinical research &Decision Support Clinical Terminology Data Entry Clinical Record Data Entry Electronic Health Records GALENOntologies &Descriptionlogics Clinical researchDecision Support &Knowledge Presentation Healthcare Mr Ivor Bigun Dun Roamin Anytown Any country 4431 3654 90273 Where I come from Best Practice Best Practice

  13. Why is it so hard?Fundamental problems:Enumeration doesn’t scale

  14. Predicted Actual The scaling problem: The combinatorial explosion • It keeps happening! • “Simple” brute force solutions do not scale up! • Conditions x sites x modifiers x activity x context • Huge number of terms to author • Software CHAOS

  15. Effort per term What we might accept What we would like Things to build Combination of things to be done & time to do each thing • Terms and forms needed • Increases exponentially • Effort per term or form • Must decrease tocompensate • To give the effectiveness we want • Or might accept

  16. The exploding bicycle • 1972 ICD-9 (E826) 8 • READ-2 (T30..) 81 • READ-3 87 • 1999 ICD-10 ……

  17. 1999 ICD10: 587 codes • V31.22 Occupant of three-wheeled motor vehicle injured in collision with pedal cycle, person on outside of vehicle, nontraffic accident, while working for income • W65.40 Drowning and submersion while in bath-tub, street and highway, while engaged in sports activity • X35.44 Victim of volcanic eruption, street and highway, while resting, sleeping, eating or engaging in other vital activities

  18. Defusing the exploding bicycle:500 codes in pieces • 10 things to hit… • Pedestrian / cycle / motorbike / car / HGV / train / unpowered vehicle / a tree / other • 5 roles for the injured… • Driving / passenger / cyclist / getting in / other • 5 activities when injured… • resting / at work / sporting / at leisure / other • 2 contexts… • In traffic / not in traffic V12.24 Pedal cyclist injured in collision with two- or three-wheeled motor vehicle, unspecified pedal cyclist, nontraffic accident, while resting, sleeping, eating or engaging in other vital activities

  19. Structured Data Entry File Edit Help Cycling Accident What you hit Your Role Activity Location Conceptual Lego… it could be...Goodbye to picking lists…

  20. Intelligent Forms

  21. And generate it in language …a great missed opportunity in SNOMED

  22. hand extremity body abnormal normal Logic as the clips for “Conceptual Lego” gene protein polysacharide cell expression chronic Lung acute infection inflammation bacterium deletion polymorphism ischaemic virus mucus GALEN, SNOMED-RT/CT

  23. Logic as the clips for “Conceptual Lego” “SNPolymorphismofCFTRGene causing Defect in MembraneTransport of Chloride Ion causing Increase in Viscosity of Mucus in CysticFibrosis…” “Handwhich isanatomically normal”

  24. Protein Gene in humans Disease caused by abnormality inFunction ofProtein coded bygene in humans Protein coded bygene in humans Function ofProtein coded bygene in humans Species Genes Build complex representations from modularisedprimitives Function Disease

  25. Structure Function Structure Part-whole Part-whole Function Normalising (untangling) Ontologies

  26. Problem:System may be perfect…butUsers still fallible

  27. X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X User ProblemsInter-rater variability Headcloth Cloth Scarf Model Person Woman Adults Standing Background Brown Blue Chemise Dress Tunics Clothes Suitcase Luggage Attache case Brass Instrument French Horn Horn Tuba

  28. User ProblemsInter-rater variability New codes added per Dr per year • READ CODE Practice A Practice B • Sore Throat Symptom 0.6 117 • Visual Acuity 0.4 644 • ECG General 2.2 300 • Ovary/Broad Ligament Op 7.8 809 • Specific Viral Infections 1.4 556 • Alcohol Consumption 0 106 • H/O Resp Disease 0 26 • Full Blood Count 0 838

  29. RepeatabilityInter-rater reliability • Only ICPC has taken seriously • Originally less than 2000 well tested rubrics with proven inter-rater reliability across five languages • As it has been put into wider use, has grown and is less tested • Includes the delivery software • Confounding, but we can’t ignore it • …but we all have ignored it • GALEN was at least as bad as the rest

  30. Where next?The genome / ’omics explosion • Open Biolological Ontologies (OBO) • Gene Ontology, Gene expression ontology (MGED), Pathway ontology (BioPAX), … • 400+ bio databases and growing • National Cancer Institute Thesaurus • CDISC/BRIDG - Clinical Trials • HL7 genomics model… • … Coming to an Electronic Healthcare Record near you!

  31. Key issue 1: Creating an open community • Centrally created terminologies have succeeded for three reasons • Coercion - use them or don’t get paid • ICD-CM, CPT, MEDDRA, Read 2 • They belonged to the community and were useful or key to software • LOINC, HL7v2, Gene Ontology, Read 1 … • They gave access to a key resource • MeSH, BNF, …

  32. Logic + Web liberates users to “own” the ontology: Open ‘Just-in-time Terminology’ • If you can test the consequences then you can give users the freedom to develop • New compositions • New additions to established lists

  33. …but logic & formal ontology scare users • Require months to train even to a minimal level • People make errors • Even experienced people • Logic inferences can be directly contrary to language maxims • Make logic (& formal ontology) the “assembly language” • Give users high level languages • Ideally built with meta-authoring language construction kit

  34. Before: Intermediate Representations“Close to user forms”

  35. Intermediate Representations“Close to user forms” After: "Open fixation of a fracture of the neck of the left femur" • MAIN fixing ACTS_ON fracture HAS_LOCATION neck of long bone IS_PART_OF femur HAS_LATERALITY left HAS_APPROACH open From 3-6 months training to 3-6 days training

  36. global cycle &work at centre Supports Loosely coupled distributedontology development local cycles: work by users From authoring to meta-authoring& managing QA From 80% central/global effort to 10% central/global effortincluding doing QA

  37. Key issue II:Applications centric development • If it is built for everything it will be fit for nothing! • Must have a way to see if it works • If it is built for just one thing it will not be fit even for that • Change is the only constant • Cannot predict which abstractions needed in advance • Even very large ontologies tend to be missing 50% or more when applied in practice • Compose them when you need them and share

  38. Key issue III:Binding to Applications & the EHR • HL7 v3 + SNOMED = Chaos • Unless we can formalise the mutual constraints • The documentation is beyond human capacity • To write or to understand • Templates/Archetypes + SNOMED = Missed opportunities • Unless we trivialise the terminology … or chaos if we attempt to use the terminology • Requires new tools • Formalisms probably inadequate

  39. Key issue IV:Decision support • Meaningful decision support is still rare • Terminology is not the only problem • But it is one barrier • Terminology/ontology should be the scaffolding • But requires the terminology to be reliable & computable • Inter-rater reliability crucial • Can we afford GIGO for patient management? • Semantics of combined EHR+Terminology must be well defined

  40. Key issue V:Avoiding “Pregacy” • Prebuilt legacy • Obsolescence or known errors built in from the beginning • ≤ .01% of SNOMED coded data to be held in 10 years time has been collected • Fixes now will be less expensive than fixes later • Quality should be dealt with now • Rigorous schemas rigorously adhered to • Conformance and Regression testing • Cannot depend on people to do it right • Must be formally verifiable • It’s software - Let’s have some basic software engineering!

  41. Key issue VI:Empirical data • Need empirical data on • What’s worth doing - what’s essential • Language used by doctors • Terms used • What works • Reliability of terms used - errors made • Effect on Decision Support and other applications • What scales • What are the consequences of design decisions • Effort required to develop software • Usability of development tools • Effort required by users • Usability of interfaces and clinical systems • Where is the science base for our work?

  42. Key issue VII:Human Factors-Helping with a humanly impossible task • Language technology will help • But will always have limitations • Tailored forms will help • But we must beet the combinatorial explosion • Language generation will help • And why aren’t we using it? • Also one of the best means of QA • The best people may not be informaticians, medics, or biomedical scientists GET THEE TO A USABILITY LAB! RECRUIT ETHNOGRAPHERS!

  43. Key Issue VIII:SustainabilityThere is no free lunch! It will have to be paid for somehow • Maintaining any national/regional system will cost money • Regardless of cost of SNOMED membership, there will be costs for • Translation / Localisation / Europeanisation • Coordination with centre and other bodies • Liaison with professions and outreach • Training • Quality assurance locally • Guestimate: €1.5M/year/centre … + “free” professional time • IF we are convinced of the value, trivial in comparison to total healthcare IT costs • … but real money that could be spent on something else • … or be spent anyway in bits inefficiently and ineffectively • … and the issue is really shortage of skilled people

  44. Key issue IX: Internationalisation • What is needed? • Translations? • New ‘concepts’? • How many? • What for? • New Processes • New Authorities • New Quality assurance • Does it meet specific countries’ requirements? • You tell us?

  45. Key issue X: Quality…

  46. Part II:Quality and Quality Assurance: • Quality assuranceProcess • It is the process of that assures • Quality ofProduct • The goal and evidence for the process

  47. Quality against purpose • A controlled vocabulary - Can a configuration engineer can put identifiers behind a form? • Management of identifiers / change management • “Coverage” / “Sensitivity”, Inter-rater reliability • A browsable index and term finding - Can a user can choose a code reliably? • Lexicon of “Terms” • Specificity, Inter-rater reliability • Organisation and structure • Classification/retrieval for epidemiology -Can the codes be transformed into required classifications reliably • Can it be used on its own or must there still be manual coding? • Formal representation: Intelligent interfaces and applications -Can the right inferences be drawn - can applications depend on it? • Sound formalism & schemas consistently applied • Correct definitions and hierarchies

  48. An orthogonal quality assurance framework • Product • Context • Content • Consequences • Maintainability, evolution • Transparency • Human factors • Process • Technical • Organisational • Sustainability • Inter-community acceptance

  49. Some specific issues of Quality:Content • A Priori Coverage just a matter of size • Test against what purposes • Are the constructs there? Are the building blocks there? • Every application needs different abstractions • Leeds to 25% - 50% raw coverage in clinical systems

  50. Consequences: Inferences and Engineering • Ontologies are mathematical theories and engineering artifacts • They are tested by • whether the ‘correct’ inferences follow from them • whether they can be engineered robustly • Manifest in hierarchies and ‘inheritance’ of definitions • First test is whether things are in the right place • It is easier to see commissions than omissions • Classification should help - turns omission into commission

More Related