1 / 55

Semantic Models for CDISC Based Standards and Metadata Management

Semantic Models for CDISC Based Standards and Metadata Management. Presented by Kerstin Forsberg, R&D, AstraZeneca Frederik Malfait, IMOS Consulting and Hoffmann-La Roche. Key Message. Things converge to create new and unique opportunities.

lovie
Download Presentation

Semantic Models for CDISC Based Standards and Metadata Management

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Semantic Models for CDISC Based Standards and Metadata Management Presented by Kerstin Forsberg, R&D, AstraZeneca Frederik Malfait, IMOS Consulting and Hoffmann-La Roche

  2. Key Message • Things converge to create new and unique opportunities. • The coverage and maturity of existing CDISC standards. • The establishment of these standards within the industry. • The use of these standards as a foundation for metadata driven systems. • The upcoming role of semantic web standards and linked data principles.

  3. Two real world use of semantic web standards and linked data principles

  4. Today’s Situation • “Not if and when, but how” to best adopt CDISC based data standards is becoming the leading question. • We see a variety of CDISC standards at different levels of maturity, not linked together and published in different formats. • Sponsors are faced with challenges on all levels: architecture, process, and application.

  5. An Emerging Insight • The CDISC standards is all about the meaning of what is studied in the biological and clinical reality (often referred to as concepts). • How these concepts are represented as data elements from protocol to submission, and beyond. • We are dealing with semantics and metadata for biomedical and clinical research knowledge and data. • “Put semantic into the semantic”Use semantic web standards and linked data principles.

  6. RDF Triples • Resource Description Framework (RDF) A general model of how any piece of data, and representations of knowledge, can be expressed as so called triples. subject predicate object (or value) Stockholm type place Stockholm capital Sweden Stockholm subject Port cities in Sweden Stockholm areaCode “+46-8” “http://en.wikipedia.org/wiki/Stockholm” primaryTopic Stockholm

  7. RDF Triples • Triples can be aggregated into graphs with subject and objects as nodes, and predicates as arcs. type City capital Sweden Stockholm subject Port cities in Sweden areaCode “+46-8” “http://en.wikipedia.org/wiki/Stockholm” primaryTopic

  8. RDF Triples • Graphs of triples can be extended across different sources and for different purpose. type City Country type CDISC capital Sweden Stockholm subject Port cities in Sweden subject CDISC InterchangeEU 2012 areaCode “+46-8” Gothenburg “http://en.wikipedia.org/wiki/Stockholm” primaryTopic

  9. RDF Triples • RDF Schema and the RDF based Web Ontology Language (OWL) add a typing mechanism to classify subjects and objects into hierarchies. Thing subClass Place subClass subClass Organization Event Adm.Area subClass subClass type subClass type City BusinessEvent Country type CDISC capital Sweden type Stockholm subject Port cities in Sweden subject CDISC InterchangeEU 2012 areaCode “+46-8” Gothenburg “http://en.wikipedia.org/wiki/Stockholm” primaryTopic

  10. RDF Triples • Google, Bing (Microsoft) and Yahoo use OWL publish a joint vocabulary. Thing subClass Place subClass subClass Organization Event Adm.Area subClass subClass subClass City BusinessEvent Country Exempelhttp://schema.org/City

  11. RDF Triples • NCI use OWL to publish NCI Thesaurus (the source for CDISC’s CT:s) in an RDF/XML format. Hematology Test LaboratoryProcedure CDISC LaboratoryTest NameTerminology CDISC LaboratoryTest Terminology subClass Concept inSubset Has NCIHDParent Concept inSubset HemoglobinMeasurement definition “A quantitative measurement of the amount of hemoglobin present in a sample.” NCI Thesaurushttp://ncicb.nci.nih.gov/download/evsportal.jsp

  12. Linked Open Data Cloud http://lod-cloud.net/ Richard Cyganiak and AnjaJentzsch

  13. Real world use • Two examples of how sponsors have started to use semantic web standards and apply linked data principles. • AstraZeneca: • Integrative Informatics (i2) program establishing the components to let a Linked Data cloud grow across AstraZeneca R&D • Roche • Implementing an internally built MDR.

  14. AZ R&D Linked Data cloud http://research.data.astrazeneca.com/id/clinicalstudy/D5890C00003 http://research.vocab.astrazeneca.com/uDisease/DOID/2841

  15. Roche Biomedical MDR Schema Architecture Production Partial / Future CDISC Standards MetadataManagement Knowledge Management

  16. Roche Biomedical MDR Content • External content • SDTM 1.2, SDTMIG 3.1.2 • NCI Thesaurus, CDISC Controlled Terminology • Integrated Data Standards, Roche and Genentech • Safety and every Roche TA, ~ 2000 data elements • Data Collection and Data Tabulation • Value level metadata • Lab measurements, Unit conversions, Questionnaires • Looking at metadata for • SDTM Conformance Checking, Biomarker (HGNC), …

  17. Roche Biomedical MDR Information Architecture Transformation Models Study & Project Level Metadata Roche GlobalData Standards CDISCData Standards ADaM PRM CDASH SDTM Define +++ BRIDG +++ SHARE +++ NCI Thesaurus +++ Data Element Concepts +++ BiomedicalDomain Model Production Partial Study Design Data Collection Data Tabulation Data Analysis Regulatory Submission Future

  18. Roche Biomedical MDR System Architecture Content Management Content Publishing Metadata Repository Single Point of Access

  19. Roche Biomedical MDR Value Proposition • Current • Integrated knowledge, metadata, and data standards management • System independent information asset • Single point of access • Future • Leverage the SOA interface to create a framework for integrated metadata driven workflow • Integrate MDR and Component Based Authoring capabilities (study design, protocol, CSR)

  20. Key Message • We now see all of these things converge to create new and unique opportunities. • The coverage and maturity of existing CDISC standards. • The establishment of these standards within the industry at large. • The use of these standards as a foundation for metadata driven systems. • The upcoming role of semantic web standards and linked data principles.

  21. TopBraid Semantic Modeling Workbench

  22. Roche Global Data Standards Browser

  23. Publishing and Item Level Versioning

  24. Using Web Services to Export to…

  25. Oh well, if you really want that Excel sheet

More Related