1 / 25

Synchronising Diversely Implemented Databases to Support Administration of Clinical Research

Synchronising Diversely Implemented Databases to Support Administration of Clinical Research. Stuart Anderson Mark Hartswood Conrad Hughes CRISP (Clinical Research Information Systems Project) School of Informatics University of Edinburgh. Many administrative databases, much the same data.

caden
Download Presentation

Synchronising Diversely Implemented Databases to Support Administration of Clinical Research

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Synchronising Diversely Implemented Databases to Support Administration of Clinical Research Stuart Anderson Mark Hartswood Conrad Hughes CRISP (Clinical Research Information Systems Project) School of Informatics University of Edinburgh

  2. Many administrative databases,much the same data Project R#30 Title “A very important study” …

  3. Share the data automatically! Project R#30 Title “A very important study” …

  4. Research organisations • Research Organisations (ROs) in NHS Lothian administering clinical research projects: • NHS Research & Development Office • Welcome Trust Clinical Research Facility (WTCRF) • Scottish Cancer Research Network (SCRN) • Experimental Cancer Medicine Centre (ECMC) • NHS R&D involved in all projects, at least in terms of handling approvals

  5. Project meta-information • Project title • Project start and end dates • Project ethics status and research approval • Project sponsors, funders and finance data • Project personnel • Sponsor and personnel contact details • Patient lists and activity records • … Supposedly the same data, but in different databases

  6. A CRISPy Opportunity • We could: • Reduce data entry costs • Improve data quality • Improve awareness of activity • ...if we find ways to share common data between databases • Suits government “bureaucracy busting”

  7. Options • Looked at commercial solutions: • Some didn’t understand the complexity and risks (e.g. rsync in two directions) • Competent-sounding ones were prohibitively expensive (e.g. £170k per site) • Our solution: DIY approach using free software

  8. Harmony • Document synchronisation framework • By Benjamin Pierce et al.: http://www.seas.upenn.edu/~harmony • Reconciles changes made to multiple disconnected structured documents containing the same data (or subsets thereof - the “view update” problem), e.g. • Internet browser bookmarks files • Calendar applications • Strong theoretical approach with emphasis on provable safety: changes only propagated under well-defined circumstances

  9. Archive (~Old X) RO1’s Document X RO2’s Document X Harmony Updated Document X (RO1) Updated Document X (RO2) Log of changes and conflicts New Archive Overview of Harmony

  10. Harmony operation: Equality After running Harmony:

  11. Harmony operation: Changes

  12. Harmony operation: Conflict

  13. project Conflicts 600 pre-roll-out conflicts to resolve; these examples are fairly trivial

  14. Provenance issues? • Trust • Alignment • Form and meaning • Authority • Control

  15. Trust • Organisations are allowing other participants to write to their databases • Do you trust them? • Alignment of goals • Need to establish confidence in each other’s procedures and practices • Established through regular meetings • Others might know more than you do

  16. Alignment: record identity • Need to identify which records in different databases refer to the same project, funding body or person • Use R&D Number, assigned by NHS R&D, for projects • Creation complicated because projects may initially be entered (without R&D#) by ROs • Deletion complicated because some projects may leave scope but no projects should really be deleted • Funding bodies and persons are handled more loosely • Identity and duplication less critical here

  17. Syncing two database tables Database 1 Database 2 7 7 9 3 3 9 Unique Shared Keys identify records across databases SK SK Establishing identity Synchronising tables across two databases depends on having a unique shared key. This value has to be guaranteed to be unique within each table, and to identify corresponding records uniquely across databases.

  18. Do they have the same meaning? • Start/end date • Approval? Funding? Recruitment? Analysis? • Often driven by reporting requirements • Some fields too contentious, not useful to share, so not included in sync • Option to synchronise separate meanings as separate fields • Get parties to agree on common meanings • Valuable communications exercise among participants

  19. Shared meaning = shared form? • Field types/sizes • Field values • N/A na None No Pending • Funder classification varies from DB to DB • Personnel roles • One column per role or one row per role? • Some adjustment and convergence possible to participants’ databases • Transform data to “standard” on export/import

  20. Authority • Harmony is symmetric: no peer to a sync gets priority • Some information should only be sourced by R&D (responsible for approvals) • Some information is best sourced by ROs (personnel, funding) • But: • Databases involved don’t record sources of information • Strict rules impair usability and make for an unpopular (and unused) system • Solution: • Emphasise audit over control • But provide limited inter-site control at data import stage

  21. Control • Each database contains organisation-specific (and private) information • Some content is just irrelevant to others • Some patient data! • Solution: import/export script run locally by each organisation only exports a chosen subset of tables, rows and columns

  22. Benefits • Data only entered once for all • Everyone takes responsibility for data they’re most expert in • Disagreement (“conflict”) is permitted, and may be resolved through human-human communication • Limited (inter-site) audit operating so expect/hope for responsible behaviour

  23. Conclusion • Real data synchronisation application has been far from the theoretical ideal • Issues of alignment, scope, identity, policy, trust, data quality, form and meaning • Solutions to problems encountered aren’t just technical: organisational engagement and trust have been essential in keeping the task tractable • Rolling out now, so reality yet to be seen • Depends on fair balance of effort and reward among participants

  24. Thank you! conrad.hughes@ed.ac.uk School of Informatics University of Edinburgh

More Related