Verifiable and Redactable Medical Documents

Jordan Brown (jbrown6@gatech.edu) & Douglas M. Blough Verifiable and Redactable Medical Documents

Problem It is difficult and time consuming to distribute different views of verifiable medical records. We want to make the process more manageable and efficient.

Proposed Process Data Provider Data Consumers Institutional Boundaries Intermediary

Related Works • Application of the work seen in paper by Bauer, Blough, and Cash (ACM 2008) • Other similar approaches – (CDA Documents) • Wu et al (JMS 2010) • Slamanigand Stingl (IEEE 2009) • Slamanig and Rass (Springer 2010)

Continuity of Care Document (CCD)

Cryptographic Primitives Concepts For Building Merkle Hash Trees Public Key Signatures Use secret key in combination with message to sign Send signed message and original message Using public key on signed message returns the original message If actual message matches calculated message the signature verifies • Hash Function • One-way function • Variable length input • Maps to fixed length output • Statistically unlikely to find/calculate collisions • Computationally cheap compared to signatures

Merkle Hash Tree (MHT)

MHT Continued • Redaction • Remove unused data • Keep Hashes • Prune Tree • Verification • Reconstruct remainder of tree • Verify the root signature 1 2

Multi-Level MHTs … … … Root … … … Multi-level signatures Comprehensive document across multiple sources

CCD Contained in MHT

Continued

Performance Results

Overview Setup Dataset 206 Records Average element count of 190 Maximum element count was 828 Average extraction time was 312 ms Optimizations have since been made (~10%) Remaining results found with permutations of a single record • All times (CPU) • Eclipse 3.6.2 with Java SE 1.6 • Windows 7 PC with 2.4 GHz Intel Core i5 and 4GB RAM

Data Provider Overhead • Not included in time • Process single document • Extract relevant items • Included • Create leaves • Form tree • Sign root • Structure construction much more efficient than extracting elements Tree Construction

Intermediary Overhead • Setup • Create multi-level tree with 20 sub-trees • Process • Randomly redact from even distribution of trees • Prune after each redaction • Very fast operation Tree Redaction

Data Consumer Overhead • Not included: • Document reconstruction • Included: • Reconstruct hashes • Verify root signature • Cost comparable with construction • Document reconstruction expensive Tree Verification

Conclusions &Future Additions • Computationally Efficient Verifiable Redactable Data • Dependencies – Bauer et al. (ACM 2009) • Redaction Tracking – Izu et al. (2005) • Pseudonymization – Haber et al. (ACM 2008) • Sanitization (Invisibility) – Miyazaki et al. (ACM 2006) • Distributed Approach to Research Data Access Tracking and Control (joint work with Emory University Center for Clinical Informatics)

Questions/Comments?

Verifiable and Redactable Medical Documents

Verifiable and Redactable Medical Documents

Presentation Transcript

Verifiable Programming

Verifiable Databases and RAM Programs

How to write your medical documents?

Developing Verifiable Concurrent Software

Publicly verifiable authenticated encryption

Scalable and Scalably-Verifiable Sequential Synthesis

Verifiable Mixnets

f or Medical Documents

Verifiable Classroom Voting

Measurable, Reportable and Verifiable

Asynchronous Verifiable Secret Sharing

Verifiable Costs

Verifiable Programming

Verifiable visual cryptography

Verifiable Classroom Voting

Verifiable Oblivious Storage

How to write your medical documents?

Objectively Verifiable Indicators

Text Mining of Medical Documents

Verifiable Mixing Protocol

Translation of Your Medical Documents