1 / 13

The LongNow

The LongNow. Why FERPA?. The Sequel:. Key Problems. Entity Resolution. Regulatory Hurdles. Entity Resolution. LEA: 6002007 George Castillo 9/30/1997 M L 906773502. LEA: 4907023 Jorge Castillo-Estrada 9/30/1997 M L 437659887. Name Counts.

rowena
Download Presentation

The LongNow

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The LongNow

  2. Why FERPA?

  3. The Sequel:

  4. Key Problems Entity Resolution Regulatory Hurdles

  5. Entity Resolution • LEA: 6002007 • George Castillo • 9/30/1997 • M • L • 906773502 • LEA: 4907023 • Jorge Castillo-Estrada • 9/30/1997 • M • L • 437659887

  6. Name Counts • There are ~55,000 unique first names among students in Arkansas and ~40,000 last names. • Approximately 20% of Arkansas students share both the same first and last name with another student.

  7. More Data Issues • There are 4,026 students in Arkansas that share an SSN with at least one other student in the state. • Between August and January, 874 student transfers to other schools resulted in an SSN change. • Between August and January, an additional 1,018 students changed their SSN—we have records for only 300 of these changes. • Between August and January, 21,255 students moved to another district in the state—only 18,986 students were marked as “withdrawn.”

  8. The Knowledge Base Approach “Indicative” information from multiple data sources is stored and merged into an “equivalence class” for each entity, using both fuzzy and logical associations. Knowledge base identifiers are used to manage the references. Knowledge Base Bob Smith, Barton Elementary Robert Smith, Barton Elementary Fuzzy Match Logical Match (Drop/Enroll) Bob Smith, Wilson Elementary

  9. Two Agencies, Two Regulations HIPPA FERPA

  10. Trusted Broker A trusted broker maintains a cross reference table, encoding the identifiers for various agencies and for various representations of the entities. Trusted Broker ACHI ADE

  11. Encoded Links The trusted broker can provide multiple agencies with encoded versions of the (hidden) knowledge base identifiers, protecting all future data requests. ACHI ADE Trusted Broker

  12. Data Requests The trusted broker translates encoded links between agencies for data requests and no personally identifying information needs to be exchanged. ADE ACHI ED3508 Score: 385 ED4297 Score: 242 ED8516 Score: 417 What are the test scores for the following students? AC0236 AC0651 AC1327 Trusted Broker AC0236 ↔ED4297 AC0651 ↔ED8516 AC1327 ↔ED3508 Brokered Result 1 AC0236 Score: 242 AC0651 Score: 417 AC1327 Score: 385 Brokered Result 2 Score: 242 Score: 385 Score: 417 Brokered Result 3 Average Score: 348

  13. Result Options The trusted broker may deliver results between agencies in a variety of ways, without exchanging personally identifying information. Trusted Broker Brokered Result 1 AC0236 Score: 242 AC0651 Score: 417 AC1327 Score: 385 Brokered Result 2 Score: 242 Score: 385 Score: 417 Brokered Result 3 Average Score: 348 Individual level results with encoded links (safe, encoded) Individual level results without links, random (safe, anonymous) Aggregated results (safe, anonymous)

More Related