Standardization of Data Dictionaries: A Case Study from the Pharmacogenomics Research Network. Introduction. Methods. Results. Conclusions. Background.
Qian Zhu, PhD; Robert R. Freimuth, PhD; Matthew J. Durski, MA; Jyotishman Pathak, PhD; Christopher G. Chute, MD, DrPH
Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, MN
The variety of disease phenotypes that are studied in the PGRN, as well as differences in clinical systems in use at each PGRN site, lead to data that is heterogeneous, non-standardized, and institution-specific. This not only hinders aggregation of data among sites that are collaborating on a given study, but also it complicates or prevents secondary use of the data (e.g., in meta-analyses).
~ 1/3 of the variables were selected for the initial analysis
Example: Medication History, Drug List