1 / 18

Query your data with QD

Query your data with QD. B. van der Holt / W.L.J. van Putten University Hospital Rotterdam - Daniel Department of Trials & Statistics / HOVON Data Center. Introduction. Correct data are crucial for any analysis;

stanley
Download Presentation

Query your data with QD

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Query your data with QD B. van der Holt / W.L.J. van Putten University Hospital Rotterdam - Daniel Department of Trials & Statistics / HOVON Data Center Stata Users Meeting, Maastricht

  2. Introduction • Correct data are crucial for any analysis; • Ideally data should come cleaned and error free, but this is hardly ever the case. • In clinical trial data may be distributed among several files. • One way to check is to use if-expressions Stata Users Meeting, Maastricht

  3. Example • Data from trial with over 400 patients • t1 Randomisation/On study • t2 PA form • t3 Treatment form ** • t4 Evaluation form ** • t5 Off treatment form • t6 Follow-up form ** • t7 Infection form ** • t8 Side effects form ** Stata Users Meeting, Maastricht

  4. Checks for t1.dta Stata Users Meeting, Maastricht

  5. Checks for t3.dta Stata Users Meeting, Maastricht

  6. Disadvantage of if-expressions • Errors in data of one patient may be found anywhere in the ouput: not neatly sorted by patient number. This will hamper easy admission of the data. (Remember that in this trial there were more than 400 patients) Stata Users Meeting, Maastricht

  7. Solution QD.ADO Stata Users Meeting, Maastricht

  8. QD (Query Data) • Is a program by which one may query the quality and validity of data; • Developed for clinical trials, but not restricted to it; • Define a series of tests on the data in dta-files; • The data may be in one ore more dta-files, linked by a common index variable (a unique case-identifier); Stata Users Meeting, Maastricht

  9. QD (Query Data) - continued • Generates formatted reports of variable- values with messages associated with the test; • And, very important: the reports are organised by case (e.g. patient), and not listwise by test! All tests that are true for a single case are displayed together. Stata Users Meeting, Maastricht

  10. QD output for patient 2 Stata Users Meeting, Maastricht

  11. QD output for patient 5 Stata Users Meeting, Maastricht

  12. Syntax of QD.ADO • The general syntax of the program is: qd options • The commands that were used to generate the previous output will be used for illustration Stata Users Meeting, Maastricht

  13. * QDCHECK.DO - 1 • qd clear • removes all previously defined QD parameters (global macro’s) from memory • qd files(t1 t2 t3) • required. Contains the (valid) names of one or more dta-files. The same dta-file may appear more than once. The files are numbered by QD in the natural order of appearance in the string as file 1, file 2, ... Stata Users Meeting, Maastricht

  14. * QDCHECK.DO - 2 • qd by(hvnr) • required. Defines an ordering on file 1 and variable by which the records in the different files are linked • qd title(HOVON xx NHL Checks on data) • This title will be printed on top of the output of the data of each case Stata Users Meeting, Maastricht

  15. * QDCHECK.DO - 3 Stata Users Meeting, Maastricht

  16. * QDCHECK.DO - 4 Stata Users Meeting, Maastricht

  17. * QDCHECK.DO - 5 • qd log(qdcheck.log) • defines the name (path) of the file that will serve as a log file to which the output will be written. When not defined, the output will only be shown on the screen • qd print(if hvnr<=5) • this is the instruction to display the results of all the tests on the screen and, if defined, to a log-file. Stata Users Meeting, Maastricht

  18. Further information • findit qd • email: holt@stah.azr.nl Stata Users Meeting, Maastricht

More Related