Jan Cheetham, PhD (DoIT Academic Technology) and Barry Radler, PhD (UW Institute on Aging). Using Spreadsheets in Research – Best Practices. RDS Brownbag, October 8, 2013.
Spreadsheets in Research
• Usage is ubiquitous
• Problems with documentation, reproducibility
• Example:
  • Reinhart and Rogoff paper scandal (April 2013)
  • Cited 567 times since 2010
  • Referenced in stimulus/austerity debates
Recommendations for Spreadsheets
• Save a RAW copy of each spreadsheet
• Take a class, become a power spreadsheet user
  • See resources at: http://researchdata.wisc.edu/manage-your-data/spreadsheets/
• Consider moving to other software for analysis
• Make your spreadsheets more:
  • Machine readable
  • Human readable
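One way to act on the "save a RAW copy" recommendation is sketched below in Python. It is only an illustration: the function name `save_raw_copy`, the `raw` folder, and the idea of recording a SHA-256 checksum are assumptions, not something the slides prescribe.

```python
import hashlib
import os
import shutil
import stat
from pathlib import Path


def save_raw_copy(source, raw_dir):
    """Copy a spreadsheet into raw_dir, mark the copy read-only,
    and return (path_to_copy, sha256_hex_digest).

    Hypothetical helper for illustration only.
    """
    source = Path(source)
    raw_dir = Path(raw_dir)
    raw_dir.mkdir(parents=True, exist_ok=True)

    target = raw_dir / source.name
    if target.exists():
        # Never overwrite an existing raw copy.
        raise FileExistsError(f"raw copy already exists: {target}")

    shutil.copy2(source, target)

    # Drop write permission so the raw file is not edited by accident.
    os.chmod(target, stat.S_IREAD | stat.S_IRGRP | stat.S_IROTH)

    # The checksum lets you confirm later that the raw file is unchanged.
    digest = hashlib.sha256(target.read_bytes()).hexdigest()
    return target, digest
```

Calling `save_raw_copy("survey.xlsx", "raw")` (a hypothetical file name) before any cleaning or analysis leaves an untouched original to return to, and the digest can be stored alongside the documentation.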
Human-readable
A separate document that describes and explains:
• The data set
• Variables (field names, column headings, etc.)
• Data values (codes, data labels, etc.)
• Type of values and formats
• What data values mean; code lists
• Formulas and analysis steps
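The separate document described above can be as simple as a data dictionary saved next to the spreadsheet. The following Python sketch writes one as CSV. The column set (`FIELDS`) and the three example variables are made up for illustration and do not come from the presentation.

```python
import csv
import io

# Columns of the data dictionary (an assumed, minimal layout).
FIELDS = ["variable", "description", "type", "format", "codes"]

# Hypothetical variables, for illustration only.
VARIABLES = [
    {"variable": "subject_id", "description": "Unique participant identifier",
     "type": "integer", "format": "", "codes": {}},
    {"variable": "visit_date", "description": "Date of clinic visit",
     "type": "date", "format": "YYYY-MM-DD", "codes": {}},
    {"variable": "smoker", "description": "Current smoking status",
     "type": "categorical", "format": "",
     "codes": {"1": "Yes", "2": "No", "9": "Missing"}},
]


def write_data_dictionary(variables, out):
    """Write one row per variable to the open text file `out`."""
    writer = csv.DictWriter(out, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for var in variables:
        row = dict(var)
        # Flatten the code list into "code=label" pairs so it fits one cell.
        row["codes"] = "; ".join(
            f"{code}={label}" for code, label in var["codes"].items()
        )
        writer.writerow(row)


if __name__ == "__main__":
    buffer = io.StringIO()
    write_data_dictionary(VARIABLES, buffer)
    print(buffer.getvalue())
```

Plain CSV is used here so the dictionary itself stays readable by both people and software; formulas and analysis steps would still need to be described in prose.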
One Tool: Colectica for Excel
[Screenshot with callouts: Variable, Dataset, Codes, Categories]
Colectica for Excel
• Documents and identifies:
  • Datasets
  • Variables
  • Code lists
  • Categories
• Adds more metadata than Excel supports on its own
• Colectica for Excel is based on DDI
  • www.ddialliance.org
Colectica for Excel
• Enables reuse of metadata through unique identification
• Variables can share the same code list
• Records audit trail through versioning
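The general idea behind these three points can be sketched in a few lines of Python. This is a conceptual illustration only, not Colectica's data model or API: the `CodeList` and `Variable` classes, their fields, and the example codes are all hypothetical.

```python
import uuid
from dataclasses import dataclass, field, replace


@dataclass(frozen=True)
class CodeList:
    """A reusable list of (code, label) pairs with a stable identifier."""
    name: str
    codes: tuple
    identifier: str = field(default_factory=lambda: str(uuid.uuid4()))
    version: int = 1

    def revise(self, codes):
        # A revision keeps the identifier and bumps the version,
        # so earlier versions stay available as an audit trail.
        return replace(self, codes=tuple(codes), version=self.version + 1)


@dataclass(frozen=True)
class Variable:
    """A variable refers to its code list by identifier, not by copy."""
    name: str
    code_list_id: str


# Hypothetical example: two variables sharing one code list.
yes_no = CodeList("yes_no", (("1", "Yes"), ("2", "No")))
smoker = Variable("smoker", yes_no.identifier)
drinker = Variable("drinker", yes_no.identifier)

# Adding a code creates version 2; version 1 is left untouched.
yes_no_v2 = yes_no.revise(yes_no.codes + (("9", "Missing"),))
history = {cl.version: cl for cl in (yes_no, yes_no_v2)}
```

Because both variables point at the same identifier, a correction to the code list is made once rather than in every column that uses it.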
Publish Documentation
• Generate Codebooks from Excel
  • PDF, HTML, Word, XSL-FO
• Publish DDI 3.1 XML
Resources
• Colectica for Excel: www.colectica.com/software/colecticaforexcel
• DataUp: dataup.cdlib.org/
Resources
• Research Data Services: researchdata.wisc.edu
“Practice good data management!”