1 / 48

Software for data management: The contribution of Stata

Software for data management: The contribution of Stata. Dr Karen Robson, Senior Research Fellow, The Geary Institute, University College Dublin, Ireland. Getting acquainted with Stata. StataCorp develops and distributes Stata, software for statistical analysis.

Download Presentation

Software for data management: The contribution of Stata

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Software for data management: The contribution of Stata Dr Karen Robson, Senior Research Fellow, The Geary Institute, University College Dublin, Ireland

  2. Getting acquainted with Stata • StataCorp develops and distributes Stata, software for statistical analysis. • Stata is available for Windows, Macintosh, and Unix computers. • Stata is used by medical researchers, biostatisticians, epidemiologists, economists, sociologists, political scientists, geographers, psychologists, social scientists, and other research professionals needing to analyze data. Gaining popularity in the social and medical sciences • Particularly useful for handling large-scale longitudinal data

  3. Stata SE (for large data sets) • can analyze datasets with as many as 32,766 variables, and the only limit on observations is the amount of RAM on your computer • can handle string variables with a maximum length of 244 characters • can handle matrices up to 11,000 x 11,000. • requires at least 512 megabytes of RAM and 80 megabytes of disk space

  4. Stata/Intercooled (the standard one) • can analyze datasets with as many as 2,047 variables, and the only limit on observations is the amount of RAM on your computer • can handle string variables with a maximum length of 244 characters • can handle matrices up to 800 x 800.

  5. Small Stata • A smaller, student version of Stata (for educational purchases only)

  6. Stata MP • The fastest version of Stata (for dual-core and multicore/multiprocessor computers) • Stata/MP is the fastest and largest version of Stata.

  7. Resources • StataCorp website (www.stata.com)

  8. Resources • StataCorp website (www.stata.com) • Timberlake website (www.timberlake.co.uk)

  9. Resources • StataCorp website (www.stata.com) • Timberlake website (www.timberlake.co.uk) • UCLA Stata “portal” (http://www.ats.ucla.edu/stat/)

  10. Resources • StataCorp website (www.stata.com) • Timberlake website (www.timberlake.co.uk) • UCLA Stata “portal” (statcomp.ats.ucla.edu/stata) • Statalist (www.hsph.harvard.edu/statalist)

  11. Resources • StataCorp website (www.stata.com) • Timberlake website (www.timberlake.co.uk) • UCLA Stata “portal” (statcomp.ats.ucla.edu/stata) • Statalist (www.hsph.harvard.edu/statalist) • Stata Journal (www.stata-journal.com)

  12. As well, available Dec 2008

  13. Launching Stata • OS contingent • Default window preferences • Window preferences fully adjustable • Auto memory set

  14. Comparing with SPSS • Start up differences

  15. Comparing with SPSS • Start up differences • With data file open

  16. Comparing with SPSS • Start up differences • With data file open • Viewing data • data viewer, data editor

  17. Comparing with SPSS • Start up differences • With data file open • Viewing data • data viewer, data editor • Viewing variables

  18. Comparing with SPSS • Start up differences • With data file open • Viewing data • data viewer, data editor • Viewing variables • Viewing output/commands • output window buffer, log files

  19. Comparing with SPSS • Start up differences • With data file open • Viewing data • data viewer, data editor • Viewing variables • Viewing output/commands • output window buffer, log files • Syntax and “do files”

  20. Variable window INPUT Stata command window Do file Pull-down menu Review window Computation RESULTS Output window Log file

  21. User driven Free STBs Dedicated journal Web active Memory requirements Backward compatible Change! SPSS dominance Orientated to writing syntax/code Pull-down windows debate! Now in version 8 forward Advantages and disadvantages of Stata

  22. Easier code Easier data handling Clarity of operations/ feedback Results table function Before version 8, limited graphics Now, complex graphics Variable labelling Editing of output Advantages and disadvantages of Stata

  23. Nested/master do files Flexible terminology Setting types of data Interactive help Switch output (log file) on/off Copy and paste Advantages and disadvantages of Stata

  24. Overview of analytic techniques • Too numerous to mention! • Comprehensive manuals • A selection: • All types of regression • Survey package • Epidemiological package • Multilevel modelling • Time series functions • Cluster analysis

  25. Data • Data files .dta • Stat/Transfer software

  26. Stata – using wide and long file formats • Wide file formats (everything you add goes to the right of the existing data) • Long file formats (everything you add goes underneath the existing data)

  27. MERGE APPEND Data 1 Data 2 Data 1 Data 2

  28. _merge values Data 1 (indi) ‘master’ 1 Data 2 (indj) ‘using’ 3 2

More Related