330 likes | 490 Views
Secondary data: collection and use. Presented by. Arnout van Delden Methodologist Statistics Netherlands. Secondary data. specific. Secondary Sources. Registers. Base registers. Statistical registers. FUTURE. PRESENT. PAST. Official Statistics. Post-war II Identifiers
E N D
Presented by Arnout van Delden Methodologist Statistics Netherlands
specific Secondary Sources Registers Base registers Statistical registers
FUTURE PRESENT PAST
Official Statistics Post-war II • Identifiers • Concepts: variable, units, time • Population registers • Administrative Census • Denmark (1981), Finland (1991), Netherlands (2001)
Use(EU/EFTA Survey 2010) • Frame • Observations • Auxiliary data • Model parameters • Data quality
In sum • Many types of data sources • Long history • Potentially very useful
Existence • Data protection act • Organisation registers data under DPA
Existence • Data protection act • Organisation registers data under DPA
In Sum • Explore potential data sources • Access: legal uses and public consent
Exploration phase Source Meta
Processing phase: data useful? March ‘04 Dec ‘04 Turnover VAT data Turnover Sample Survey Turnover Sample Survey
Administrative data: • Many merits • Explore • More than adding up
Access Set of base registers • data re-used • report errors • 1 contact person in NSI • large dependency users
2 Can I use of a specific data source?What ‘steps’ are needed? • Existence • Access • Fitness for use • Fall back scenario’s • Processing
Processing: data integration • Linkage • Micro-integration • Imputation/weighting • Macro-integration
Fall back scenarios Quarterly turnover from Survey en Admin data • Risk only data from month 1 and 2 • Model: missing units predicted from respondents • Indicator: how many and which units to call
Fall back scenarios • Risk analyses • Strategy fall back scenario • Obtain missing data elsewhere? • Model-based approach • Inform users • Postpone publication
Processing: robust estimation • Medical expenses (volume, prices) • Coding system for medical treatments • First coding in 2008 • Coding slightly revised 2009 • New coding system 2010
Fitness foruse Data
Concluding remarks • Merits • Reduction response burden • Detailed & Longitudinal • Longitudinal data • Consequences • Relations with administrative data holder • Prone to changes