Editing near the source - PowerPoint PPT Presentation

Statistical data editing near the source using cloud computing concepts
Download
1 / 9

  • 84 Views
  • Uploaded on
  • Presentation posted in: General

Statistical data editing near the source using cloud computing concepts George Pongas, Christine Wirtz -Eurostat MSIS 2011 – 23-25 May 2011, Luxembourg. Editing near the source. Accelerates speed of final delivery to users and institutions Checks and imputations are near the respondent

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.

Download Presentation

Editing near the source

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Editing near the source

Statistical data editing near the source using cloud computing conceptsGeorge Pongas, Christine Wirtz -EurostatMSIS 2011 – 23-25 May 2011, Luxembourg

MSIS 2011 – 23-25 May 2011, Luxembourg


Editing near the source

Editing near the source

  • Accelerates speed of final delivery to users and institutions

  • Checks and imputations are near the respondent

  • Data knowledge is frequently more profound in the primary collector institutions

  • Logical proximity is better than physical: Data and application sharing


Cloud and soa in few lines

Cloud and SOA in few Lines

  • Separates ownership and usage of data storage computer power and application development and execution (cloud)

  • Cloud variants are IaaS, PaaS, SaaS

  • Cloud architectures are:

    • Public

    • Private

    • Mixed

    • Community

  • Based on web technologies and independent software components to interlink on demand (SOA)


Data editing in eurostat

Data Editing in Eurostat

  • High volume of arrivals (>60.000 per year)

  • Format heterogeneity

  • Data checking absorbs substantial volume of human resources

  • Erroneous data imply communications with MS

  • Eurostat as a rule does not Impute…

  • Interest to have a Common distributed solutions


Eurostat s web enabled system for editing editing building block ebb

Eurostat’s web enabled system for editing(Editing building block (Ebb)

  • Completely Metadata Driven

  • Exists in 2 versions:

    • PC version

    • Web-based version

  • Technologies used:

    • ANTLR

    • Java

    • Tomcat or Weblogic

    • Hibernate

    • Postgres or Oracle


Ebb information flow

EBB Information Flow


Implementation details

Implementation Details

EBB is written using a set of Web services of the following types:

  • Administration

  • Program

  • Job


Ebb functionalities

EBB functionalities

  • Support of categorical, text and numeric variables

  • Separation of programmer and user interfaces

  • Conditional and unconditional rules

  • Multi-record rules

  • Deterministic imputation

  • Use of auxiliary data

  • File operations

  • Special functions (unicity, duplication checks ...)

  • Outliers (HB, Sigma Gap, Terror)

  • Input/output of data/metadata

  • Reporting


Usage until now

Usage until now

  • Embedded in SAS (for microdata editing)

  • To distribute to data providers as standalone version

    • FDI (foreign direct investments)

    • ITS (international trade in services)

    • SBS (structural business statistics)

    • CVTS (continuous vocational training survey),

    • AES (adult education survey)


  • Login