1 / 32

TheDataWeb: A New Framework for Data Integration and Dissemination

Learn about TheDataWeb, a comprehensive framework for data integration and dissemination that helps users make informed decisions in business or government. This framework includes HotReports, DataFerrett, and TheDataWeb Browser, catering to different user needs and providing easy access to relevant data. With statistical intelligence and collaboration at its core, TheDataWeb offers a smart data-networking solution for handling diverse datasets efficiently.

ejoshua
Download Presentation

TheDataWeb: A New Framework for Data Integration and Dissemination

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. TheDataWeb: a New Framework for Data Cavan Capps, Chief TheDataWeb Applications Branch Data Integration Division Howard Hogan, Director Demographic Programs Directorate

  2. “In God we Trust -- -- for everything else we need data” Michael Bloomberg .... on making decisions for business or government

  3. Data = A number in a context • “10.0%” is NOT data • “The 2005 poverty rate for the U.S. is 10.0%” is data • “The 2006 poverty rate for the U.S. as collected by the ACS for the housing unit population is 10.0%” is more data • Information on questionnaire, sample size, rotation, imputation, weighting, etc., is still more data

  4. The Wider Context • One datum is seldom useful • Analysis requires putting the data point in context • Related variables • Other geographies • Other time periods

  5. Dissemination Challenges • How to present the right data with the right context to meet users actual needs • How to ensure that the most recent and most correct data are displayed

  6. Dissemination Challenges • Different issues • Different audiences Solution = Different views of the same data

  7. A Three Part Approach • HotReports • DataFerrett • TheDataWeb

  8. HotReports • Targeted a local decision-makers with limited time and statistical background • Bring together relevant variables for local areas • Topically oriented • Updated dynamically • Can be designed to support decision-making • Guided use of statistical data

  9. Relatively Quick to Build • Drag & drop layout • Statistically smart • Gives an analyst a chance to layout data for a problem • Creates information

  10. Relatively Quick to Build • 50% of time is designing HotReport (finding right data and laying it out) • 20% of time is creating HotReport • 30% of time is reviewing and fact checking

  11. Typical HotReport Users • Regional economic developers • Emergency planning and coordination • Public health planning • Grant eligibility • Performance indicators

  12. DataFerrett: a data browser • Targeted at sophisticated data users • Brings together multiple data sets • Updated dynamically • Brings data context along with the numbers

  13. DataFerrett: a data browser • Speeds analysis • Data manipulation • Advanced tabulation and descriptive statistics • Mapping and business graphics using statistical rules • Adding regressions and other advanced statistics

  14. TheDataWeb Browser Data set collections are in folders

  15. TheDataWeb Browser Highlighted data sets can be searched

  16. TheDataWeb Browser . Variables returned from search

  17. TheDataWeb Browser Multiple kinds of datasets supported

  18. TheDataWeb Browser Before selecting, examine variable documentation with questions, universes and response labels or ranges

  19. TheDataWeb Browser Selected variables are tabulated in the spreadsheet controlled by statistical rules

  20. TheDataWeb Browser Mapping, and business graphics are available for all data

  21. DataFerret Users • Federal and state government • (.gov) = 7,876 users • (.us) = 5,923 user accounts • Education (.edu) = 42,828 user accounts • Non-profit (.org) = 10,792 user accounts • Private companies (.com) = 100,384 • Press - Consulting Retail • Marketing - Insurance and Financial • Pharmaceuticals

  22. TheDataWeb • “TheDataWeb” is the software engineering that make DataFerrett and HotReports possible

  23. A Smart Data-Networking Framework • Capacity to handle different kinds of data in the same environment or framework • Empowered by statistical intelligence • documentation • statistical usage rules • data integration rules • Stores the data one time, use it many times • More data in the network the more useful

  24. TheDataWeb Framework

  25. TheDataWeb Framework

  26. TheDataWeb Framework

  27. TheDataWeb Framework

  28. TheDataWeb Framework

  29. Based on Collaboration • “Open Source” statistical partnership with Australian Bureau of Statistics and other interested agencies • Based on statistical analysts providing statistical rules • Based on analysts creating a presentation and analytical review

  30. Useful Links • http://dataferrett.census.gov • www.thedataweb.org • www.thedataweb.org/twiki • www.thedataweb.org/forum

  31. Contact Cavan Cappscavan.paul.capps@census.gov 301-763-3778 work 866-437-0171 toll free 301-908-6216 cell DataFerrett HelpDesk: Toll Free: 866-437-0171 DataFerrettTeam Email:dsd_ferrett@census.gov

More Related