1 / 17

An Insight To Data Warehouse Testing

An Insight To Data Warehouse Testing. By Rohit Agrawal Jan 2013. To Start With. Is the data correct…? Would table Cutomer_fact get loaded ever if Control Table has the following..?

zuwena
Download Presentation

An Insight To Data Warehouse Testing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. An Insight To Data Warehouse Testing By Rohit Agrawal Jan 2013

  2. To Start With.. Is the data correct…? Would table Cutomer_fact get loaded ever if Control Table has the following..? What if the DW load frequency is daily and ETL execution time exceeds 24 hrs…?

  3. Agenda • Introduction • Testing Process • Focus Points • Challenges • Best Practices

  4. Introduction

  5. Introduction To Data Warehouse Testing • What ? • Exhaustive testing of a Data Warehouse during its design and on an on-going basis • Why ? • Organisation decisions depend entirely on the Enterprise data and the data should be of utmost quality • Where ? • Starting from Source till Reporting • When ? • Designing phase till Production

  6. Testing Process

  7. Testing Process

  8. Focus Points

  9. Focus Points

  10. Focus Point-Underlying Data All Customer data from different bank branches are loaded Insert data of Customers with age greater than 60 in senior citizen category Error out the records if Customer does not belong to the Bank

  11. Focus Point-Underlying Data (Cont..) Check February has 29 days in leap years only A row in stage with AccountID=123 has the expected data in DW Zip Code is of 6 digits, State names are properly abbreviated

  12. DW Components Impact of Executing complex queries during data load/ Rendering reports in 30 sec Source system scheduling conflicts Incremental loads as per Audit columns like LastUpdateDate/Incremental Flag

  13. Challenges Voluminous data from heterogeneous sources Data Quality not assured at source Business knowledge. Organisation-wide Enterprise data knowledge may not be feasible Very high cost of quality .This is because, any defect slippage will translate into high cost for the organisation The heterogeneous data sources will be updated asynchronously

  14. Normal Vs. Data Warehouse testing

  15. Best Practices “If you torture data sufficiently, it will confess to almost anything.”

  16. References White Paper on Data Warehouse Testing- By Manoj Philip Mathen Adventures with Testing BI/DW Application- http://msdn.microsoft.com www.google.com A Comprehensive Approach to Data Warehouse Testing- By Matteo Golfarelli

  17. Thank You..!

More Related