1 / 2

What Is Data Quality

What Is Data Quality

Andrew77
Download Presentation

What Is Data Quality

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. What Is Data Quality? Data Quality is a term that describes the state of information. This can be quantitative or qualitative information. There are several definitions of data quality, but in general data is considered high-quality when it is fit for its intended use. Here are three ways to measure data quality. - Look for completeness, consistency, and currency. Measures of data quality Data quality is often measured by the degree to which it conforms to pre-defined business rules. For example, if a company assigns IDs based on an employee's last name, date of hire, and job classification, the data will be valid if the data matches the rules. However, it is a challenge to monitor the quality of data that has changed over time. In some cases, data can become inaccurate over time, such as when the date format changes from the U.S. to the European Union. Poor data can hinder business opportunities, leaving gaps in your customer base's understanding. In one recent example, Nissan Europe realized that it was using unreliable customer data, which was spread across disconnected systems, making it difficult to generate personalized advertising. As a result, the company improved its Data Quality. Poor data can also waste time and effort. Reconciling disparate data sets manually can be time-consuming and inefficient.

  2. Measures of data completeness To be able to create accurate scoring models, data completeness is a critical factor. Measures of data completeness check for changes in missing data, and they assess whether model features are consistent with expected ranges. When data contain values that are out of range or in the wrong format, a model may not be able to recognize them correctly. Measures of data completeness can be generated from a variety of sources, such as data registers for health care. They can include demographic data compiled by statistical authorities, which allows for the estimation of incidence rates. In addition, clinical registers can be used to compare data from health-care providers to external benchmarks. These external benchmarks allow for more accurate comparisons of quality indicators among different institutions. Measures of data consistency Data quality is an increasingly important issue in today's digital world. While data consistency is an important aspect of data quality, it's not the same thing as data quality. Rather, data consistency is the degree to which items in a dataset are consistent with one another. When the data is inconsistent, it can lead to inaccurate and incomplete datasets. There are several dimensions of data quality: completeness, uniqueness, timeliness, consistency, and granularity. Using these measurements helps us ensure our data is reliable. Visit here Data consistency can be measured using statistics. One measure is the standard deviation, which measures the deviation of each measurement from the mean. A larger standard deviation indicates greater dispersion of data, and a smaller value means more consistency. Measures of data currency Data currency is a term that describes how valuable data is to a business. It can be used to quantify how important a data set is to a business and is important when planning for disaster recovery, business continuity, or data management. It often involves relative valuation and prioritization and is important because it helps identify the cost of replacing data, which can be a critical component of business strategy. There are two types of measures of data currency: static and dynamic. Dynamic formats display different currencies and unit values in each cell. Static formats are useful when the currency values are the same, but not when the units are different.

More Related