DQ Assessment and Measurement (and not a trace of trust (nor trust in traces) and no truth either)
Gunter, Cinzia, Vipul, Felix, and one other that did not come and two others from the Scientific DB that chose to ignore us
Assessment vs. Measurement
Measurement: More objective
Assumption of soundness and completeness
Past results (Traces)
Includes data & metadata
The data itself
Subsets / partitions
Data-oriented (objective in nature?)
We need some
Which criteria need them?
Size of the world for completeness
verifiability
OLD SLIDE: An assessment-oriented classification
Offline; data-centric; objective
verifiability
OLD SLIDE: How does it fit? Not really…
GeneDB: Ability is often lacking
E.g. Schema Evolvability
Noisy data intrinsic to GeneDB
Scientific DB: Announcements
DBLP: Unknown (less in summer)
Trust and Reputation
Identification of domain hinders assessment
GeneDB unable to assess (or define)
DBLP: Mentions scripts
Response Time
Input to DQ Assessment (Comparison)
Criteria for DBLP
Schema and data stability
DQ criteria
High Quality Integrated Genomic Data into your face!
Accuracy: Quotient of the number of correct values in the source and the overall number of values in the source.
Amount of data: Size of result.
Believability: Degree to which the information is accepted as correct.
Completeness: Quotient of the number of response items and the number of real world items.
Concise representation: Degree to which the structure of the information matches the information itself.
Consistent representation: Degree to which the structure of the information conforms to that of other sources.
Customer support: Amount and usefulness of online support through text, email, phone etc.
Documentation: Amount and usefulness of documents with meta information.
Interpretability: Degree to which the information conforms to the technical ability of the consumer.
Latency: Amount of time until the first information reaches the user.
IQ-Criteria
Price: Monetary charge per query.
Relevancy: Degree to which the information satisfies the user's need.
Reliability: Degree to which the user can trust the information.
Reputation: Degree to which the information or its source is in high standing.
Response time: Amount of time until complete response reaches the user.
Security: Degree to which information is passed privately from user to information source and back.
Timeliness: Age of information.
Understandability: Degree to which the information can be comprehended by the user.
Value-added: Amount of benefit the use of the information provides.
Verifiability: Degree and ease with which the information can be checked for correctness.
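The two criteria defined as quotients (Accuracy and Completeness) can be computed directly once their inputs are known. The sketch below is only an illustration under stated assumptions, not something taken from the slides: the function names, the `is_correct` predicate, and the sample values are hypothetical, and counting only distinct response items is a choice made here. Note that `real_world_size` has to be supplied from outside the source, which is the "size of the world for completeness" problem mentioned earlier.

```python
def accuracy(source_values, is_correct):
    """Accuracy: correct values in the source / all values in the source.

    `is_correct` is a hypothetical predicate standing in for whatever
    reference or check decides that a value is correct.
    """
    values = list(source_values)
    if not values:
        raise ValueError("accuracy is undefined for an empty source")
    correct = sum(1 for v in values if is_correct(v))
    return correct / len(values)


def completeness(response_items, real_world_size):
    """Completeness: response items / real-world items.

    Assumption: duplicates in the response are counted once.
    `real_world_size` must come from outside the source itself.
    """
    if real_world_size <= 0:
        raise ValueError("real_world_size must be positive")
    return len(set(response_items)) / real_world_size


if __name__ == "__main__":
    # Hypothetical data: four stored values, one of them invalid.
    valid_bases = {"A", "C", "G", "T"}
    print(accuracy(["A", "C", "G", "X"], lambda v: v in valid_bases))

    # Hypothetical data: two distinct items returned, four assumed to exist.
    print(completeness(["g1", "g2", "g2"], 4))
```

With these made-up inputs the accuracy is 3 of 4 values and the completeness is 2 of 4 items. The other criteria in the list are stated as degrees or amounts rather than quotients, so they would need an agreed scale before anything comparable could be computed.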