I recommend this article as an entry point into a research program on
information quality:
Stvilia, B., Gasser, L., Twidale, M. B. and Smith, L. C. (2007), A
framework for information quality assessment. J. Am. Soc. Inf. Sci., 58:
1720–1733. doi:10.1002/asi.20652 Available at:
http://stvilia.cci.fsu.edu/wp-content/uploads/2011/03/IQAssessmentFramework.pdf
One cannot manage information quality (IQ) without first being able to
measure it meaningfully and establishing a causal connection between the
source of IQ change, the IQ problem types, the types of activities
affected, and their implications. In this article we propose a general
IQ assessment framework. In contrast to context-specific IQ assessment
models, which usually focus on a few variables determined by local
needs, our framework consists of comprehensive typologies of IQ
problems, related activities, and a taxonomy of IQ dimensions organized
in a systematic way based on sound theories and practices. The framework
can be used as a knowledge resource and as a guide for developing IQ
measurement models for many different settings. The framework was
validated and refined by developing specific IQ measurement models for
two large-scale collections of two large classes of information objects:
Simple Dublin Core records and online encyclopedia articles.
Bob
On 5/6/2015 4:32 PM, Diane Hillmann wrote:
You might try this blog post, by Thomas Bruce, who was my co-author on an
earlier article (referred to in the post):
https://blog.law.cornell.edu/voxpop/2013/01/24/metadata-quality-in-a-linked-data-context/
Diane
On Wed, May 6, 2015 at 5:24 PM, Kyle Banerjee <kyle.baner...@gmail.com>
wrote:
On May 6, 2015, at 7:08 AM, James Morley <james.mor...@europeana.eu>
wrote:
I think a key thing is to determine to what extent any definition of
'completeness' is actually a representation of 'quality'. As Peter says,
making sure not just that metadata is present, but also checking that it
conforms to rules, is a big step towards this.
This.
Basing quality measures too heavily on the presence of certain data points
or on the volume of data is fraught with peril. In past experiments, I
found it useful to look for structure and syntax patterns that indicate
good or bad quality, as well as to consider record sources.
Also keep in mind that any scoring system is to some extent arbitrary, so
you don't want to read more into what it generates than is appropriate.
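To make the idea concrete, here's a minimal sketch of combining presence
checks with syntax-pattern checks on Simple Dublin Core-style records. The
field names, regexes, and equal weighting are illustrative assumptions, not
a standard; real rules would come from your own collection's profile, and
the resulting score is exactly the kind of arbitrary heuristic described
above:

```python
import re

# Illustrative pattern checks for a few Simple Dublin Core-style fields.
# These patterns and field choices are assumptions for the sketch, not
# part of any standard validation profile.
PATTERN_CHECKS = {
    "date": re.compile(r"^\d{4}(-\d{2}(-\d{2})?)?$"),     # W3CDTF-ish date
    "identifier": re.compile(r"^(https?://|doi:|urn:)\S+$"),
    "language": re.compile(r"^[a-z]{2,3}$"),              # ISO 639-ish code
}

# Values that are "present" but carry no information.
PLACEHOLDER = re.compile(r"^(unknown|n/?a|none|test)$", re.IGNORECASE)

def score_record(record):
    """Return a rough 0.0-1.0 quality score for a dict-shaped record.

    A field scores only if it is present, non-placeholder, AND matches
    its expected syntax pattern -- i.e. completeness alone is not
    treated as quality.
    """
    checks = 0
    passed = 0
    for field, pattern in PATTERN_CHECKS.items():
        checks += 1
        value = (record.get(field) or "").strip()
        if value and not PLACEHOLDER.match(value) and pattern.match(value):
            passed += 1
    return passed / checks if checks else 0.0

record = {"date": "2015-05-06",
          "identifier": "doi:10.1002/asi.20652",
          "language": "eng"}
print(score_record(record))  # 1.0
```

A record with `{"date": "unknown"}` would score 0.0 here: the date is
present but placeholder, and the other fields are missing, which is the
difference between measuring completeness and measuring conformance.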
Kyle