TY - GEN
T1 - Managing information quality in e-science
T2 - SIGMOD 2007: ACM SIGMOD International Conference on Management of Data
AU - Missier, Paolo
AU - Embury, Suzanne M.
AU - Greenwood, Mark
AU - Preece, Alun
AU - Jin, Binling
PY - 2007
Y1 - 2007
N2 - Data-intensive e-science applications often rely on third-party data found in public repositories, whose quality is largely unknown. Although scientists are aware that this uncertainty may lead to incorrect scientific conclusions, in the absence of a quantitative characterization of data quality properties they find it difficult to formulate precise data acceptability criteria. We present an Information Quality management workbench, called Qurator, that supports data experts in the specification of personal quality models, and lets them derive effective criteria for data acceptability. The demo of our working prototype will illustrate our approach on a real e-science workflow for a bioinformatics application.
AB - Data-intensive e-science applications often rely on third-party data found in public repositories, whose quality is largely unknown. Although scientists are aware that this uncertainty may lead to incorrect scientific conclusions, in the absence of a quantitative characterization of data quality properties they find it difficult to formulate precise data acceptability criteria. We present an Information Quality management workbench, called Qurator, that supports data experts in the specification of personal quality models, and lets them derive effective criteria for data acceptability. The demo of our working prototype will illustrate our approach on a real e-science workflow for a bioinformatics application.
KW - Information quality management
KW - Semantic modelling of information quality
UR - http://www.scopus.com/inward/record.url?scp=35448939702&partnerID=8YFLogxK
U2 - 10.1145/1247480.1247638
DO - 10.1145/1247480.1247638
M3 - Conference contribution
AN - SCOPUS:35448939702
SN - 1595936866
SN - 9781595936868
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 1150
EP - 1152
BT - SIGMOD 2007
Y2 - 12 June 2007 through 14 June 2007
ER -