Measuring population-based completeness for Single Nucleotide Polymorphism (SNP) databases

Nurul A. Emran, Suzanne Embury, Paolo Missier

Research output: Contribution to journalArticlepeer-review

Abstract

Completeness of data sets is an important aspect of data quality as observed in biological domain such as Single Nucleotide Polymorphism (SNP). In order to decide on the acceptability of the data sets of concerned, biologists need to measure the completeness of the data sets. One type of data completeness measure is population-based completeness (PBC) that has been identified as relevant to deal with data completeness problem in this domain. In this paper, the implementation of PBC measurement will be presented as a system prototype involving real SNP data sets. The result of the analysis on the practical problems encountered during the implementation of PBC will also be presented.

Original languageEnglish
Pages (from-to)173-182
Number of pages10
JournalStudies in Computational Intelligence
Volume551
DOIs
Publication statusPublished - 2014

Bibliographical note

Publisher Copyright:
© Springer International Publishing Switzerland 2014.

Keywords

  • Data completeness measurement
  • Population-based completeness (PBC)
  • Single Nucleotide Polymorphism (SNP)

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Measuring population-based completeness for Single Nucleotide Polymorphism (SNP) databases'. Together they form a unique fingerprint.

Cite this