Telcordia's database reconciliation and data quality analysis tool

Francesco Caruso, Munir Cochinwala, Urna Ganapathy, Gail Lalk, Paolo Missier

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

Abstract

This demonstration illustrates how a comprehensive database reconciliation tool can provide the ability to characterize data-quality and data-reconciliation issues in complex real-world applications. Telcordia's data reconciliation and data quality analysis tool includes rapid generation of appropriate pre-processing and matching rules applied to a training set created from samples of the data. Once tuned, the appropriate rules can be applied efficiently to the complete data sets. The tool uses a modular JavaBeans-based architecture that allows for customized matching functions and iterative runs that build upon previously learned information. Telcordia has been able to provide significant insights to clients who recognize that they have data reconciliation problems but cannot determine root causes effectively when using currently available off-the-shelf tools. A description of the analysis of a duplicate-record problem in a set of taxpayer databases is included in this report to illustrate the effective use of the tool.
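The workflow outlined in the abstract (pre-process, tune matching rules on a labeled sample, then apply them to the complete data) can be illustrated with a minimal Python sketch. This is not Telcordia's implementation, which the abstract describes only as JavaBeans-based. Every name here (`preprocess`, `similarity`, `tune_threshold`, `find_duplicates`), the sample records, the candidate thresholds, and the use of `difflib.SequenceMatcher` as the matching function are assumptions made for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def preprocess(record):
    """Hypothetical pre-processing rule: lowercase, drop punctuation, collapse spaces."""
    cleaned = {}
    for field, value in record.items():
        kept = "".join(ch for ch in value.lower() if ch.isalnum() or ch.isspace())
        cleaned[field] = " ".join(kept.split())
    return cleaned


def similarity(a, b):
    """Hypothetical matching function: mean string similarity over shared fields."""
    fields = sorted(set(a) & set(b))
    if not fields:
        return 0.0
    return sum(SequenceMatcher(None, a[f], b[f]).ratio() for f in fields) / len(fields)


def tune_threshold(labeled_pairs, candidates=(0.6, 0.7, 0.8, 0.9)):
    """Pick the candidate threshold that classifies the most training pairs correctly."""
    def correct(threshold):
        return sum(
            (similarity(preprocess(a), preprocess(b)) >= threshold) == is_duplicate
            for a, b, is_duplicate in labeled_pairs
        )
    return max(candidates, key=correct)


def find_duplicates(records, threshold):
    """Apply the tuned rule to every pair of records in the complete data set."""
    cleaned = [preprocess(r) for r in records]
    return [
        (i, j)
        for i, j in combinations(range(len(cleaned)), 2)
        if similarity(cleaned[i], cleaned[j]) >= threshold
    ]


# Invented training sample: (record, record, is_duplicate)
training_pairs = [
    ({"name": "John A. Smith", "city": "Cairo"},
     {"name": "john a smith", "city": "CAIRO"}, True),
    ({"name": "John Smith", "city": "Cairo"},
     {"name": "Mary Jones", "city": "Rome"}, False),
]

# Invented "complete" data set
records = [
    {"name": "John A. Smith", "city": "Cairo"},
    {"name": "JOHN A SMITH", "city": "cairo"},
    {"name": "Mary Jones", "city": "Rome"},
]

threshold = tune_threshold(training_pairs)
duplicates = find_duplicates(records, threshold)  # list of index pairs into records
```

Comparing every pair of records is quadratic and only workable for small samples. A tool meant to run efficiently over complete data sets would presumably restrict comparisons to candidate groups first, but the abstract does not say how this is done.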

Original language: English
Title of host publication: Proceedings of the 26th International Conference on Very Large Data Bases, VLDB'00
Pages: 615-618
Number of pages: 4
Publication status: Published - 2000
Event: 26th International Conference on Very Large Data Bases, VLDB 2000 - Cairo, Egypt
Duration: 10 Sept 2000 to 14 Sept 2000

Publication series

Name: Proceedings of the 26th International Conference on Very Large Data Bases, VLDB'00

Conference

Conference: 26th International Conference on Very Large Data Bases, VLDB 2000
Country/Territory: Egypt
City: Cairo
Period: 10/09/00 to 14/09/00

ASJC Scopus subject areas

  • Hardware and Architecture
  • Information Systems
  • Software
  • Information Systems and Management
