TY - GEN
T1 - Telcordia's database reconciliation and data quality analysis tool
AU - Caruso, Francesco
AU - Cochinwala, Munir
AU - Ganapathy, Urna
AU - Lalk, Gail
AU - Missier, Paolo
PY - 2000
Y1 - 2000
N2 - This demonstration illustrates how a comprehensive database reconciliation tool can provide the ability to charactenze data-quality and data-reconciliation issues in complex real-world applications. Telcordia's data reconciliation and data quality analysis tool includes rapid generation of appropriate pre-processing and matching rules applied to a training set created from samples of the data. Once tuned, the appropriate rules can be applied efficiently to the complete data sets. The tool uses a modular JavaBeans-based architecture that allows for customized matching functions and iterative runs that build upon previously learned information. Telcordia has been able to provide significant insights to clients who recognize that they have data reconciliation problems but cannot determine root causes effectively when using currently available off-the-shelf tools. A description of the analysis of a duplicate-record problem in a set of taxpayer databases is included in this report to illustrate the effective use of the tool.
AB - This demonstration illustrates how a comprehensive database reconciliation tool can provide the ability to charactenze data-quality and data-reconciliation issues in complex real-world applications. Telcordia's data reconciliation and data quality analysis tool includes rapid generation of appropriate pre-processing and matching rules applied to a training set created from samples of the data. Once tuned, the appropriate rules can be applied efficiently to the complete data sets. The tool uses a modular JavaBeans-based architecture that allows for customized matching functions and iterative runs that build upon previously learned information. Telcordia has been able to provide significant insights to clients who recognize that they have data reconciliation problems but cannot determine root causes effectively when using currently available off-the-shelf tools. A description of the analysis of a duplicate-record problem in a set of taxpayer databases is included in this report to illustrate the effective use of the tool.
UR - http://www.scopus.com/inward/record.url?scp=0013117789&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:0013117789
SN - 1558607153
SN - 9781558607156
T3 - Proceedings of the 26th International Conference on Very Large Data Bases, VLDB'00
SP - 615
EP - 618
BT - Proceedings of the 26th International Conference on Very Large Data Bases, VLDB'00
T2 - 26th International Conference on Very Large Data Bases, VLDB 2000
Y2 - 10 September 2000 through 14 September 2000
ER -