New data mining and calibration approaches to the assessment of water treatment efficiency

M Bieroza, A Baker, Jonathan Bridgeman

Research output: Contribution to journalArticle

27 Citations (Scopus)


For the first time, the application of different robust data mining techniques to the assessment of water treatment performance is considered. Principal components analysis (PCA), parallel factor analysis (PARAFAC), and a self-organizing map (SOM) were used in the analysis of multivariate data characterising organic matter (OM) removal at 16 water treatment works. Decomposed fluorescence data from PCA. PARAFAC and SOM were used as input to calibrate fluorescence data with OM concentrations using step-wise regression (SR), partial least squares (PLS), multiple linear regression (MLR), and neural network with back-propagation algorithm (BPNN). The best results were obtained with combined PARAFAC/PLS and SOM/BPNN. Both the numerical accuracy and feasibility of the adopted solutions were compared and recommendations on the use of the above techniques for fluorescence data analysis are presented. (C) 2011 Civil-Comp Ltd and Elsevier Ltd. All rights reserved.
Original languageEnglish
Pages (from-to)126-135
Number of pages10
JournalAdvances in Engineering Software
Issue number1
Publication statusPublished - 1 Feb 2012


  • Pattern recognition
  • Data mining
  • Multivariate analysis
  • Artificial neural networks
  • Fluorescence spectroscopy
  • Organic matter removal


Dive into the research topics of 'New data mining and calibration approaches to the assessment of water treatment efficiency'. Together they form a unique fingerprint.

Cite this