Generalized RLS Approach to the Training of Neural Networks

Yong Xu; KW Wong; CS Leung

doi:10.1109/TNN.2005.860857

Research output: Contribution to journal › Article

41 Citations (Scopus)

Abstract

Recursive least squares (RLS) is an efficient approach to neural network training. However, the classical RLS algorithm has no explicit decay term in its energy function, which leads to unsatisfactory generalization in the trained networks. In this paper, we propose a generalized RLS (GRLS) model that includes a general decay term in the energy function for the training of feedforward neural networks. In particular, four weight decay functions are discussed: the quadratic weight decay, the constant weight decay, and the newly proposed multimodal and quartic weight decays. With the GRLS approach, not only is the generalization ability of the trained networks significantly improved, but more unnecessary weights are also pruned, yielding a compact network. Furthermore, the computational complexity of GRLS remains the same as that of the standard RLS algorithm. The advantages and tradeoffs of the different decay functions are analyzed and then demonstrated with examples. Simulation results show that our approach meets the design goals: improving the generalization ability of the trained network while obtaining a compact network.
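To make the idea of "a decay term in the energy function" concrete, the following is a minimal sketch of such an objective, not the paper's GRLS recursion. It assumes the energy is a sum of squared errors plus a weighted sum of per-weight penalties. The names `energy`, `quadratic_decay`, `absolute_decay`, and `alpha` are hypothetical, and reading the "constant weight decay" as an absolute-value penalty is an assumption; the multimodal and quartic functions are not defined in the abstract and are left out.

```python
from typing import Callable, Sequence


def quadratic_decay(w: float) -> float:
    """Quadratic penalty on a single weight: w**2."""
    return w * w


def absolute_decay(w: float) -> float:
    """Absolute-value penalty |w| (an assumed reading of 'constant' decay)."""
    return abs(w)


def energy(
    errors: Sequence[float],
    weights: Sequence[float],
    decay: Callable[[float], float],
    alpha: float,
) -> float:
    """Sum of squared errors plus alpha times the summed decay penalty."""
    data_term = sum(e * e for e in errors)
    decay_term = alpha * sum(decay(w) for w in weights)
    return data_term + decay_term


# Toy values chosen only for illustration.
errors = [0.5, -0.5]
weights = [1.0, -2.0, 0.0]

e_quadratic = energy(errors, weights, quadratic_decay, alpha=0.5)
e_absolute = energy(errors, weights, absolute_decay, alpha=0.5)
```

Passing the decay function as a parameter mirrors the abstract's point that one general framework can host several penalties; only the `decay` argument changes between the two calls.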

Original language	English
Pages (from-to)	19-34
Number of pages	16
Journal	IEEE Transactions on Neural Networks
Volume	17
Issue number	1
DOIs	https://doi.org/10.1109/TNN.2005.860857
Publication status	Published - 1 Jan 2006

Keywords

recursive least square (RLS) algorithm
neural network
extended Kalman filtering (EKF)
weight decay

Cite this

@article{0b22313e8f8d48ee92f7067a807dcb74,
  title = "Generalized RLS Approach to the Training of Neural Networks",
  abstract = "Recursive least square (RLS) is an efficient approach to neural network training. However, in the classical RLS algorithm, there is no explicit decay in the energy function. This will lead to an unsatisfactory generalization ability for the trained networks. In this paper, we propose a generalized RLS (GRLS) model which includes a general decay term in the energy function for the training of feedforward neural networks. In particular, four different weight decay functions, namely, the quadratic weight decay, the constant weight decay and the newly proposed multimodal and quartic weight decay are discussed. By using the GRLS approach, not only the generalization ability of the trained networks is significantly improved but more unnecessary weights are pruned to obtain a compact network. Furthermore, the computational complexity of the GRLS remains the same as that of the standard RLS algorithm. The advantages and tradeoffs of using different decay functions are analyzed and then demonstrated with examples. Simulation results show that our approach is able to meet the design goals: improving the generalization ability of the trained network while getting a compact network.",
  keywords = "recursive least square (RLS) algorithm, neural network, extended Kalman filtering (EKF), weight decay",
  author = "Yong Xu and KW Wong and CS Leung",
  year = "2006",
  month = jan,
  day = "1",
  doi = "10.1109/TNN.2005.860857",
  language = "English",
  volume = "17",
  pages = "19--34",
  journal = "IEEE Transactions on Neural Networks",
  publisher = "Institute of Electrical and Electronics Engineers (IEEE)",
  number = "1",
}

TY - JOUR
T1 - Generalized RLS Approach to the Training of Neural Networks
AU - Xu, Yong
AU - Wong, KW
AU - Leung, CS
PY - 2006/1/1
Y1 - 2006/1/1

N2 - Recursive least square (RLS) is an efficient approach to neural network training. However, in the classical RLS algorithm, there is no explicit decay in the energy function. This will lead to an unsatisfactory generalization ability for the trained networks. In this paper, we propose a generalized RLS (GRLS) model which includes a general decay term in the energy function for the training of feedforward neural networks. In particular, four different weight decay functions, namely, the quadratic weight decay, the constant weight decay and the newly proposed multimodal and quartic weight decay are discussed. By using the GRLS approach, not only the generalization ability of the trained networks is significantly improved but more unnecessary weights are pruned to obtain a compact network. Furthermore, the computational complexity of the GRLS remains the same as that of the standard RLS algorithm. The advantages and tradeoffs of using different decay functions are analyzed and then demonstrated with examples. Simulation results show that our approach is able to meet the design goals: improving the generalization ability of the trained network while getting a compact network.

AB - Recursive least square (RLS) is an efficient approach to neural network training. However, in the classical RLS algorithm, there is no explicit decay in the energy function. This will lead to an unsatisfactory generalization ability for the trained networks. In this paper, we propose a generalized RLS (GRLS) model which includes a general decay term in the energy function for the training of feedforward neural networks. In particular, four different weight decay functions, namely, the quadratic weight decay, the constant weight decay and the newly proposed multimodal and quartic weight decay are discussed. By using the GRLS approach, not only the generalization ability of the trained networks is significantly improved but more unnecessary weights are pruned to obtain a compact network. Furthermore, the computational complexity of the GRLS remains the same as that of the standard RLS algorithm. The advantages and tradeoffs of using different decay functions are analyzed and then demonstrated with examples. Simulation results show that our approach is able to meet the design goals: improving the generalization ability of the trained network while getting a compact network.

KW - recursive least square (RLS) algorithm
KW - neural network
KW - extended Kalman filtering (EKF)
KW - weight decay
UR - http://www.scopus.com/inward/record.url?scp=33144485895&partnerID=8YFLogxK
U2 - 10.1109/TNN.2005.860857
DO - 10.1109/TNN.2005.860857
M3 - Article
C2 - 16526473
VL - 17
SP - 19
EP - 34
JO - IEEE Transactions on Neural Networks
JF - IEEE Transactions on Neural Networks
IS - 1
ER -
