Evolving Efficient Learning Algorithms for Binary Mappings

John Bullinaria

doi:10.1016/S0893-6080(03)00093-5

Evolving Efficient Learning Algorithms for Binary Mappings

John Bullinaria

Computer Science

Research output: Contribution to journal › Article

16 Citations (Scopus)

Abstract

Gradient descent training of sigmoidal feed-forward neural networks on binary mappings often gets stuck with some outputs totally wrong. This is because a sum-squared-error cost function leads to weight updates that depend on the derivative of the output sigmoid which goes to zero as the output approaches maximal error. Although it is easy to understand the cause, the best remedy is not so obvious. Common solutions involve modifying the training data, deviating from true gradient descent, or changing the cost function. In general, finding the best learning procedures for particular classes of problem is difficult because each usually depends on a number of interacting parameters that need to be set to optimal values for a fair comparison. In this paper I shall use simulated evolution to optimise all the relevant parameters, and come to a clear conclusion concerning the most efficient approach for learning binary mappings.

Original language	English
Pages (from-to)	793-800
Number of pages	8
Journal	Neural Networks
Volume	16
Issue number	5-6
DOIs	https://doi.org/10.1016/S0893-6080(03)00093-5
Publication status	Published - 1 Jul 2003

Access to Document

10.1016/S0893-6080(03)00093-5

Cite this

@article{c0b703c386da4ae594cbdc112894b357,

title = "Evolving Efficient Learning Algorithms for Binary Mappings",

abstract = "Gradient descent training of sigmoidal feed-forward neural networks on binary mappings often gets stuck with some outputs totally wrong. This is because a sum-squared-error cost function leads to weight updates that depend on the derivative of the output sigmoid which goes to zero as the output approaches maximal error. Although it is easy to understand the cause, the best remedy is not so obvious. Common solutions involve modifying the training data, deviating from true gradient descent, or changing the cost function. In general, finding the best learning procedures for particular classes of problem is difficult because each usually depends on a number of interacting parameters that need to be set to optimal values for a fair comparison. In this paper I shall use simulated evolution to optimise all the relevant parameters, and come to a clear conclusion concerning the most efficient approach for learning binary mappings.",

author = "John Bullinaria",

year = "2003",

month = jul,

day = "1",

doi = "10.1016/S0893-6080(03)00093-5",

language = "English",

volume = "16",

pages = "793--800",

journal = "Neural Networks",

publisher = "Elsevier",

number = "5-6",

}

TY - JOUR

T1 - Evolving Efficient Learning Algorithms for Binary Mappings

AU - Bullinaria, John

PY - 2003/7/1

Y1 - 2003/7/1

N2 - Gradient descent training of sigmoidal feed-forward neural networks on binary mappings often gets stuck with some outputs totally wrong. This is because a sum-squared-error cost function leads to weight updates that depend on the derivative of the output sigmoid which goes to zero as the output approaches maximal error. Although it is easy to understand the cause, the best remedy is not so obvious. Common solutions involve modifying the training data, deviating from true gradient descent, or changing the cost function. In general, finding the best learning procedures for particular classes of problem is difficult because each usually depends on a number of interacting parameters that need to be set to optimal values for a fair comparison. In this paper I shall use simulated evolution to optimise all the relevant parameters, and come to a clear conclusion concerning the most efficient approach for learning binary mappings.

AB - Gradient descent training of sigmoidal feed-forward neural networks on binary mappings often gets stuck with some outputs totally wrong. This is because a sum-squared-error cost function leads to weight updates that depend on the derivative of the output sigmoid which goes to zero as the output approaches maximal error. Although it is easy to understand the cause, the best remedy is not so obvious. Common solutions involve modifying the training data, deviating from true gradient descent, or changing the cost function. In general, finding the best learning procedures for particular classes of problem is difficult because each usually depends on a number of interacting parameters that need to be set to optimal values for a fair comparison. In this paper I shall use simulated evolution to optimise all the relevant parameters, and come to a clear conclusion concerning the most efficient approach for learning binary mappings.

UR - http://www.scopus.com/inward/record.url?scp=0038355078&partnerID=8YFLogxK

U2 - 10.1016/S0893-6080(03)00093-5

DO - 10.1016/S0893-6080(03)00093-5

M3 - Article

C2 - 12850036

VL - 16

SP - 793

EP - 800

JO - Neural Networks

JF - Neural Networks

IS - 5-6

ER -

Evolving Efficient Learning Algorithms for Binary Mappings

Abstract

Access to Document

Fingerprint

Cite this