A version of Geiringer-like theorem for decision making in the environments with randomness and incomplete information

Boris Mitavskiy; Jonathan Rowe; Christopher Cannings

doi:10.1108/17563781211208233

A version of Geiringer-like theorem for decision making in the environments with randomness and incomplete information

Boris Mitavskiy, Jonathan Rowe, Christopher Cannings

Computer Science

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

15 Downloads (Pure)

Abstract

Purpose – The purpose of this paper is to establish a version of a theorem that originated from population genetics and has been later adopted in evolutionary computation theory that will lead to novel Monte-Carlo sampling algorithms that provably increase the AI potential.

Design/methodology/approach – In the current paper the authors set up a mathematical framework, state and prove a version of a Geiringer-like theorem that is very well-suited for the development of Mote-Carlo sampling algorithms to cope with randomness and incomplete information to make decisions.

Findings – This work establishes an important theoretical link between classical population genetics, evolutionary computation theory and model free reinforcement learning methodology. Not only may the theory explain the success of the currently existing Monte-Carlo tree sampling methodology, but it also leads to the development of novel Monte-Carlo sampling techniques guided by rigorous mathematical foundation.

Practical implications – The theoretical foundations established in the current work provide guidance for the design of powerful Monte-Carlo sampling algorithms in model free reinforcement learning, to tackle numerous problems in computational intelligence.

Originality/value – Establishing a Geiringer-like theorem with non-homologous recombination was a long-standing open problem in evolutionary computation theory. Apart from overcoming this challenge, in a mathematically elegant fashion and establishing a rather general and powerful version of the theorem, this work leads directly to the development of novel provably powerful algorithms for decision making in the environment involving randomness, hidden or incomplete information.

Original language	English
Pages (from-to)	36-90
Journal	International Journal of Intelligent Computing and Cybernetics
Volume	5
Issue number	1
DOIs	https://doi.org/10.1108/17563781211208233
Publication status	Published - 2012

Keywords

Decision making
Evolutionary computation theory
Geiringer theorem
Markov chains
Reinforcement learning
Markov processes
Monte Carlo methods
Monte Carlo tree search
Partially observable Markov decision processes
Programming and algorithm theory

Access to Document

10.1108/17563781211208233

http://arxiv.org/abs/1110.4657

Cite this

@article{3c9e5ae5f76c48fa81f650154ee025ea,

title = "A version of Geiringer-like theorem for decision making in the environments with randomness and incomplete information",

abstract = "Purpose – The purpose of this paper is to establish a version of a theorem that originated from population genetics and has been later adopted in evolutionary computation theory that will lead to novel Monte-Carlo sampling algorithms that provably increase the AI potential.Design/methodology/approach – In the current paper the authors set up a mathematical framework, state and prove a version of a Geiringer-like theorem that is very well-suited for the development of Mote-Carlo sampling algorithms to cope with randomness and incomplete information to make decisions.Findings – This work establishes an important theoretical link between classical population genetics, evolutionary computation theory and model free reinforcement learning methodology. Not only may the theory explain the success of the currently existing Monte-Carlo tree sampling methodology, but it also leads to the development of novel Monte-Carlo sampling techniques guided by rigorous mathematical foundation.Practical implications – The theoretical foundations established in the current work provide guidance for the design of powerful Monte-Carlo sampling algorithms in model free reinforcement learning, to tackle numerous problems in computational intelligence.Originality/value – Establishing a Geiringer-like theorem with non-homologous recombination was a long-standing open problem in evolutionary computation theory. Apart from overcoming this challenge, in a mathematically elegant fashion and establishing a rather general and powerful version of the theorem, this work leads directly to the development of novel provably powerful algorithms for decision making in the environment involving randomness, hidden or incomplete information.",

keywords = "Decision making, Evolutionary computation theory, Geiringer theorem, Markov chains, Reinforcement learning, Markov processes, Monte Carlo methods, Monte Carlo tree search, Partially observable Markov decision processes, Programming and algorithm theory",

author = "Boris Mitavskiy and Jonathan Rowe and Christopher Cannings",

year = "2012",

doi = "10.1108/17563781211208233",

language = "English",

volume = "5",

pages = "36--90",

journal = "International Journal of Intelligent Computing and Cybernetics",

issn = "1756-378X",

publisher = "Emerald",

number = "1",

}

TY - JOUR

T1 - A version of Geiringer-like theorem for decision making in the environments with randomness and incomplete information

AU - Mitavskiy, Boris

AU - Rowe, Jonathan

AU - Cannings, Christopher

PY - 2012

Y1 - 2012

N2 - Purpose – The purpose of this paper is to establish a version of a theorem that originated from population genetics and has been later adopted in evolutionary computation theory that will lead to novel Monte-Carlo sampling algorithms that provably increase the AI potential.Design/methodology/approach – In the current paper the authors set up a mathematical framework, state and prove a version of a Geiringer-like theorem that is very well-suited for the development of Mote-Carlo sampling algorithms to cope with randomness and incomplete information to make decisions.Findings – This work establishes an important theoretical link between classical population genetics, evolutionary computation theory and model free reinforcement learning methodology. Not only may the theory explain the success of the currently existing Monte-Carlo tree sampling methodology, but it also leads to the development of novel Monte-Carlo sampling techniques guided by rigorous mathematical foundation.Practical implications – The theoretical foundations established in the current work provide guidance for the design of powerful Monte-Carlo sampling algorithms in model free reinforcement learning, to tackle numerous problems in computational intelligence.Originality/value – Establishing a Geiringer-like theorem with non-homologous recombination was a long-standing open problem in evolutionary computation theory. Apart from overcoming this challenge, in a mathematically elegant fashion and establishing a rather general and powerful version of the theorem, this work leads directly to the development of novel provably powerful algorithms for decision making in the environment involving randomness, hidden or incomplete information.

AB - Purpose – The purpose of this paper is to establish a version of a theorem that originated from population genetics and has been later adopted in evolutionary computation theory that will lead to novel Monte-Carlo sampling algorithms that provably increase the AI potential.Design/methodology/approach – In the current paper the authors set up a mathematical framework, state and prove a version of a Geiringer-like theorem that is very well-suited for the development of Mote-Carlo sampling algorithms to cope with randomness and incomplete information to make decisions.Findings – This work establishes an important theoretical link between classical population genetics, evolutionary computation theory and model free reinforcement learning methodology. Not only may the theory explain the success of the currently existing Monte-Carlo tree sampling methodology, but it also leads to the development of novel Monte-Carlo sampling techniques guided by rigorous mathematical foundation.Practical implications – The theoretical foundations established in the current work provide guidance for the design of powerful Monte-Carlo sampling algorithms in model free reinforcement learning, to tackle numerous problems in computational intelligence.Originality/value – Establishing a Geiringer-like theorem with non-homologous recombination was a long-standing open problem in evolutionary computation theory. Apart from overcoming this challenge, in a mathematically elegant fashion and establishing a rather general and powerful version of the theorem, this work leads directly to the development of novel provably powerful algorithms for decision making in the environment involving randomness, hidden or incomplete information.

KW - Decision making

KW - Evolutionary computation theory

KW - Geiringer theorem

KW - Markov chains

KW - Reinforcement learning

KW - Markov processes

KW - Monte Carlo methods

KW - Monte Carlo tree search

KW - Partially observable Markov decision processes

KW - Programming and algorithm theory

U2 - 10.1108/17563781211208233

DO - 10.1108/17563781211208233

M3 - Article

SN - 1756-378X

VL - 5

SP - 36

EP - 90

JO - International Journal of Intelligent Computing and Cybernetics

JF - International Journal of Intelligent Computing and Cybernetics

IS - 1

ER -

A version of Geiringer-like theorem for decision making in the environments with randomness and incomplete information

Abstract

Keywords

Access to Document

Fingerprint

Cite this