Markov decision processes with iterated coherent risk measures

Shanyun Chu; Yi Zhang

doi:10.1080/00207179.2014.909947

Markov decision processes with iterated coherent risk measures

Shanyun Chu, Yi Zhang

Mathematics

Research output: Contribution to journal › Article › peer-review

7 Citations (Scopus)

Abstract

This paper considers a Markov decision process in Borel state and action spaces with the aggregated (or say iterated) coherent risk measure to be minimised. For this problem, we establish the Bellman optimality equation as well as the value and policy iteration algorithms, and show the existence of a deterministic stationary optimal policy. The cost function, while being allowed to be unbounded from below (in the sense that its negative part needs be bounded by some nonnegative real-valued possibly unbounded weight function), can be arbitrarily unbounded from above and possibly infinitely valued.

Original language	English
Pages (from-to)	2286-2293
Number of pages	8
Journal	International Journal of Control
Volume	87
Issue number	11
DOIs	https://doi.org/10.1080/00207179.2014.909947
Publication status	Published - 2014

Bibliographical note

Publisher Copyright:
© 2014 Taylor & Francis.

Keywords

Iterated coherent risk measure
Markov decision process
Optimality equation

ASJC Scopus subject areas

Control and Systems Engineering
Computer Science Applications

Access to Document

10.1080/00207179.2014.909947

Cite this

@article{955aa55c37ef42adaece03a0fb69ddea,

title = "Markov decision processes with iterated coherent risk measures",

abstract = "This paper considers a Markov decision process in Borel state and action spaces with the aggregated (or say iterated) coherent risk measure to be minimised. For this problem, we establish the Bellman optimality equation as well as the value and policy iteration algorithms, and show the existence of a deterministic stationary optimal policy. The cost function, while being allowed to be unbounded from below (in the sense that its negative part needs be bounded by some nonnegative real-valued possibly unbounded weight function), can be arbitrarily unbounded from above and possibly infinitely valued.",

keywords = "Iterated coherent risk measure, Markov decision process, Optimality equation",

author = "Shanyun Chu and Yi Zhang",

note = "Publisher Copyright: {\textcopyright} 2014 Taylor & Francis.",

year = "2014",

doi = "10.1080/00207179.2014.909947",

language = "English",

volume = "87",

pages = "2286--2293",

journal = "International Journal of Control",

issn = "0020-7179",

publisher = "Taylor & Francis",

number = "11",

}

TY - JOUR

T1 - Markov decision processes with iterated coherent risk measures

AU - Chu, Shanyun

AU - Zhang, Yi

PY - 2014

Y1 - 2014

N2 - This paper considers a Markov decision process in Borel state and action spaces with the aggregated (or say iterated) coherent risk measure to be minimised. For this problem, we establish the Bellman optimality equation as well as the value and policy iteration algorithms, and show the existence of a deterministic stationary optimal policy. The cost function, while being allowed to be unbounded from below (in the sense that its negative part needs be bounded by some nonnegative real-valued possibly unbounded weight function), can be arbitrarily unbounded from above and possibly infinitely valued.

AB - This paper considers a Markov decision process in Borel state and action spaces with the aggregated (or say iterated) coherent risk measure to be minimised. For this problem, we establish the Bellman optimality equation as well as the value and policy iteration algorithms, and show the existence of a deterministic stationary optimal policy. The cost function, while being allowed to be unbounded from below (in the sense that its negative part needs be bounded by some nonnegative real-valued possibly unbounded weight function), can be arbitrarily unbounded from above and possibly infinitely valued.

KW - Iterated coherent risk measure

KW - Markov decision process

KW - Optimality equation

UR - http://www.scopus.com/inward/record.url?scp=84898767227&partnerID=8YFLogxK

U2 - 10.1080/00207179.2014.909947

DO - 10.1080/00207179.2014.909947

M3 - Article

AN - SCOPUS:84898767227

SN - 0020-7179

VL - 87

SP - 2286

EP - 2293

JO - International Journal of Control

JF - International Journal of Control

IS - 11

ER -

Markov decision processes with iterated coherent risk measures

Abstract

Bibliographical note

Keywords

ASJC Scopus subject areas

Access to Document

Fingerprint

Cite this