Note on discounted continuous-time Markov decision processes with a lower bounding function

Xin Guo; Alexey Piunovskiy; Yi Zhang

doi:10.1017/jpr.2017.53

Xin Guo^*, Alexey Piunovskiy, Yi Zhang

^*Corresponding author for this work

Mathematics

Research output: Contribution to journal › Article › peer-review

Abstract

We consider the discounted continuous-time Markov decision process (CTMDP), where the negative part of each cost rate is bounded by a drift function, say w, whereas the positive part is allowed to be arbitrarily unbounded. Our focus is on the existence of a stationary optimal policy for the discounted CTMDP problems out of the more general class. Both constrained and unconstrained problems are considered. Our investigations are based on the continuous-time version of the Veinott transformation. This technique has not been widely employed in the previous literature on CTMDPs, but it clarifies the roles of the imposed conditions in a rather transparent way.

Original language	English
Pages (from-to)	1071-1088
Number of pages	18
Journal	Journal of Applied Probability
Volume	54
Issue number	4
DOIs	https://doi.org/10.1017/jpr.2017.53
Publication status	Published - 1 Dec 2017

Bibliographical note

Publisher Copyright:
Copyright © Applied Probability Trust 2017.

Keywords

Continuous-time Markov decision process
discounted criterion

ASJC Scopus subject areas

Statistics and Probability
Mathematics(all)
Statistics, Probability and Uncertainty

Cite this

@article{e340e48e28e34d8989db2e0b5382f13d,
  title = "Note on discounted continuous-time Markov decision processes with a lower bounding function",
  abstract = "We consider the discounted continuous-time Markov decision process (CTMDP), where the negative part of each cost rate is bounded by a drift function, say w, whereas the positive part is allowed to be arbitrarily unbounded. Our focus is on the existence of a stationary optimal policy for the discounted CTMDP problems out of the more general class. Both constrained and unconstrained problems are considered. Our investigations are based on the continuous-time version of the Veinott transformation. This technique has not been widely employed in the previous literature on CTMDPs, but it clarifies the roles of the imposed conditions in a rather transparent way.",
  keywords = "Continuous-time Markov decision process, discounted criterion",
  author = "Xin Guo and Alexey Piunovskiy and Yi Zhang",
  note = "Publisher Copyright: Copyright {\textcopyright} Applied Probability Trust 2017.",
  year = "2017",
  month = dec,
  day = "1",
  doi = "10.1017/jpr.2017.53",
  language = "English",
  volume = "54",
  pages = "1071--1088",
  journal = "Journal of Applied Probability",
  issn = "0021-9002",
  publisher = "University of Sheffield",
  number = "4",
}

TY  - JOUR
T1  - Note on discounted continuous-time Markov decision processes with a lower bounding function
AU  - Guo, Xin
AU  - Piunovskiy, Alexey
AU  - Zhang, Yi
PY  - 2017/12/1
Y1  - 2017/12/1
N2  - We consider the discounted continuous-time Markov decision process (CTMDP), where the negative part of each cost rate is bounded by a drift function, say w, whereas the positive part is allowed to be arbitrarily unbounded. Our focus is on the existence of a stationary optimal policy for the discounted CTMDP problems out of the more general class. Both constrained and unconstrained problems are considered. Our investigations are based on the continuous-time version of the Veinott transformation. This technique has not been widely employed in the previous literature on CTMDPs, but it clarifies the roles of the imposed conditions in a rather transparent way.
AB  - We consider the discounted continuous-time Markov decision process (CTMDP), where the negative part of each cost rate is bounded by a drift function, say w, whereas the positive part is allowed to be arbitrarily unbounded. Our focus is on the existence of a stationary optimal policy for the discounted CTMDP problems out of the more general class. Both constrained and unconstrained problems are considered. Our investigations are based on the continuous-time version of the Veinott transformation. This technique has not been widely employed in the previous literature on CTMDPs, but it clarifies the roles of the imposed conditions in a rather transparent way.
KW  - Continuous-time Markov decision process
KW  - discounted criterion
UR  - http://www.scopus.com/inward/record.url?scp=85041348120&partnerID=8YFLogxK
U2  - 10.1017/jpr.2017.53
DO  - 10.1017/jpr.2017.53
M3  - Article
AN  - SCOPUS:85041348120
SN  - 0021-9002
VL  - 54
SP  - 1071
EP  - 1088
JO  - Journal of Applied Probability
JF  - Journal of Applied Probability
IS  - 4
ER  - 
