Constrained total undiscounted continuous-time Markov decision processes

Xianping Guo; Yi Zhang

doi:10.3150/15-BEJ793

Constrained total undiscounted continuous-time Markov decision processes

Xianping Guo, Yi Zhang

Mathematics

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

The present paper considers the constrained optimal control problem with total undiscounted criteria for a continuous-time Markov decision process (CTMDP) in Borel state and action spaces. The cost rates are nonnegative. Under the standard compactness and continuity conditions, we show the existence of an optimal stationary policy out of the class of general nonstationary ones. In the process, we justify the reduction of the CTMDP model to a discrete-time Markov decision process (DTMDP) model based on the studies of the undiscounted occupancy and occupation measures. We allow that the controlled process is not necessarily absorbing, and the transition rates are not necessarily separated from zero, and can be arbitrarily unbounded; these features count for the main technical difficulties in studying undiscounted CTMDP models.

Original language	English
Pages (from-to)	1694-1736
Number of pages	43
Journal	Bernoulli
Volume	23
Issue number	3
DOIs	https://doi.org/10.3150/15-BEJ793
Publication status	Published - Aug 2017

Bibliographical note

Publisher Copyright:
© 2017 ISI/BS.

Keywords

Constrained optimality
Continuous-time Markov decision processes
Total undiscounted criteria

ASJC Scopus subject areas

Statistics and Probability

Access to Document

10.3150/15-BEJ793

Cite this

@article{5ce38f2817fb48d193518fff4d7ff4ec,

title = "Constrained total undiscounted continuous-time Markov decision processes",

abstract = "The present paper considers the constrained optimal control problem with total undiscounted criteria for a continuous-time Markov decision process (CTMDP) in Borel state and action spaces. The cost rates are nonnegative. Under the standard compactness and continuity conditions, we show the existence of an optimal stationary policy out of the class of general nonstationary ones. In the process, we justify the reduction of the CTMDP model to a discrete-time Markov decision process (DTMDP) model based on the studies of the undiscounted occupancy and occupation measures. We allow that the controlled process is not necessarily absorbing, and the transition rates are not necessarily separated from zero, and can be arbitrarily unbounded; these features count for the main technical difficulties in studying undiscounted CTMDP models.",

keywords = "Constrained optimality, Continuous-time Markov decision processes, Total undiscounted criteria",

author = "Xianping Guo and Yi Zhang",

note = "Publisher Copyright: {\textcopyright} 2017 ISI/BS.",

year = "2017",

month = aug,

doi = "10.3150/15-BEJ793",

language = "English",

volume = "23",

pages = "1694--1736",

journal = "Bernoulli",

issn = "1350-7265",

publisher = "Bernoulli Society for Mathematical Statistics and Probability",

number = "3",

}

TY - JOUR

T1 - Constrained total undiscounted continuous-time Markov decision processes

AU - Guo, Xianping

AU - Zhang, Yi

PY - 2017/8

Y1 - 2017/8

N2 - The present paper considers the constrained optimal control problem with total undiscounted criteria for a continuous-time Markov decision process (CTMDP) in Borel state and action spaces. The cost rates are nonnegative. Under the standard compactness and continuity conditions, we show the existence of an optimal stationary policy out of the class of general nonstationary ones. In the process, we justify the reduction of the CTMDP model to a discrete-time Markov decision process (DTMDP) model based on the studies of the undiscounted occupancy and occupation measures. We allow that the controlled process is not necessarily absorbing, and the transition rates are not necessarily separated from zero, and can be arbitrarily unbounded; these features count for the main technical difficulties in studying undiscounted CTMDP models.

AB - The present paper considers the constrained optimal control problem with total undiscounted criteria for a continuous-time Markov decision process (CTMDP) in Borel state and action spaces. The cost rates are nonnegative. Under the standard compactness and continuity conditions, we show the existence of an optimal stationary policy out of the class of general nonstationary ones. In the process, we justify the reduction of the CTMDP model to a discrete-time Markov decision process (DTMDP) model based on the studies of the undiscounted occupancy and occupation measures. We allow that the controlled process is not necessarily absorbing, and the transition rates are not necessarily separated from zero, and can be arbitrarily unbounded; these features count for the main technical difficulties in studying undiscounted CTMDP models.

KW - Constrained optimality

KW - Continuous-time Markov decision processes

KW - Total undiscounted criteria

UR - http://www.scopus.com/inward/record.url?scp=85016182639&partnerID=8YFLogxK

U2 - 10.3150/15-BEJ793

DO - 10.3150/15-BEJ793

M3 - Article

AN - SCOPUS:85016182639

SN - 1350-7265

VL - 23

SP - 1694

EP - 1736

JO - Bernoulli

JF - Bernoulli

IS - 3

ER -

Constrained total undiscounted continuous-time Markov decision processes

Abstract

Bibliographical note

Keywords

ASJC Scopus subject areas

Access to Document

Fingerprint

Cite this