A survey of recent results on continuous-time Markov decision processes

Xianping Guo; Onésimo Hernández-Lerma; Tomás Prieto-Rumeau

doi:10.1007/BF02837562

A survey of recent results on continuous-time Markov decision processes

Xianping Guo ¹
Onésimo Hernández-Lerma ²
Tomás Prieto-Rumeau ³

1 Zhongshan University, China
2 CINVESTAV-IPN, Mexico
3 Universidad Nacional de Educaci´on a Distancia, España

Zugehörigkeit anzeigen +

Zeitschrift:

Top

ISSN: 1863-8279, 1134-5764

Datum der Publikation: 2006

Ausgabe: 14

Nummer: 2

Seiten: 177-261

Art: Artikel

DOI: 10.1007/BF02837562 DIALNET GOOGLE SCHOLAR Open Access editor

Andere Publikationen in: Top

Zusammenfassung

This paper is a survey of recent results on continuous-time Markov decision processes (MDPs) withunbounded transition rates, and reward rates that may beunbounded from above and from below. These results pertain to discounted and average reward optimality criteria, which are the most commonly used criteria, and also to more selective concepts, such as bias optimality and sensitive discount criteria. For concreteness, we consider only MDPs with a countable state space, but we indicate how the results can be extended to more general MDPs or to Markov games.

Fuente de los datos: Dialnet