Date Available
12-7-2011
Year of Publication
2007
Document Type
Thesis
College
Engineering
Department
Computer Science
First Advisor
Judy Goldsmith
Abstract
Markov decision processes (MDPs) are a general framework used by Artificial Intelligence (AI) researchers to model decision theoretic planning problems. Solving real world MDPs has been a major and challenging research topic in the AI literature. This paper discusses two main groups of approaches in solving MDPs. The first group of approaches combines the strategies of heuristic search and dynamic programming to expedite the convergence process. The second makes use of graphical structures in MDPs to decrease the effort of classic dynamic programming algorithms. Two new algorithms proposed by the author, MBLAO* and TVI, are described here.
Recommended Citation
Dai, Peng, "FASTER DYNAMIC PROGRAMMING FOR MARKOV DECISION PROCESSES" (2007). University of Kentucky Master's Theses. 428.
https://uknowledge.uky.edu/gradschool_theses/428