Integrating value-directed compression and belief space analysis for POMDP decomposition

Xin Li*, Kwok Wai CHEUNG, Jiming LIU

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Partially observable Markov decision process (POMDP) is a commonly adopted framework to model planning problems for agents to act in a stochastic environment. Obtaining the optimal policy of POMDP for large-scale problems is known to be intractable, where the high dimension of its belief state is one of the major causes. The use of the compression approach has recently been shown to be promising in tackling the curse of dimensionality problem. In this paper, a novel value-directed belief compression technique is proposed,together with clustering of belief states for further reducing the underlying computational complexity. We first cluster some sampled belief states into disjoint partitions and then apply a non-negative matrix factorization (NMF) based projection to each belief state cluster for dimension reduction. We then compute the optimal policy is then computed using a pointed-based value iteration algorithm defined in the low-dimensional projected belief state space. The proposed algorithm has been evaluated using a synthesized navigation problem. Solutions with quality comparable to the original POMDP were obtained at a much lower computational cost.

Original languageEnglish
Title of host publicationProceedings - 2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT 2006 Main Conference Proceedings), IAT'06
PublisherIEEE Computer Society
Pages45-51
Number of pages7
ISBN (Print)9780769527482
DOIs
Publication statusPublished - 2006
Event2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT'06 - Hong Kong, China
Duration: 18 Dec 200622 Dec 2006

Publication series

NameProceedings - 2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT 2006 Main Conference Proceedings), IAT'06

Conference

Conference2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT'06
Country/TerritoryChina
CityHong Kong
Period18/12/0622/12/06

Scopus Subject Areas

  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'Integrating value-directed compression and belief space analysis for POMDP decomposition'. Together they form a unique fingerprint.

Cite this