Reducing the Number of Queries in Interactive Value Iteration

9 years 29 days ago

Download www-desir.lip6.fr

To tackle the potentially hard task of deﬁning the reward function in a Markov Decision Process (MDPs), a new approach, called Interactive Value Iteration (IVI) has recently been proposed by Weng and Zanuttini (2013). This solving method, which interweaves elicitation and optimization phases, computes a (near) optimal policy without knowing the precise reward values. The procedure as originally presented can be improved in order to reduce the number of queries needed to determine an optimal policy. The key insights are that 1) asking queries should be delayed as much as possible, avoiding asking queries that might not be necessary to determine the best policy, 2) queries should be asked by following a priority order because the answers to some queries can enable to resolve some other queries, 3) queries can be avoided by using heuristic information to guide the process. Following these ideas, a modiﬁed IVI algorithm is presented and experimental results show a signiﬁcant decrease...

Hugo Gilbert, Olivier Spanjaard, Paolo Viappiani,

Real-time Traffic

ALDT 2015 | Algorithms |

claim paper

» Genomephysics interaction as a new concept to reduce the number of genetic parameters in a...

» A New Iterative Method for Solving Initial Value Problems

» Product Recommendation with Interactive Query Management and Twofold Similarity

» Iterative Constructions and Private Data Release

» Learning LargeAlphabet and Analog Circuits with Value Injection Queries

» The HybridLayer Index A synergic approach to answering topk queries in arbitrary subspaces

» Fast network querying algorithm for searching largescale biological networks

» XIST An XML Index Selection Tool

» Hierarchical Bitmap Index An Efficient and Scalable Indexing Technique for SetValued Attri...

Post Info
More Details (n/a)

Added	15 Apr 2016
Updated	15 Apr 2016
Type	Journal
Year	2015
Where	ALDT
Authors	Hugo Gilbert, Olivier Spanjaard, Paolo Viappiani, Paul Weng

Comments (0)

Sciweavers

Reducing the Number of Queries in Interactive Value Iteration

ALDT 2015 | Algorithms |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers