Feature-Discovering Approximate Value Iteration Methods



Sets of features in Markov decision processes can play a critical role in approximately representing value and in abstracting the state space. Selection of features is crucial to the success of a system and is most often conducted by a human. We study the problem of automatically selecting problem features, and propose and evaluate a simple approach that reduces the problem of selecting a new feature to standard classification learning. We learn a classifier that predicts the sign of the Bellman error over a training set of states. By iteratively adding new classifiers as features with this method, training between iterations with approximate value iteration, we find a Tetris feature set that significantly outperforms randomly constructed features and obtains a score of about three-tenths of the highest score obtained by using a carefully hand-constructed feature set. We also show that features learned with this method outperform those learned with the previous method of Patrascu et al. [4] ...
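
The core step described in the abstract, turning the sign of the Bellman error into a classification target and adding the learned classifier as a new feature, can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the 10-state chain MDP, the reward in the last state, the discount factor, the decision-stump classifier, and all function names. The weight-training step with approximate value iteration between feature additions is left out.

```python
from typing import Callable, List, Sequence

State = int
Feature = Callable[[State], float]

GAMMA = 0.9                             # assumed discount factor
STATES: List[State] = list(range(10))   # hypothetical chain of states 0..9


def step(s: State, a: int) -> State:
    """Toy deterministic dynamics: move left (-1) or right (+1), clipped to the chain."""
    return min(max(s + a, 0), len(STATES) - 1)


def reward(s: State) -> float:
    """Toy reward: 1 in the last state of the chain, 0 elsewhere."""
    return 1.0 if s == len(STATES) - 1 else 0.0


def value(weights: Sequence[float], features: Sequence[Feature], s: State) -> float:
    """Linear value estimate: weighted sum of feature values at s."""
    return sum(w * f(s) for w, f in zip(weights, features))


def bellman_error(weights: Sequence[float], features: Sequence[Feature], s: State) -> float:
    """One-step Bellman backup at s minus the current value estimate at s."""
    best_next = max(value(weights, features, step(s, a)) for a in (-1, 1))
    return reward(s) + GAMMA * best_next - value(weights, features, s)


def learn_stump(states: Sequence[State], labels: Sequence[int]) -> Feature:
    """Fit a decision stump (threshold on the state index) to +1/-1 labels.

    A stump stands in for the 'standard classification learning' step;
    any other classifier could be substituted.
    """
    best = None
    for t in states:
        for polarity in (1, -1):
            preds = [polarity if s >= t else -polarity for s in states]
            errors = sum(p != y for p, y in zip(preds, labels))
            if best is None or errors < best[0]:
                best = (errors, t, polarity)
    _, t, polarity = best
    return lambda s: polarity if s >= t else -polarity


def discover_feature(weights: Sequence[float], features: Sequence[Feature]) -> Feature:
    """Label each training state by the sign of its Bellman error, then fit a classifier."""
    labels = [1 if bellman_error(weights, features, s) > 0 else -1 for s in STATES]
    return learn_stump(STATES, labels)


# Start from a single constant feature with zero weight, then add one learned feature.
features: List[Feature] = [lambda s: 1.0]
weights: List[float] = [0.0]
new_feature = discover_feature(weights, features)
features.append(new_feature)
weights.append(0.0)  # new weight would be trained by approximate value iteration (not shown)
```

With a zero initial value estimate, only the rewarding state has positive Bellman error in this toy problem, so the learned stump separates that state from the rest. In a full loop, the weights would be retrained before the next feature is discovered.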

Jia-Hong Wu, Robert Givan
