large relational markov

113

PKDD
2010
Springer

122views Data Mining» more PKDD 2010»

15 years 16 days ago

Abstract. One of the key problems in model-based reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large relational domains, in wh...

Tobias Lang, Marc Toussaint, Kristian Kersting

claim paper

Read More »

click to vote

NIPS
2003

196views Information Technology» more NIPS 2003»

Approximate Policy Iteration with a Policy Language Bias

15 years 3 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers