Search Sciweavers | Sciweavers

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

ICML
2008
IEEE

Non-parametric policy gradients: a unified treatment of propositional and relational domains

16 years 7 months ago

Download www-kd.iai.uni-bonn.de

Policy gradient approaches are a powerful instrument for learning how to interact with the environment. Existing approaches have focused on propositional and continuous domains on...

Kristian Kersting, Kurt Driessens

EPIA
2007
Springer

Generalization and Transfer Learning in Noise-Affected Robot Navigation Tasks

16 years 1 month ago

Download www.aussagekraft.de

Abstract. When a robot learns to solve a goal-directed navigation task with reinforcement learning, the acquired strategy can usually exclusively be applied to the task that has be...

Lutz Frommberger

ECML
2003
Springer

Could Active Perception Aid Navigation of Partially Observable Grid Worlds?

16 years 6 days ago

Download homepages.inf.ed.ac.uk

Due to the unavoidable fact that a robot’s sensors will be limited in some manner, it is entirely possible that it can find itself unable to distinguish between differing state...

Paul A. Crook, Gillian Hayes
