Search Sciweavers | Sciweavers

181 search results - page 14 / 37

» On Policy Learning in Restricted Policy Spaces

click to vote

ICML
2002
IEEE

128views Machine Learning» more ICML 2002»

Pruning Improves Heuristic Search for Cost-Sensitive Learning

14 years 8 months ago

Download web.engr.oregonstate.edu

This paper addresses cost-sensitive classification in the setting where there are costs for measuring each attribute as well as costs for misclassification errors. We show how to ...

Valentina Bayer Zubek, Thomas G. Dietterich

claim paper

Read More »

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Fitted Q-iteration by Advantage Weighted Regression

13 years 9 months ago

Download www.kyb.mpg.de

Recently, fitted Q-iteration (FQI) based methods have become more popular due to their increased sample efficiency, a more stable learning process and the higher quality of the re...

Gerhard Neumann, Jan Peters

claim paper

Read More »

click to vote

ECML
2006
Springer

146views Machine Learning» more ECML 2006»

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

13 years 11 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

13 years 7 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

click to vote

IJAIT
2008

146views more IJAIT 2008»

Learning to Behave in Space: a Qualitative Spatial Representation for Robot Navigation with Reinforcement Learning

13 years 7 months ago

Download www.aussagekraft.de

ion mechanism to create a representation of space consisting of the circular order of detected landmarks and the relative position of walls towards the agent's moving directio...

Lutz Frommberger

claim paper

Read More »

« Prev « First page 14 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers