Search Sciweavers | Sciweavers

272 search results - page 23 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

174

click to vote

ICANN
2007
Springer

132views Neural Networks» more ICANN 2007»

MaxSet: An Algorithm for Finding a Good Approximation for the Largest Linearly Separable Set

16 years 29 days ago

Download www.lcc.uma.es

Finding the largest linearly separable set of examples for a given Boolean function is a NP-hard problem, that is relevant to neural network learning algorithms and to several prob...

Leonardo Franco, José Luis Subirats, Jos&ea...

claim paper

Read More »

209

click to vote

PRICAI
2000
Springer

193views Artificial Intelligence» more PRICAI 2000»

Generating Hierarchical Structure in Reinforcement Learning from State Variables

15 years 10 months ago

Download www.csee.umbc.edu

This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...

Bernhard Hengst

claim paper

Read More »

195

click to vote

AAAI
2006

190views Intelligent Agents» more AAAI 2006»

Action Selection in Bayesian Reinforcement Learning

15 years 8 months ago

Download www.aaai.org

My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...

Tao Wang

claim paper

Read More »

187

click to vote

SIAMCO
2002

121views more SIAMCO 2002»

Consistent Approximations and Approximate Functions and Gradients in Optimal Control

15 years 6 months ago

Download www.ann.jussieu.fr

As shown in [7], optimal control problems with either ODE or PDE dynamics can be solved efficiently using a setting of consistent approximations obtained by numerical discretizati...

Olivier Pironneau, Elijah Polak

claim paper

Read More »

236

click to vote

ECML
2006
Springer

146views Machine Learning» more ECML 2006»

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

15 years 10 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

« Prev « First page 23 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers