Search Sciweavers | Sciweavers

160 search results - page 22 / 32

» Optimization on a Budget: A Reinforcement Learning Approach

click to vote

ATAL
2010
Springer

146views Intelligent Agents» more ATAL 2010»

PAC-MDP learning with knowledge-based admissible models

13 years 7 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

claim paper

Read More »

click to vote

NIPS
1993

134views Information Technology» more NIPS 1993»

Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming

13 years 9 months ago

Download www.cs.cmu.edu

Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...

Christopher G. Atkeson

claim paper

Read More »

click to vote

PRL
2006

148views more PRL 2006»

An agent based evolutionary approach to path detection for off-road vehicle guidance

13 years 7 months ago

Download www.ce.unipr.it

This paper describes an ant colony optimization approach adopted to decide on road-borders to automatically guide a vehicle developed for the DARPA Grand Challenge 2004, available...

Alberto Broggi, Stefano Cattani

claim paper

Read More »

click to vote

ESANN
2007

122views Neural Networks» more ESANN 2007»

The Recurrent Control Neural Network

13 years 9 months ago

Download www.dice.ucl.ac.be

This paper presents our Recurrent Control Neural Network (RCNN), which is a model-based approach for a data-eﬃcient modelling and control of reinforcement learning problems in di...

Anton Maximilian Schäfer, Steffen Udluft, Han...

claim paper

Read More »

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

13 years 6 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

« Prev « First page 22 / 32 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers