Search Sciweavers | Sciweavers

343 search results - page 6 / 69

» Action discovery for reinforcement learning

click to vote

ICML
2009
IEEE

194views Machine Learning» more ICML 2009»

Binary action search for learning continuous-action control policies

14 years 8 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

claim paper

Read More »

click to vote

Book

392views

Reinforcement Learning: An Introduction

15 years 5 months ago

Download www.cs.ualberta.ca

"Reinforcement learning is learning what to do how to map situations to actions so as to maximize a numerical reward signal. The learner is not told which actions to take, as ...

Richard S. Sutton, Andrew G. Barto

posted by scimaster

Read More »

click to vote

ATAL
2006
Springer

103views Intelligent Agents» more ATAL 2006»

Rule value reinforcement learning for cognitive agents

13 years 11 months ago

Download vega.soi.city.ac.uk

RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule...

Christopher Child, Kostas Stathis

claim paper

Read More »

click to vote

AI
2001
Springer

118views Artificial Intelligence» more AI 2001»

Imitation and Reinforcement Learning in Agents with Heterogeneous Actions

14 years 5 days ago

Download www.cs.toronto.edu

Reinforcement learning techniques are increasingly being used to solve di cult problems in control and combinatorial optimization with promising results. Implicit imitation can acc...

Bob Price, Craig Boutilier

claim paper

Read More »

click to vote

IEAAIE
2001
Springer

98views Artificial Intelligence» more IEAAIE 2001»

On the Relationship between Learning Capability and the Boltzmann-Formula

14 years 3 days ago

Download members.iif.hu

In this paper a combined use of reinforcement learning and simulated annealing is treated. Most of the simulated annealing methods suggest using heuristic temperature bounds as the...

Péter Stefán, Laszlo Monostori

claim paper

Read More »

« Prev « First page 6 / 69 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers