Search Sciweavers | Sciweavers

166 search results - page 10 / 34

» Safe exploration for reinforcement learning

click to vote

GECCO
2005
Springer

98views Optimization» more GECCO 2005»

Intelligent exploration method for XCS

14 years 2 months ago

Download www.cs.bham.ac.uk

Exploration/Exploitation equilibrium is one of the most challenging issues in reinforcement learning area as well as learning classifier systems such as XCS. In this paper1 , an i...

Ali Hamzeh, Adel Rahmani

claim paper

Read More »

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

14 years 9 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

click to vote

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

13 years 10 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

click to vote

NN
2002
Springer

113views Neural Networks» more NN 2002»

Control of exploitation-exploration meta-parameter in reinforcement learning

13 years 8 months ago

Download www.fil.ion.ucl.ac.uk

In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...

Shin Ishii, Wako Yoshida, Junichiro Yoshimoto

claim paper

Read More »

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

13 years 10 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

« Prev « First page 10 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers