Sciweavers

166 search results - page 12 / 34
» Safe exploration for reinforcement learning
Sort
View
ATAL
2006
Springer
14 years 9 days ago
Probabilistic policy reuse in a reinforcement learning agent
We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...
Fernando Fernández, Manuela M. Veloso
SBIA
2004
Springer
14 years 1 months ago
Heuristically Accelerated Q-Learning: A New Approach to Speed Up Reinforcement Learning
This work presents a new algorithm, called Heuristically Accelerated Q–Learning (HAQL), that allows the use of heuristics to speed up the well-known Reinforcement Learning algori...
Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...
ICML
1999
IEEE
14 years 9 months ago
Using Reinforcement Learning to Spider the Web Efficiently
Consider the task of exploring the Web in order to find pages of a particular kind or on a particular topic. This task arises in the construction of search engines and Web knowled...
Jason Rennie, Andrew McCallum
CVPR
2012
IEEE
11 years 11 months ago
RALF: A reinforced active learning formulation for object class recognition
Active learning aims to reduce the amount of labels required for classification. The main difficulty is to find a good trade-off between exploration and exploitation of the lab...
Sandra Ebert, Mario Fritz, Bernt Schiele
IJCAI
2003
13 years 10 months ago
A Bayesian Approach to Imitation in Reinforcement Learning
In multiagent environments, forms of social learning such as teaching and imitation have been shown to aid the transfer of knowledge from experts to learners in reinforcement lear...
Bob Price, Craig Boutilier