Search Sciweavers | Sciweavers

157

ECML
2003
Springer

118views Machine Learning» more ECML 2003»

A New Way to Introduce Knowledge into Reinforcement Learning

16 years 16 hour ago

We present in this paper a method to introduce a priori knowledge into reinforcement learning using temporally extended actions. The aim of our work is to reduce the learning time ...

Pascal Garcia

claim paper

Read More »

197

click to vote

CAEPIA
2011
Springer

188views Artificial Intelligence» more CAEPIA 2011»

Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test

14 years 6 months ago

Download users.dsic.upv.es

In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation of a popular reinforcement learning algorithm, Q-learning. We show that a general...

Javier Insa-Cabrera, David L. Dowe, José He...

claim paper

Read More »

246

click to vote

CVPR
2012
IEEE

218views Computer Vision» more CVPR 2012»

RALF: A reinforced active learning formulation for object class recognition

13 years 9 months ago

Download www.d2.mpi-inf.mpg.de

Active learning aims to reduce the amount of labels required for classiﬁcation. The main difﬁculty is to ﬁnd a good trade-off between exploration and exploitation of the lab...

Sandra Ebert, Mario Fritz, Bernt Schiele

claim paper

Read More »

217

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Kernel-Based Reinforcement Learning on Representative States

13 years 9 months ago

Download www.bkveton.com

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...

Branislav Kveton, Georgios Theocharous

claim paper

Read More »

211

click to vote

AAAI
2011

202views Intelligent Agents» more AAAI 2011»

Value Function Approximation in Reinforcement Learning Using the Fourier Basis

14 years 6 months ago

Download people.csail.mit.edu

We describe the Fourier Basis, a linear value function approximation scheme based on the Fourier Series. We empirically evaluate its properties, and demonstrate that it performs w...

George Konidaris, Sarah Osentoski, Philip Thomas

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers