reinforcement | Sciweavers

JMLR
2012

200 views | Programming Languages

Contextual Bandit Learning with Predictable Rewards

13 years 9 months ago

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

AGI
2011

231 views | Artificial Intelligence

Reinforcement Learning and the Bayesian Control Rule

14 years 10 months ago

Download metatip.com

We present an actor-critic scheme for reinforcement learning in complex domains. The main contribution is to show that planning and I/O dynamics can be separated such that an intra...

Pedro Alejandro Ortega, Daniel Alexander Braun, Si...

CORR
2011
Springer

136 views | Education

Reinforcement Learning for Agents with Many Sensors and Actuators Acting in Categorizable Environments

14 years 10 months ago

Download www.aaai.org

In this paper, we confront the problem of applying reinforcement learning to agents that perceive the environment through many sensors and that can perform parallel actions using ...

Enric Celaya, Josep M. Porta

NECO
2010

103 views

Posterior Weighted Reinforcement Learning with State Uncertainty

15 years 5 months ago

Download www.maths.bris.ac.uk

Reinforcement learning models generally assume that a stimulus is presented that allows a learner to unambiguously identify the state of nature, and the reward received is drawn f...

Tobias Larsen, David S. Leslie, Edmund J. Collins,...

JIRS
2010

120 views

Designing Decentralized Controllers for Distributed-Air-Jet MEMS-Based Micromanipulators by Reinforcement Learning

15 years 5 months ago

Download www.smartsurface.cnrs.fr

Distributed-air-jet MEMS-based systems have been proposed to manipulate small parts with high velocities and without any friction problems. The control of such distributed systems ...

Laëtitia Matignon, Guillaume J. Laurent, Nadi...

ICRA
2010
IEEE

137 views | Robotics

Robot reinforcement learning using EEG-based reward signals

15 years 5 months ago

Download webdiis.unizar.es

Reinforcement learning algorithms have been successfully applied in robotics to learn how to solve tasks based on reward signals obtained during task execution. These r...

Iñaki Iturrate, Luis Montesano, Javier Ming...

NN
2007
Springer

105 views | Neural Networks

Guiding exploration by pre-existing knowledge without modifying reward

15 years 6 months ago

Download www.cs.hut.fi

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...

Kary Främling

CORR
1998
Springer

164 views | Education

Training Reinforcement Neurocontrollers Using the Polytope Algorithm

15 years 6 months ago

Download zeus.cs.uoi.gr

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...

Aristidis Likas, Isaac E. Lagaris

AAMAS
2002
Springer

130 views | Intelligent Agents

Relational Reinforcement Learning for Agents in Worlds with Objects

15 years 6 months ago

Download www-ai.ijs.si

In reinforcement learning, an agent tries to learn a policy, i.e., how to select an action in a given state of the environment, so that it maximizes the total amount of reward it ...

Sašo Džeroski

AIHC
2007
Springer

324 views | Applied Computing

Emotion and Reinforcement: Affective Facial Expressions Facilitate Robot Learning

16 years 22 days ago

Download mmi.tudelft.nl

Computer models can be used to investigate the role of emotion in learning. Here we present EARL, our framework for the systematic study of the relation between emotion, adaptation...

Joost Broekens
