Sciweavers

2108 search results - page 112 / 422
» Tracking in Reinforcement Learning
Sort
View
EUSFLAT
2001
144views Fuzzy Logic» more  EUSFLAT 2001»
13 years 11 months ago
Adaptive torque control using a connectionist reinforcement learning agent
The correction of angular misalignment between mating components is a fundamental requirement for their successful assembly. In this paper we present how a learning agent based on...
Lorenzo Brignone, Martin Howarth, S. Sivayoganatha...
ICML
2010
IEEE
13 years 11 months ago
Efficient Reinforcement Learning with Multiple Reward Functions for Randomized Controlled Trial Analysis
We introduce new, efficient algorithms for value iteration with multiple reward functions and continuous state. We also give an algorithm for finding the set of all nondominated a...
Daniel J. Lizotte, Michael H. Bowling, Susan A. Mu...
FLAIRS
2004
13 years 11 months ago
State Space Reduction For Hierarchical Reinforcement Learning
er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...
Mehran Asadi, Manfred Huber
ICML
2006
IEEE
14 years 10 months ago
Qualitative reinforcement learning
When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...
Arkady Epshteyn, Gerald DeJong
ICML
2000
IEEE
14 years 10 months ago
Reinforcement Learning in POMDP's via Direct Gradient Ascent
This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...
Jonathan Baxter, Peter L. Bartlett