Search Sciweavers | Sciweavers

2108 search results - page 112 / 422

» Tracking in Reinforcement Learning

115

Voted

EUSFLAT
2001

144views Fuzzy Logic» more EUSFLAT 2001»

Adaptive torque control using a connectionist reinforcement learning agent

15 years 4 months ago

Download www.eusflat.org

The correction of angular misalignment between mating components is a fundamental requirement for their successful assembly. In this paper we present how a learning agent based on...

Lorenzo Brignone, Martin Howarth, S. Sivayoganatha...

claim paper

Read More »

108

Voted

ICML
2010
IEEE

171views Machine Learning» more ICML 2010»

Efficient Reinforcement Learning with Multiple Reward Functions for Randomized Controlled Trial Analysis

15 years 3 months ago

Download www.stat.lsa.umich.edu

We introduce new, efficient algorithms for value iteration with multiple reward functions and continuous state. We also give an algorithm for finding the set of all nondominated a...

Daniel J. Lizotte, Michael H. Bowling, Susan A. Mu...

claim paper

Read More »

127

Voted

FLAIRS
2004

140views Artificial Intelligence» more FLAIRS 2004»

State Space Reduction For Hierarchical Reinforcement Learning

15 years 4 months ago

Download ranger.uta.edu

er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...

Mehran Asadi, Manfred Huber

claim paper

Read More »

102

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

16 years 3 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

113

Voted

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 3 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

« Prev « First page 112 / 422 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers