Search Sciweavers | Sciweavers

128 search results - page 6 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

205

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

16 years 7 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

194

click to vote

KI
2007
Springer

124views Artificial Intelligence» more KI 2007»

Making a Robot Learn to Play Soccer Using Reward and Punishment

16 years 1 months ago

Download www.ni.uos.de

In this paper, we show how reinforcement learning can be applied to real robots to achieve optimal robot behavior. As example, we enable an autonomous soccer robot to learn interce...

Heiko Müller, Martin Lauer, Roland Hafner, Sa...

claim paper

Read More »

210

click to vote

ACL
2008

127views Computational Linguistics» more ACL 2008»

Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz Data: Bootstrapping and Evaluation

15 years 8 months ago

Download www.aclweb.org

We address two problems in the field of automatic optimization of dialogue strategies: learning effective dialogue strategies when no initial data or system exists, and evaluating...

Verena Rieser, Oliver Lemon

claim paper

Read More »

152

click to vote

ICML
2008
IEEE

105views Machine Learning» more ICML 2008»

Learning all optimal policies with multiple criteria

16 years 7 months ago

Download leon.barrettnexus.com

We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear pref...

Leon Barrett, Srini Narayanan

claim paper

Read More »

184

click to vote

WSC
2008

154views Modeling And Simulation» more WSC 2008»

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

15 years 9 months ago

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

claim paper

Read More »

« Prev « First page 6 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers