Search Sciweavers | Sciweavers

128 search results - page 13 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

226

click to vote

INLG
2010
Springer

134views Natural Language Processing» more INLG 2010»

Hierarchical Reinforcement Learning for Adaptive Text Generation

15 years 5 months ago

Download www.aclweb.org

We present a novel approach to natural language generation (NLG) that applies hierarchical reinforcement learning to text generation in the wayfinding domain. Our approach aims to...

Nina Dethlefs, Heriberto Cuayáhuitl

claim paper

Read More »

216

click to vote

JAIR
2008

119views more JAIR 2008»

A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics

15 years 7 months ago

Download www.ece.utk.edu

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Due to the complexity of the problem, the majority of the previo...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

210

click to vote

ICML
2010
IEEE

222views Machine Learning» more ICML 2010»

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 5 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner

claim paper

Read More »

189

click to vote

ECAL
2007
Springer

227views Artificial Intelligence» more ECAL 2007»

Guided Self-organisation for Autonomous Robot Development

16 years 1 months ago

Download robot.informatik.uni-leipzig.de

Abstract. The paper presents a method to guide the self-organised development of behaviours of autonomous robots. In earlier publications we demonstrated how to use the homeokinesi...

Georg Martius, J. Michael Herrmann, Ralf Der

claim paper

Read More »

187

click to vote

ICML
2005
IEEE

119views Machine Learning» more ICML 2005»

Dynamic preferences in multi-criteria reinforcement learning

16 years 7 months ago

Download www.machinelearning.org

The current framework of reinforcement learning is based on maximizing the expected returns based on scalar rewards. But in many real world situations, tradeoffs must be made amon...

Sriraam Natarajan, Prasad Tadepalli

claim paper

Read More »

« Prev « First page 13 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers