Search Sciweavers | Sciweavers

81 search results - page 8 / 17

» The Optimal Reward Baseline for Gradient-Based Reinforcement...

159

click to vote

ICRA
2009
IEEE

143views Robotics» more ICRA 2009»

Least absolute policy iteration for robust value function approximation

16 years 15 days ago

Download sugiyama-www.cs.titech.ac.jp

Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efﬁciency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

claim paper

Read More »

181

click to vote

JAIR
2008

119views more JAIR 2008»

A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics

15 years 5 months ago

Download www.ece.utk.edu

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Due to the complexity of the problem, the majority of the previo...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

135

click to vote

ACL
2010

142views Computational Linguistics» more ACL 2010»

Optimising Information Presentation for Spoken Dialogue Systems

15 years 3 months ago

Download aclweb.org

We present a novel approach to Information Presentation (IP) in Spoken Dialogue Systems (SDS) using a data-driven statistical optimisation framework for content planning and attri...

Verena Rieser, Oliver Lemon, Xingkun Liu

claim paper

Read More »

148

click to vote

ICML
2005
IEEE

119views Machine Learning» more ICML 2005»

Dynamic preferences in multi-criteria reinforcement learning

16 years 6 months ago

Download www.machinelearning.org

The current framework of reinforcement learning is based on maximizing the expected returns based on scalar rewards. But in many real world situations, tradeoffs must be made amon...

Sriraam Natarajan, Prasad Tadepalli

claim paper

Read More »

181

click to vote

ECAL
2001
Springer

110views Artificial Intelligence» more ECAL 2001»

Evolution of Reinforcement Learning in Uncertain Environments: Emergence of Risk-Aversion and Matching

15 years 10 months ago

Download gandalf.psych.umn.edu

Reinforcement learning (RL) is a fundamental process by which organisms learn to achieve a goal from interactions with the environment. Using Artiﬁcial Life techniques we derive ...

Yael Niv, Daphna Joel, Isaac Meilijson, Eytan Rupp...

claim paper

Read More »

« Prev « First page 8 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers