Search Sciweavers | Sciweavers

1236 search results - page 157 / 248

» Opposition-Based Reinforcement Learning

191

click to vote

AAAI
2007

104views Intelligent Agents» more AAAI 2007»

Active Imitation Learning

15 years 9 months ago

Download www.cs.washington.edu

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...

Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao

claim paper

Read More »

195

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

174

click to vote

ATAL
2004
Springer

105views Intelligent Agents» more ATAL 2004»

Best-Response Multiagent Learning in Non-Stationary Environments

16 years 12 days ago

Download www.odu.edu

This paper investigates a relatively new direction in Multiagent Reinforcement Learning. Most multiagent learning techniques focus on Nash equilibria as elements of both the learn...

Michael Weinberg, Jeffrey S. Rosenschein

claim paper

Read More »

197

click to vote

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

15 years 10 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

156

click to vote

ACL
2010

142views Computational Linguistics» more ACL 2010»

Optimising Information Presentation for Spoken Dialogue Systems

15 years 5 months ago

Download aclweb.org

We present a novel approach to Information Presentation (IP) in Spoken Dialogue Systems (SDS) using a data-driven statistical optimisation framework for content planning and attri...

Verena Rieser, Oliver Lemon, Xingkun Liu

claim paper

Read More »

« Prev « First page 157 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers