Search Sciweavers | Sciweavers

458 search results - page 66 / 92

» Q-Decomposition for Reinforcement Learning Agents

173

Voted

ACL
1998

129views Computational Linguistics» more ACL 1998»

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email

15 years 7 months ago

Download acl.eldoc.ub.rug.nl

This paper describes a novel method by which a dialogue agent can learn to choose an optimal dialogue strategy. While it is widely agreed that dialogue strategies should be formul...

Marilyn A. Walker, Jeanne Frommer, Shrikanth Naray...

claim paper

Read More »

157

click to vote

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

15 years 7 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

178

Voted

AGI
2011

222views Artificial Intelligence» more AGI 2011»

Measuring Agent Intelligence via Hierarchies of Environments

14 years 9 months ago

Download www.ssec.wisc.edu

Under Legg’s and Hutter’s formal measure [1], performance in easy environments counts more toward an agent’s intelligence than does performance in difficult environments. An ...

Bill Hibbard

claim paper

Read More »

198

click to vote

AGI
2008

142views Artificial Intelligence» more AGI 2008»

Transfer Learning and Intelligence: an Argument and Approach

15 years 7 months ago

Download www.cs.utexas.edu

In order to claim fully general intelligence in an autonomous agent, the ability to learn is one of the most central capabilities. Classical machine learning techniques have had ma...

Matthew E. Taylor, Gregory Kuhlmann, Peter Stone

claim paper

Read More »

164

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

16 years 12 days ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

« Prev « First page 66 / 92 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers