Search Sciweavers | Sciweavers

1233 search results - page 122 / 247

» Feudal Reinforcement Learning

SGAI
2010
Springer

Artificial Intelligence

Hierarchical Traces for Reduced NSM Memory Requirements

15 years 4 months ago

Download staff.newport.ac.uk

This paper presents work on using hierarchical long term memory to reduce the memory requirements of nearest sequence memory (NSM) learning, a previously published, instance-based ...

Torbjørn S. Dahl

INTERSPEECH
2010

Signal Processing

Still talking to machines (cognitively speaking)

15 years 28 days ago

Download mi.eng.cam.ac.uk

This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...

Steve Young

JMLR
2010

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 27 days ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

IJCAI
2007

Artificial Intelligence

Reinforcement Learning of Local Shape in the Game of Go

15 years 7 months ago

Download webdocs.cs.ualberta.ca

We explore an application to the game of Go of a reinforcement learning approach based on a linear evaluation function and large numbers of binary features. This strategy has prov...

David Silver, Richard S. Sutton, Martin Mülle...

DAGM
2006
Springer

Image Processing

Handling Camera Movement Constraints in Reinforcement Learning Based Active Object Recognition

15 years 9 months ago

Download www5.informatik.uni-erlangen.de

In real world scenes, objects to be classified are usually not visible from every direction, since they are almost always positioned on some kind of opaque plane. When moving a cam...

Christian Derichs, Heinrich Niemann
