Search Sciweavers | Sciweavers

1262 search results - page 186 / 253

» Reinforcement Learning: An Introduction

180

Voted

SGAI
2010
Springer

226views Artificial Intelligence» more SGAI 2010»

Hierarchical Traces for Reduced NSM Memory Requirements

15 years 1 months ago

Download staff.newport.ac.uk

This paper presents work on using hierarchical long term memory to reduce the memory requirements of nearest sequence memory (NSM) learning, a previously published, instance-based ...

Torbjørn S. Dahl

claim paper

Read More »

144

Voted

INTERSPEECH
2010

175views Signal Processing» more INTERSPEECH 2010»

Still talking to machines (cognitively speaking)

14 years 10 months ago

Download mi.eng.cam.ac.uk

This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...

Steve Young

claim paper

Read More »

163

Voted

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 10 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

125

Voted

ICML
1998
IEEE

165views Machine Learning» more ICML 1998»

Intra-Option Learning about Temporally Abstract Actions

16 years 4 months ago

Download www.cs.ualberta.ca

tion Learning about Temporally Abstract Actions Richard S. Sutton Department of Computer Science University of Massachusetts Amherst, MA 01003-4610 rich@cs.umass.edu Doina Precup D...

Richard S. Sutton, Doina Precup, Satinder P. Singh

claim paper

Read More »

122

click to vote

GECCO
2005
Springer

139views Optimization» more GECCO 2005»

Event-driven learning classifier systems for online soccer games

15 years 9 months ago

Download www.genetic-programming.org

This paper reports on the application of classifier systems to the acquisition of decision-making algorithms for agents in online soccer games. The objective of this research is t...

Yuji Sato, Ryutaro Kanno

claim paper

Read More »

« Prev « First page 186 / 253 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers