Search Sciweavers | Sciweavers

1630 search results - page 204 / 326

» Coordinated Reinforcement Learning

153

click to vote

INTERSPEECH
2010

175views Signal Processing» more INTERSPEECH 2010»

Still talking to machines (cognitively speaking)

14 years 11 months ago

Download mi.eng.cam.ac.uk

This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...

Steve Young

claim paper

Read More »

172

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 11 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

131

Voted

ICML
1998
IEEE

165views Machine Learning» more ICML 1998»

Intra-Option Learning about Temporally Abstract Actions

16 years 4 months ago

Download www.cs.ualberta.ca

tion Learning about Temporally Abstract Actions Richard S. Sutton Department of Computer Science University of Massachusetts Amherst, MA 01003-4610 rich@cs.umass.edu Doina Precup D...

Richard S. Sutton, Doina Precup, Satinder P. Singh

claim paper

Read More »

145

Voted

IJCAI
2001

141views Artificial Intelligence» more IJCAI 2001»

Robot Weightlifting By Direct Policy Search

15 years 5 months ago

Download reference.kfupm.edu.sa

This paper describes a method for structuring a robot motor learning task. By designing a suitably parameterized policy, we show that a simple search algorithm, along with biologi...

Michael T. Rosenstein, Andrew G. Barto

claim paper

Read More »

128

click to vote

GECCO
2005
Springer

139views Optimization» more GECCO 2005»

Event-driven learning classifier systems for online soccer games

15 years 9 months ago

Download www.genetic-programming.org

This paper reports on the application of classifier systems to the acquisition of decision-making algorithms for agents in online soccer games. The objective of this research is t...

Yuji Sato, Ryutaro Kanno

claim paper

Read More »

« Prev « First page 204 / 326 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers