Sciweavers

1630 search results - page 204 / 326
» Coordinated Reinforcement Learning
INTERSPEECH
2010
14 years 11 months ago
Still talking to machines (cognitively speaking)
This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...
Steve Young
JMLR
2010
14 years 11 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ICML
1998
IEEE
16 years 4 months ago
Intra-Option Learning about Temporally Abstract Actions
Intra-Option Learning about Temporally Abstract Actions. Richard S. Sutton, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610, rich@cs.umass.edu; Doina Precup, D...
Richard S. Sutton, Doina Precup, Satinder P. Singh
IJCAI
2001
15 years 5 months ago
Robot Weightlifting By Direct Policy Search
This paper describes a method for structuring a robot motor learning task. By designing a suitably parameterized policy, we show that a simple search algorithm, along with biologi...
Michael T. Rosenstein, Andrew G. Barto
GECCO
2005
Springer
15 years 9 months ago
Event-driven learning classifier systems for online soccer games
This paper reports on the application of classifier systems to the acquisition of decision-making algorithms for agents in online soccer games. The objective of this research is t...
Yuji Sato, Ryutaro Kanno