Sciweavers

92 search results - page 14 / 19
» A General Convergence Method for Reinforcement Learning in t...
Sort
View
ICML
2001
IEEE
14 years 7 months ago
Direct Policy Search using Paired Statistical Tests
Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...
Malcolm J. A. Strens, Andrew W. Moore
COLT
2005
Springer
14 years 5 days ago
Ranking and Scoring Using Empirical Risk Minimization
A general model is proposed for studying ranking problems. We investigate learning methods based on empirical minimization of the natural estimates of the ranking risk. The empiric...
Stéphan Clémençon, Gáb...
CEC
2005
IEEE
14 years 10 days ago
XCS with computed prediction for the learning of Boolean functions
Computed prediction represents a major shift in learning classifier system research. XCS with computed prediction, based on linear approximators, has been applied so far to functi...
Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...
ECML
2007
Springer
14 years 26 days ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
ISCC
2003
IEEE
110views Communications» more  ISCC 2003»
13 years 12 months ago
Intelligent Agents Serving Based On The Society Information
In this paper, we propose a serving system consisting intelligent agents processing society information in a multi-user domain. The agents use the similarity information on the us...
Sanem Sariel, B. Tevfik Akgün