Sciweavers

827 search results - page 60 / 166
» Variational methods for Reinforcement Learning
ROBOCUP
2007
Springer
14 years 1 month ago
Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others
The existing reinforcement learning approaches have been suffering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...
Kentarou Noma, Yasutake Takahashi, Minoru Asada
JMLR
2010
13 years 2 months ago
Efficient Multioutput Gaussian Processes through Variational Inducing Kernels
Interest in multioutput kernel methods is increasing, whether under the guise of multitask learning, multisensor networks or structured output data. From the Gaussian process pers...
Mauricio Alvarez, David Luengo, Michalis Titsias, ...
ATAL
2008
Springer
13 years 9 months ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...
TMM
2010
13 years 2 months ago
Video Annotation Through Search and Graph Reinforcement Mining
Unlimited vocabulary annotation of multimedia documents remains elusive despite progress solving the problem in the case of a small, fixed lexicon. Taking advantage of th...
Emily Moxley, Tao Mei, Bangalore S. Manjunath
ICML
2009
IEEE
14 years 8 months ago
Discovering options from example trajectories
We present a novel technique for automated problem decomposition to address the problem of scalability in reinforcement learning. Our technique makes use of a set of near-optimal ...
Peng Zang, Peng Zhou, David Minnen, Charles Lee Is...