Sciweavers

827 search results - page 76 / 166
» Variational methods for Reinforcement Learning
Sort
View
JMLR
2010
189views more  JMLR 2010»
13 years 2 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
IROS
2008
IEEE
125views Robotics» more  IROS 2008»
14 years 2 months ago
Dynamic correlation matrix based multi-Q learning for a multi-robot system
—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng
IDEAL
2004
Springer
14 years 1 months ago
Learning Users' Interests in a Market-Based Recommender System
Recommender systems are widely used to cope with the problem of information overload and, consequently, many recommendation methods have been developed. However, no one technique i...
Yan Zheng Wei, Luc Moreau, Nicholas R. Jennings
AOIS
2004
13 years 9 months ago
Market-Based Recommender Systems: Learning Users' Interests by Quality Classification
Recommender systems are widely used to cope with the problem of information overload and, consequently, many recommendation methods have been developed. However, no one technique i...
Yan Zheng Wei, Luc Moreau, Nicholas R. Jennings
ICCV
2009
IEEE
15 years 24 days ago
Learning Pedestrian Dynamics from the Real World
In this paper we describe a method to learn parameters which govern pedestrian motion by observing video data. Our learning framework is based on variational mode learning and a...
Paul Scovanner, Marshall Tappen