Sciweavers

127 search results - page 13 / 26
» Online Methods for Multi-Domain Learning and Adaptation
Sort
View
JMLR
2010
189views more  JMLR 2010»
13 years 2 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ICPR
2008
IEEE
14 years 8 months ago
Human tracking based on Soft Decision Feature and online real boosting
Online Boosting is an effective incremental learning method which can update weak classifiers efficiently according to the object being trackedt. It is a promising technique for o...
Hironobu Fujiyoshi, Masato Kawade, Shihong Lao, Ta...
ICRA
2003
IEEE
108views Robotics» more  ICRA 2003»
14 years 27 days ago
On-line safe path planning in unknown environments
s - For the on-line safe path planning of a mobile robot in unknown environments, the paper proposes a simple Hopfield Neural Network ( HNN ) planner. Without learning process, the...
Weidong Chen, Changhong Fan, Yugeng Xi
KES
2004
Springer
14 years 29 days ago
Coordination in Multiagent Reinforcement Learning Systems
This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method a reinforcement-learning agent learns to select its action ...
M. A. S. Kamal, Junichi Murata
CVPR
2003
IEEE
14 years 27 days ago
Adaptive Pattern Discovery for Interactive Multimedia Retrieval
Relevance feedback has been an indispensable component for multimedia retrieval systems. In this paper, we present an adaptive pattern discovery method, which addresses relevance ...
Yimin Wu, Aidong Zhang