Search Sciweavers | Sciweavers

127 search results - page 13 / 26

» Online Methods for Multi-Domain Learning and Adaptation

202

JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 24 days ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

152

ICPR
2008
IEEE

217 views, Computer Vision

Human tracking based on Soft Decision Feature and online real boosting

16 years 7 months ago

Download www.vision.cs.chubu.ac.jp

Online Boosting is an effective incremental learning method which can update weak classifiers efficiently according to the object being tracked. It is a promising technique for o...

Hironobu Fujiyoshi, Masato Kawade, Shihong Lao, Ta...

166

ICRA
2003
IEEE

108 views, Robotics

On-line safe path planning in unknown environments

15 years 11 months ago

Download www.techfak.uni-bielefeld.de

For the on-line safe path planning of a mobile robot in unknown environments, the paper proposes a simple Hopfield Neural Network (HNN) planner. Without learning process, the...

Weidong Chen, Changhong Fan, Yugeng Xi

159

KES
2004
Springer

165 views, Information Technology

Coordination in Multiagent Reinforcement Learning Systems

15 years 11 months ago

Download cig.ees.kyushu-u.ac.jp

This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method, a reinforcement-learning agent learns to select its action ...

M. A. S. Kamal, Junichi Murata

162

CVPR
2003
IEEE

179 views, Computer Vision

Adaptive Pattern Discovery for Interactive Multimedia Retrieval

15 years 11 months ago

Download www.cse.buffalo.edu

Relevance feedback has been an indispensable component for multimedia retrieval systems. In this paper, we present an adaptive pattern discovery method, which addresses relevance ...

Yimin Wu, Aidong Zhang
