Sciweavers

2108 search results - page 170 / 422
» Tracking in Reinforcement Learning
Sort
View
130
Voted
ROBOCUP
2000
Springer
104views Robotics» more  ROBOCUP 2000»
15 years 6 months ago
Essex Wizards 2000 Team Description
: This article gives an overview of the Essex Wizards 2000 team participated in the RoboCup 2000 simulator league. A brief description of the agent architecture for the team is int...
Huosheng Hu, Kostas Kostiadis, Matthew Hunter, Kos...
114
Voted
ESANN
2008
15 years 4 months ago
Similarities and differences between policy gradient methods and evolution strategies
Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...
Verena Heidrich-Meisner, Christian Igel
124
Voted
NIPS
2007
15 years 4 months ago
Stable Dual Dynamic Programming
Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...
Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...
144
Voted
AGI
2008
15 years 4 months ago
Transfer Learning and Intelligence: an Argument and Approach
In order to claim fully general intelligence in an autonomous agent, the ability to learn is one of the most central capabilities. Classical machine learning techniques have had ma...
Matthew E. Taylor, Gregory Kuhlmann, Peter Stone
146
Voted
MIR
2005
ACM
129views Multimedia» more  MIR 2005»
15 years 8 months ago
Tracking concept drifting with an online-optimized incremental learning framework
Concept drifting is an important and challenging research issue in the field of machine learning. This paper mainly addresses the issue of semantic concept drifting in time series...
Jun Wu, Dayong Ding, Xian-Sheng Hua, Bo Zhang