Sciweavers

4544 search results - page 262 / 909
» Reinforcement Learning with Time
Sort
View
ICANN
2010
Springer
13 years 11 months ago
Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients
Abstract. Developing superior artificial board-game players is a widelystudied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...
Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...
PKDD
2010
Springer
122views Data Mining» more  PKDD 2010»
13 years 9 months ago
Exploration in Relational Worlds
Abstract. One of the key problems in model-based reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large relational domains, in wh...
Tobias Lang, Marc Toussaint, Kristian Kersting
RAS
2010
164views more  RAS 2010»
13 years 9 months ago
Bridging the gap between feature- and grid-based SLAM
One important design decision for the development of autonomously navigating mobile robots is the choice of the representation of the environment. This includes the question which...
Kai M. Wurm, Cyrill Stachniss, Giorgio Grisetti
IAT
2010
IEEE
13 years 8 months ago
Multiagent Meta-level Control for a Network of Weather Radars
It is crucial for embedded systems to adapt to the dynamics of open environments. This adaptation process becomes especially challenging in the context of multiagent systems. In t...
Shanjun Cheng, Anita Raja, Victor R. Lesser
COLT
2001
Springer
14 years 3 months ago
Learning Rates for Q-Learning
In this paper we derive convergence rates for Q-learning. We show an interesting relationship between the convergence rate and the learning rate used in Q-learning. For a polynomi...
Eyal Even-Dar, Yishay Mansour