Search Sciweavers | Sciweavers

4544 search results - page 262 / 909

» Reinforcement Learning with Time

ICANN
2010
Springer

164 views, Neural Networks

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients

15 years 2 months ago

Download www.idsia.ch

Abstract. Developing superior artificial board-game players is a widely studied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...

Mandy Grüttner, Frank Sehnke, Tom Schaul, Jü...

PKDD
2010
Springer

122 views, Data Mining

Exploration in Relational Worlds

15 years 1 month ago

Download user.cs.tu-berlin.de

Abstract. One of the key problems in model-based reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large relational domains, in wh...

Tobias Lang, Marc Toussaint, Kristian Kersting

RAS
2010

164 views

Bridging the gap between feature- and grid-based SLAM

15 years 1 month ago

Download www.informatik.uni-freiburg.de

One important design decision for the development of autonomously navigating mobile robots is the choice of the representation of the environment. This includes the question which...

Kai M. Wurm, Cyrill Stachniss, Giorgio Grisetti

IAT
2010
IEEE

133 views, Intelligent Agents

Multiagent Meta-level Control for a Network of Weather Radars

15 years 22 days ago

Download coitweb.uncc.edu

It is crucial for embedded systems to adapt to the dynamics of open environments. This adaptation process becomes especially challenging in the context of multiagent systems. In t...

Shanjun Cheng, Anita Raja, Victor R. Lesser

COLT
2001
Springer

84 views, Machine Learning

Learning Rates for Q-Learning

15 years 7 months ago

Download www.ai.mit.edu

In this paper we derive convergence rates for Q-learning. We show an interesting relationship between the convergence rate and the learning rate used in Q-learning. For a polynomi...

Eyal Even-Dar, Yishay Mansour
