Search Sciweavers | Sciweavers

154 search results - page 14 / 31

» Sample-Efficient Evolutionary Function Approximation for Rei...

COR
2008

Application of reinforcement learning to the game of Othello

15 years 6 months ago

Download www.cs.uu.nl

Operations research and management science are often confronted with sequential decision making problems with large state spaces. Standard methods that are used for solving such c...

Nees Jan van Eck, Michiel C. van Wezel

NIPS
2008

Regularized Policy Iteration

15 years 8 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

NIPS
1996

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

CORR
2002
Springer

A neural model for multi-expert architectures

15 years 6 months ago

Download user.cs.tu-berlin.de

We present a generalization of conventional artificial neural networks that allows for a functional equivalence to multi-expert systems. The new model provides an architectural fr...

Marc Toussaint

ECAI
2008
Springer

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 8 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo
