Search Sciweavers | Sciweavers

4544 search results - page 159 / 909

» Reinforcement Learning with Time

166

click to vote

ECAI
2008
Springer

124views Artificial Intelligence» more ECAI 2008»

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 8 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

claim paper

Read More »

141

click to vote

ICCBR
2005
Springer

91views Automated Reasoning» more ICCBR 2005»

Opportunities for CBR in Learning by Doing

16 years 18 days ago

Download gaia.fdi.ucm.es

In this paper we partially describe JV2 M, a metaphorical simulation of the Java Virtual Machine where students can learn Java language compilation and reinforce object-oriented pr...

Pedro Pablo Gómez-Martín, Marco Anto...

claim paper

Read More »

200

click to vote

DAGM
2007
Springer

148views Image Processing» more DAGM 2007»

Efficient Learning of Neural Networks with Evolutionary Algorithms

15 years 11 months ago

Download www.ks.informatik.uni-kiel.de

Abstract. In this article we present EANT2, a method that creates neural networks (NNs) by evolutionary reinforcement learning. The structure of NNs is developed using mutation ope...

Nils T. Siebel, Jochen Krause, Gerald Sommer

claim paper

Read More »

189

click to vote

ECAL
2007
Springer

227views Artificial Intelligence» more ECAL 2007»

Guided Self-organisation for Autonomous Robot Development

16 years 1 months ago

Download robot.informatik.uni-leipzig.de

Abstract. The paper presents a method to guide the self-organised development of behaviours of autonomous robots. In earlier publications we demonstrated how to use the homeokinesi...

Georg Martius, J. Michael Herrmann, Ralf Der

claim paper

Read More »

216

click to vote

ICML
2005
IEEE

196views Machine Learning» more ICML 2005»

Bayesian sparse sampling for on-line reward optimization

16 years 8 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

« Prev « First page 159 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers