Search Sciweavers | Sciweavers

463 search results - page 19 / 93

» Localizing Search in Reinforcement Learning

199

click to vote

AAAI
2006

116views Intelligent Agents» more AAAI 2006»

Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping

15 years 8 months ago

Download www.cs.utexas.edu

Transfer learning concerns applying knowledge learned in one task (the source) to improve learning another related task (the target). In this paper, we use structure mapping, a ps...

Yaxin Liu, Peter Stone

claim paper

Read More »

183

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 7 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

192

click to vote

HIS
2004

195views Information Technology» more HIS 2004»

Stigmergy in Multi Agent Reinforcement Learning

15 years 8 months ago

Download hal.inria.fr

In this paper, we describe how certain aspects of the biological phenomena of stigmergy can be imported into multiagent reinforcement learning (MARL), with the purpose of better e...

Raghav Aras, Alain Dutech, François Charpil...

claim paper

Read More »

174

click to vote

GECCO
2005
Springer

155views Optimization» more GECCO 2005»

Co-evolving recurrent neurons learn deep memory POMDPs

16 years 17 days ago

Download www.idsia.ch

Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...

Faustino J. Gomez, Jürgen Schmidhuber

claim paper

Read More »

201

click to vote

GECCO
2004
Springer

155views Optimization» more GECCO 2004»

Genetic Network Programming with Reinforcement Learning and Its Performance Evaluation

16 years 13 days ago

Download www.cs.york.ac.uk

A new graph-based evolutionary algorithm named “Genetic Network Programming, GNP” has been proposed. GNP represents its solutions as directed graph structures, which can improv...

Shingo Mabu, Kotaro Hirasawa, Jinglu Hu

claim paper

Read More »

« Prev « First page 19 / 93 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers