Search Sciweavers | Sciweavers

24 search results - page 4 / 5

» Technical Update: Least-Squares Temporal Difference Learning

NIPS
2007

Incremental Natural Actor-Critic Algorithms

15 years 7 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

ICML
2007
IEEE

Bayesian actor-critic algorithms

16 years 6 months ago

Download www.machinelearning.org

We present a new actor-critic learning model in which a Bayesian class of non-parametric critics, using Gaussian process temporal difference learning, is used. Such critics model ...

Mohammad Ghavamzadeh, Yaakov Engel

ECTEL
2007
Springer

Remote Cooperation on Project-centred Learning: a Working Implemented Solution in Academia

16 years 6 days ago

Download ftp.informatik.rwth-aachen.de

The paper aims at illustrating the original technical solution provided within an academic institute in order to manage teaching activities, encompassing the coordination of projec...

Carola Salvioni, Antonio Vincenzo Taddeo

ICALT
2003
IEEE

Gaining Computational Literacy by Creating Hybrid Aesthetic Learning Spaces

15 years 11 months ago

Download www.kimm.uni-luebeck.de

Although the technical skills of pupils are quite high, the current approach to gain media literacy still focusses on updating software applying skills, rather than exploring the ...

Daniela Reimann, Michael Herczeg, Thomas Winkler, ...

JMLR
2006

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

15 years 6 months ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis
