Search Sciweavers | Sciweavers

80 search results - page 10 / 16

CONSTRAINTS
2008

A Reinforcement Learning Approach to Interval Constraint Propagation

15 years 2 months ago

Download www.crt.umontreal.ca

When solving systems of nonlinear equations with interval constraint methods, it has often been observed that many calls to contracting operators do not participate actively to th...

Frédéric Goualard, Christophe Jerman...

JMLR
2006

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

15 years 2 months ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis

GECCO
2007
Springer

Optimization

Learning recursive programs with cooperative coevolution of genetic code mapping and genotype

15 years 6 months ago

Download www.cs.bham.ac.uk

The Probabilistic Adaptive Mapping Developmental Genetic Programming (PAM DGP) algorithm that cooperatively coevolves a population of adaptive mappings and associated genotypes is...

Garnett Carl Wilson, Malcolm I. Heywood

RAS
2000

Active object recognition by view integration and reinforcement learning

15 years 2 months ago

Download www.emt.tu-graz.ac.at

A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. Active recognition of three-dimensional objects involves the observer in a sear...

Lucas Paletta, Axel Pinz

ICMLA
2010

Machine Learning

Multimodal Parameter-exploring Policy Gradients

15 years 14 days ago

Download www6.in.tum.de

Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...
