Search Sciweavers | Sciweavers

50 search results - page 9 / 10

» Reinforcement Learning for Penalty Avoidance in Continuous S...

click to vote

IJCAI
2003

118views Artificial Intelligence» more IJCAI 2003»

Simultaneous Adversarial Multi-Robot Learning

13 years 8 months ago

Download www.cs.cmu.edu

Multi-robot learning faces all of the challenges of robot learning with all of the challenges of multiagent learning. There has been a great deal of recent research on multiagent ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

13 years 7 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

14 years 4 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

click to vote

ESANN
2007

122views Neural Networks» more ESANN 2007»

The Recurrent Control Neural Network

13 years 8 months ago

Download www.dice.ucl.ac.be

This paper presents our Recurrent Control Neural Network (RCNN), which is a model-based approach for a data-eﬃcient modelling and control of reinforcement learning problems in di...

Anton Maximilian Schäfer, Steffen Udluft, Han...

claim paper

Read More »

click to vote

IJRR
2011

159views more IJRR 2011»

Learning visual representations for perception-action systems

13 years 2 months ago

Download robot-learning.de

We discuss vision as a sensory modality for systems that eﬀect actions in response to perceptions. While the internal representations informed by vision may be arbitrarily compl...

Justus H. Piater, Sébastien Jodogne, Renaud...

claim paper

Read More »

« Prev « First page 9 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers