Search Sciweavers | Sciweavers

181 search results - page 16 / 37

» On Policy Learning in Restricted Policy Spaces

click to vote

RSS
2007

135views Robotics» more RSS 2007»

Learning omnidirectional path following using dimensionality reduction

13 years 9 months ago

Download www.roboticsproceedings.org

Abstract— We consider the task of omnidirectional path following for a quadruped robot: moving a four-legged robot along any arbitrary path while turning in any arbitrary manner....

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

click to vote

ATAL
2005
Springer

146views Intelligent Agents» more ATAL 2005»

Exploiting belief bounds: practical POMDPs for personal assistant agents

14 years 1 months ago

Download teamcore.usc.edu

Agents or agent teams deployed to assist humans often face the challenges of monitoring the state of key processes in their environment (including the state of their human users t...

Pradeep Varakantham, Rajiv T. Maheswaran, Milind T...

claim paper

Read More »

click to vote

ESANN
2007

122views Neural Networks» more ESANN 2007»

The Recurrent Control Neural Network

13 years 9 months ago

Download www.dice.ucl.ac.be

This paper presents our Recurrent Control Neural Network (RCNN), which is a model-based approach for a data-eﬃcient modelling and control of reinforcement learning problems in di...

Anton Maximilian Schäfer, Steffen Udluft, Han...

claim paper

Read More »

click to vote

ECML
2007
Springer

108views Machine Learning» more ECML 2007»

Safe Q-Learning on Complete History Spaces

14 years 1 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

claim paper

Read More »

click to vote

ICANN
1997
Springer

87views Neural Networks» more ICANN 1997»

On Learning Soccer Strategies

13 years 11 months ago

Download igitur-archive.library.uu.nl

We use simulated soccer to study multiagent learning. Each team's players (agents) share action set and policy but may behave differently due to position-dependent inputs. All...

Rafal Salustowicz, Marco Wiering, Jürgen Schm...

claim paper

Read More »

« Prev « First page 16 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers