Search Sciweavers | Sciweavers

495 search results - page 16 / 99

» Constructing States for Reinforcement Learning

ICML
2008
IEEE

162 views · Machine Learning

Automatic discovery and transfer of MAXQ hierarchies

14 years 8 months ago

Download pages.cs.wisc.edu

We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful...

Neville Mehta, Soumya Ray, Prasad Tadepalli, Thoma...

ROBOCUP
2007
Springer

167 views · Robotics

Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others

14 years 1 month ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suffering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...

Kentarou Noma, Yasutake Takahashi, Minoru Asada

ATAL
2004
Springer

101 views · Intelligent Agents

From Global Selective Perception to Local Selective Perception

14 years 1 month ago

Download www.damas.ift.ulaval.ca

This paper presents a reinforcement learning algorithm used to allocate tasks to agents in an uncertain real-time environment. In such environment, tasks have to be analyzed and a...

Sébastien Paquet, Nicolas Bernier, Brahim C...

ECML
2003
Springer

149 views · Machine Learning

Could Active Perception Aid Navigation of Partially Observable Grid Worlds?

14 years 29 days ago

Download homepages.inf.ed.ac.uk

Due to the unavoidable fact that a robot’s sensors will be limited in some manner, it is entirely possible that it can find itself unable to distinguish between differing state...

Paul A. Crook, Gillian Hayes

IWANN
1999
Springer

115 views · Neural Networks

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

13 years 12 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson
