Search Sciweavers | Sciweavers

1233 search results - page 228 / 247

» Feudal Reinforcement Learning

175

click to vote

AINA
2006
IEEE

179views Computer Networks» more AINA 2006»

Constrained Flooding: A Robust and Efficient Routing Framework for Wireless Sensor Networks

15 years 9 months ago

Download www.parc.com

Flooding protocols for wireless networks in general have been shown to be very inefficient and therefore are mainly used in network initialization or route discovery and maintenan...

Ying Zhang, Markus P. J. Fromherz

claim paper

Read More »

139

click to vote

NIPS
2007

146views Information Technology» more NIPS 2007»

Receding Horizon Differential Dynamic Programming

15 years 7 months ago

Download books.nips.cc

The control of high-dimensional, continuous, non-linear dynamical systems is a key problem in reinforcement learning and control. Local, trajectory-based methods, using techniques...

Yuval Tassa, Tom Erez, William D. Smart

claim paper

Read More »

168

click to vote

ICMLA
2004

114views Machine Learning» more ICMLA 2004»

Planning with predictive state representations

15 years 7 months ago

Download www.eecs.umich.edu

Predictive state representation (PSR) models for controlled dynamical systems have recently been proposed as an alternative to traditional models such as partially observable Mark...

Michael R. James, Satinder P. Singh, Michael L. Li...

claim paper

Read More »

150

click to vote

LPE
1997

75views Logical Reasoning» more LPE 1997»

Visualizing Solutions with Viewers

15 years 7 months ago

Download www.complang.tuwien.ac.at

Visualization can be a powerful aid for learning a programming language. It may be used to reinforce central language concepts. In the context of Prolog and CLP-languages, however...

Ulrich Neumerkel, Christoph Rettig, Christian Scha...

claim paper

Read More »

140

click to vote

NIPS
1997

121views Information Technology» more NIPS 1997»

Generalized Prioritized Sweeping

15 years 7 months ago

Download www.cs.huji.ac.il

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...

David Andre, Nir Friedman, Ronald Parr

claim paper

Read More »

« Prev « First page 228 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers