Search Sciweavers | Sciweavers

779 search results - page 39 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

197

Voted

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 8 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

215

click to vote

AINA
2006
IEEE

179views Computer Networks» more AINA 2006»

Constrained Flooding: A Robust and Efficient Routing Framework for Wireless Sensor Networks

15 years 11 months ago

Download www.parc.com

Flooding protocols for wireless networks in general have been shown to be very inefficient and therefore are mainly used in network initialization or route discovery and maintenan...

Ying Zhang, Markus P. J. Fromherz

claim paper

Read More »

226

click to vote

LAMAS
2005
Springer

168views Intelligent Agents» more LAMAS 2005»

Multi-agent Relational Reinforcement Learning

16 years 27 days ago

Download dtai.cs.kuleuven.be

In this paper we report on using a relational state space in multi-agent reinforcement learning. There is growing evidence in the Reinforcement Learning research community that a r...

Tom Croonenborghs, Karl Tuyls, Jan Ramon, Maurice ...

claim paper

Read More »

188

Voted

NIPS
1998

137views Information Technology» more NIPS 1998»

Risk Sensitive Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch

claim paper

Read More »

209

click to vote

AIIA
2007
Springer

147views Artificial Intelligence» more AIIA 2007»

Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions

16 years 1 months ago

Download sequel.futurs.inria.fr

The application of Reinforcement Learning (RL) algorithms to learn tasks for robots is often limited by the large dimension of the state space, which may make prohibitive its appli...

Andrea Bonarini, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

« Prev « First page 39 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers