Search Sciweavers | Sciweavers

1799 search results - page 113 / 360

» Filtered Reinforcement Learning

JSAC
2010

Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications

15 years 26 days ago

Download medianetlab.ee.ucla.edu

In this paper, we study how to optimize the transmission decisions of nodes aimed at supporting mission-critical applications, such as surveillance, security monitoring,...

Hsien-Po Shiang, Mihaela van der Schaar

ICML
1995
IEEE

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

16 years 3 months ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

ICRA
2008
IEEE

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

15 years 9 months ago

Download www.cs.cmu.edu

We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

AAAI
2008

Maximum Entropy Inverse Reinforcement Learning

15 years 4 months ago

Download www.andrew.cmu.edu

Recent research has shown the benefit of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of recoveri...

Brian Ziebart, Andrew L. Maas, J. Andrew Bagnell, ...

JMLR
2006

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

15 years 2 months ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis
