Search Sciweavers | Sciweavers

1630 search results - page 151 / 326

» Coordinated Reinforcement Learning

169

click to vote

ML
1998
ACM

220views Machine Learning» more ML 1998»

Learning to Improve Coordinated Actions in Cooperative Distributed Problem-Solving Environments

15 years 3 months ago

Download mas.cs.umass.edu

Abstract. Coordination is an essential technique in cooperative, distributed multiagent systems. However, sophisticated coordination strategies are not always cost-effective in all...

Toshiharu Sugawara, Victor R. Lesser

claim paper

Read More »

141

click to vote

EUSFLAT
2009

140views Fuzzy Logic» more EUSFLAT 2009»

Incremental Possibilistic Approach for Online Clustering and Classification

15 years 1 months ago

Download www.eusflat.org

In this paper, we propose to develop the supervised classification method Fuzzy Pattern Matching to be in addition a non supervised one. The goal is to monitor dynamic systems with...

Moamar Sayed Mouchaweh, Bernard Riera

claim paper

Read More »

128

click to vote

ICML
2005
IEEE

137views Machine Learning» more ICML 2005»

Learning to compete, compromise, and cooperate in repeated general-sum games

16 years 4 months ago

Download www.mit.edu

Learning algorithms often obtain relatively low average payoffs in repeated general-sum games between other learning agents due to a focus on myopic best-response and one-shot Nas...

Jacob W. Crandall, Michael A. Goodrich

claim paper

Read More »

137

click to vote

AAAI
2007

104views Intelligent Agents» more AAAI 2007»

Active Imitation Learning

15 years 6 months ago

Download www.cs.washington.edu

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...

Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao

claim paper

Read More »

143

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 4 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

« Prev « First page 151 / 326 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers