Search Sciweavers | Sciweavers

1227 search results - page 1 / 246

» Learning Rates for Q-Learning

172

click to vote

IROS
2008
IEEE

125views Robotics» more IROS 2008»

Dynamic correlation matrix based multi-Q learning for a multi-robot system

16 years 1 months ago

Download www.ece.stevens-tech.edu

—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

claim paper

Read More »

170

click to vote

ICML
1998
IEEE

268views Machine Learning» more ICML 1998»

The MAXQ Method for Hierarchical Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich

claim paper

Read More »

193

click to vote

NIPS
2003

207views Information Technology» more NIPS 2003»

Extending Q-Learning to General Adaptive Multi-Agent Systems

15 years 8 months ago

Download books.nips.cc

Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...

Gerald Tesauro

claim paper

Read More »

176

click to vote

ICRA
2002
IEEE

133views Robotics» more ICRA 2002»

The Necessity of Average Rewards in Cooperative Multirobot Learning

15 years 11 months ago

Download www.ri.cmu.edu

Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular singlerobot learning algorithms based on discou...

Poj Tangamchit, John M. Dolan, Pradeep K. Khosla

claim paper

Read More »

180

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Transfer Learning in Collaborative Filtering with Uncertain Ratings

13 years 9 months ago

Download www.cse.ust.hk

To solve the sparsity problem in collaborative ﬁltering, researchers have introduced transfer learning as a viable approach to make use of auxiliary data. Most previous transfer...

Weike Pan, Evan Wei Xiang, Qiang Yang

claim paper

Read More »

« Prev « First page 1 / 246 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers