Search Sciweavers | Sciweavers

268 search results - page 42 / 54

» Solving multiagent assignment Markov decision processes

192

click to vote

ATAL
2010
Springer

115views Intelligent Agents» more ATAL 2010»

Self-organization for coordinating decentralized reinforcement learning

15 years 7 months ago

Download www.cs.umass.edu

Decentralized reinforcement learning (DRL) has been applied to a number of distributed applications. However, one of the main challenges faced by DRL is its convergence. Previous ...

Chongjie Zhang, Victor R. Lesser, Sherief Abdallah

claim paper

Read More »

219

Voted

UAI
2000

136views Artificial Intelligence» more UAI 2000»

Fast Planning in Stochastic Games

15 years 8 months ago

Download www.cis.upenn.edu

Stochastic games generalize Markov decision processes MDPs to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards de...

Michael J. Kearns, Yishay Mansour, Satinder P. Sin...

claim paper

Read More »

141

click to vote

ICML
2007
IEEE

162views Machine Learning» more ICML 2007»

Automatic shaping and decomposition of reward functions

16 years 7 months ago

Download www.machinelearning.org

This paper investigates the problem of automatically learning how to restructure the reward function of a Markov decision process so as to speed up reinforcement learning. We begi...

Bhaskara Marthi

claim paper

Read More »

153

click to vote

ICML
2008
IEEE

147views Machine Learning» more ICML 2008»

Apprenticeship learning using linear programming

16 years 7 months ago

Download www.cs.ualberta.ca

In apprenticeship learning, the goal is to learn a policy in a Markov decision process that is at least as good as a policy demonstrated by an expert. The difficulty arises in tha...

Umar Syed, Michael H. Bowling, Robert E. Schapire

claim paper

Read More »

183

click to vote

ICN
2007
Springer

97views Computer Networks» more ICN 2007»

Heuristic Approach of Optimal Code Allocation in High Speed Downlink Packet Access Networks

16 years 26 days ago

Download www.sce.carleton.ca

— In this paper, we use the Markov Decision Process (MDP) technique to ﬁnd the optimal code allocation policy in High-Speed Downlink Packet Access (HSDPA) networks. A discrete ...

Hussein Al-Zubaidy, Jerome Talim, Ioannis Lambadar...

claim paper

Read More »

« Prev « First page 42 / 54 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers