Search Sciweavers | Sciweavers

23

ATAL
2006
Springer

136views Intelligent Agents» more ATAL 2006»

Resource allocation among agents with preferences induced by factored MDPs

13 years 11 months ago

Distributing scarce resources among agents in a way that maximizes the social welfare of the group is a computationally hard problem when the value of a resource bundle is not lin...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

26

click to vote

AIPS
2009

144views Artificial Intelligence» more AIPS 2009»

Efficient Solutions to Factored MDPs with Imprecise Transition Probabilities

13 years 8 months ago

Download www.ime.usp.br

When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) framework, it is often impossible to obtain a completely accurate estimate of tr...

Karina Valdivia Delgado, Scott Sanner, Leliane Nun...

claim paper

Read More »

23

click to vote

IJCNN
2008
IEEE

113views Neural Networks» more IJCNN 2008»

Uncertainty propagation for quality assurance in Reinforcement Learning

14 years 1 months ago

Download www.inb.uni-luebeck.de

— In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...

Daniel Schneegaß, Steffen Udluft, Thomas Mar...

claim paper

Read More »

35

click to vote

ISAAC
2010
Springer

243views Algorithms» more ISAAC 2010»

Lower Bounds for Howard's Algorithm for Finding Minimum Mean-Cost Cycles

13 years 5 months ago

Download www.daimi.au.dk

Howard's policy iteration algorithm is one of the most widely used algorithms for finding optimal policies for controlling Markov Decision Processes (MDPs). When applied to we...

Thomas Dueholm Hansen, Uri Zwick

claim paper

Read More »

22

click to vote

RAS
2010

131views more RAS 2010»

Probabilistic Policy Reuse for inter-task transfer learning

13 years 5 months ago

Download scalab.uc3m.es

Policy Reuse is a reinforcement learning technique that eﬃciently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration b...

Fernando Fernández, Javier García, M...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers