Search Sciweavers | Sciweavers

90 search results - page 11 / 18

» On the hardness of finding symmetries in Markov decision pro...

click to vote

AAAI
2006

86views Intelligent Agents» more AAAI 2006»

Targeting Specific Distributions of Trajectories in MDPs

13 years 8 months ago

Download www.cc.gatech.edu

We define TTD-MDPs, a novel class of Markov decision processes where the traditional goal of an agent is changed from finding an optimal trajectory through a state space to realiz...

David L. Roberts, Mark J. Nelson, Charles Lee Isbe...

claim paper

Read More »

click to vote

AAAI
2008

134views Intelligent Agents» more AAAI 2008»

Interaction Structure and Dimensionality Reduction in Decentralized MDPs

13 years 9 months ago

Download www.aaai.org

Decentralized Markov Decision Processes are a powerful general model of decentralized, cooperative multi-agent problem solving. The high complexity of the general problem leads to...

Martin Allen, Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

click to vote

CCE
2004

162views Software Engineering» more CCE 2004»

An algorithmic framework for improving heuristic solutions: Part II. A new version of the stochastic traveling salesman problem

13 years 7 months ago

Download www.che.gatech.edu

The algorithmic framework developed for improving heuristic solutions of the new version of deterministic TSP [Choi et al., 2002] is extended to the stochastic case. To verify the...

Jaein Choi, Jay H. Lee, Matthew J. Realff

claim paper

Read More »

click to vote

KDD
2008
ACM

142views Data Mining» more KDD 2008»

Efficient ticket routing by resolution sequence mining

14 years 7 months ago

Download www.public.asu.edu

IT problem management calls for quick identification of resolvers to reported problems. The efficiency of this process highly depends on ticket routing--transferring problem ticke...

Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos A...

claim paper

Read More »

click to vote

ICML
2007
IEEE

162views Machine Learning» more ICML 2007»

Automatic shaping and decomposition of reward functions

14 years 8 months ago

Download www.machinelearning.org

This paper investigates the problem of automatically learning how to restructure the reward function of a Markov decision process so as to speed up reinforcement learning. We begi...

Bhaskara Marthi

claim paper

Read More »

« Prev « First page 11 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers