Search Sciweavers | Sciweavers

332 search results - page 24 / 67

» Ranking policies in discrete Markov decision processes

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

13 years 5 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

click to vote

AAAI
2010

172views Intelligent Agents» more AAAI 2010»

Using Bisimulation for Policy Transfer in MDPs

13 years 9 months ago

Download www.cs.mcgill.ca

Knowledge transfer has been suggested as a useful approach for solving large Markov Decision Processes. The main idea is to compute a decision-making policy in one environment and...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

click to vote

AUTOMATICA
2008

74views more AUTOMATICA 2008»

Policy iteration based feedback control

13 years 8 months ago

Download www.cfins.au.tsinghua.edu.cn

It is well known that stochastic control systems can be viewed as Markov decision processes (MDPs) with continuous state spaces. In this paper, we propose to apply the policy iter...

Kan-Jian Zhang, Yan-Kai Xu, Xi Chen, Xi-Ren Cao

claim paper

Read More »

click to vote

IJCAI
2007

175views Artificial Intelligence» more IJCAI 2007»

An Experts Algorithm for Transfer Learning

13 years 9 months ago

Download www.ijcai.org

A long-lived agent continually faces new tasks in its environment. Such an agent may be able to use knowledge learned in solving earlier tasks to produce candidate policies for it...

Erik Talvitie, Satinder Singh

claim paper

Read More »

click to vote

IJCAI
2003

130views Artificial Intelligence» more IJCAI 2003»

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

13 years 9 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for ﬁnding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

claim paper

Read More »

« Prev « First page 24 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers