Search Sciweavers | Sciweavers

102 search results - page 14 / 21

» MDPs with Non-Deterministic Policies

ICML
2005
IEEE

Machine Learning

Bounded real-time dynamic programming: RTDP with monotone upper bounds and performance guarantees

16 years 7 months ago

Download www.cs.cmu.edu

MDPs are an attractive formalization for planning, but realistic problems often have intractably large state spaces. When we only need a partial policy to get from a fixed start s...

H. Brendan McMahan, Maxim Likhachev, Geoffrey J. G...

ICML
2001
IEEE

Machine Learning

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

16 years 7 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

AAAI
2010

Intelligent Agents

Trial-Based Dynamic Programming for Multi-Agent Planning

15 years 7 months ago

Download rbr.cs.umass.edu

Trial-based approaches offer an efficient way to solve single-agent MDPs and POMDPs. These approaches allow agents to focus their computations on regions of the environment they en...

Feng Wu, Shlomo Zilberstein, Xiaoping Chen

ICML
2005
IEEE

Machine Learning

Exploration and apprenticeship learning in reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...

Pieter Abbeel, Andrew Y. Ng

ICML
2007
IEEE

Machine Learning

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 7 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...
