Search Sciweavers | Sciweavers

23

ICMLA
2007

126views Machine Learning» more ICMLA 2007»

Learning to evaluate conditional partial plans

13 years 9 months ago

In our research we study rational agents which learn how to choose the best conditional, partial plan in any situation. The agent uses an incomplete symbolic inference engine, emp...

Slawomir Nowaczyk, Jacek Malec

claim paper

Read More »

21

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 8 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

34

click to vote

PUK
2000

130views Computer Science» more PUK 2000»

Knowledge-Based Control of Decision Theoretic Planning - Adaptive Planning Model Selection

13 years 8 months ago

Download www-is.informatik.uni-oldenburg.de

This paper proposes a new planning architecture for agents operating in uncertain and dynamic environments. Decisiontheoretic planning has been recognized as a useful tool for rea...

Jun Miura, Yoshiaki Shirai

claim paper

Read More »

28

click to vote

AAAI
1996

197views Intelligent Agents» more AAAI 1996»

Computing Optimal Policies for Partially Observable Decision Processes Using Compact Representations

13 years 8 months ago

Download people.cs.ubc.ca

: Partially-observable Markov decision processes provide a very general model for decision-theoretic planning problems, allowing the trade-offs between various courses of actions t...

Craig Boutilier, David Poole

claim paper

Read More »

21

click to vote

ICTAI
2009
IEEE

86views Artificial Intelligence» more ICTAI 2009»

TiMDPpoly: An Improved Method for Solving Time-Dependent MDPs

13 years 5 months ago

Download www.montefiore.ulg.ac.be

We introduce TiMDPpoly, an algorithm designed to solve planning problems with durative actions, under probabilistic uncertainty, in a non-stationary, continuous-time context. Miss...

Emmanuel Rachelson, Patrick Fabiani, Fréd&e...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers