Search Sciweavers | Sciweavers

332 search results - page 36 / 67

» Ranking policies in discrete Markov decision processes

click to vote

EXACT
2008

128views Applied Computing» more EXACT 2008»

Explaining recommendations generated by MDPs

13 years 10 months ago

Download www.cs.uwaterloo.ca

There has been little work in explaining recommendations generated by Markov Decision Processes (MDPs). We analyze the difculty of explaining policies computed automatically and id...

Omar Zia Khan, Pascal Poupart, James P. Black

claim paper

Read More »

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

13 years 7 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

click to vote

NIPS
2004

224views Information Technology» more NIPS 2004»

Approximately Efficient Online Mechanism Design

13 years 9 months ago

Download www.cs.cmu.edu

Online mechanism design (OMD) addresses the problem of sequential decision making in a stochastic environment with multiple self-interested agents. The goal in OMD is to make valu...

David C. Parkes, Satinder P. Singh, Dimah Yanovsky

claim paper

Read More »

click to vote

IAT
2005
IEEE

132views Intelligent Agents» more IAT 2005»

Decomposing Large-Scale POMDP Via Belief State Analysis

14 years 1 months ago

Download www.comp.hkbu.edu.hk

Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing ...

Xin Li, William K. Cheung, Jiming Liu

claim paper

Read More »

click to vote

JMLR
2006

116views more JMLR 2006»

Point-Based Value Iteration for Continuous POMDPs

13 years 8 months ago

Download jmlr.csail.mit.edu

We propose a novel approach to optimize Partially Observable Markov Decisions Processes (POMDPs) defined on continuous spaces. To date, most algorithms for model-based POMDPs are ...

Josep M. Porta, Nikos A. Vlassis, Matthijs T. J. S...

claim paper

Read More »

« Prev « First page 36 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers