Search Sciweavers | Sciweavers

90 search results - page 2 / 18

» On the hardness of finding symmetries in Markov decision pro...

click to vote

NIPS
2004

103views Information Technology» more NIPS 2004»

Experts in a Markov Decision Process

13 years 8 months ago

Download books.nips.cc

We consider an MDP setting in which the reward function is allowed to change during each time step of play (possibly in an adversarial manner), yet the dynamics remain fixed. Simi...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour

claim paper

Read More »

click to vote

COLING
2010

138views Computational Linguistics» more COLING 2010»

Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes

13 years 2 months ago

Download aclweb.org

This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...

Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...

claim paper

Read More »

click to vote

AAAI
1997

139views Intelligent Agents» more AAAI 1997»

Model Minimization in Markov Decision Processes

13 years 8 months ago

Download www.cs.brown.edu

Many stochastic planning problems can be represented using Markov Decision Processes (MDPs). A difficulty with using these MDP representations is that the common algorithms for so...

Thomas Dean, Robert Givan

claim paper

Read More »

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

13 years 8 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

click to vote

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

14 years 8 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

« Prev « First page 2 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers