Search Sciweavers | Sciweavers

92 search results - page 9 / 19

» Game-based Abstraction for Markov Decision Processes

click to vote

ICML
1998
IEEE

165views Machine Learning» more ICML 1998»

Intra-Option Learning about Temporally Abstract Actions

14 years 10 months ago

Download www.cs.ualberta.ca

tion Learning about Temporally Abstract Actions Richard S. Sutton Department of Computer Science University of Massachusetts Amherst, MA 01003-4610 rich@cs.umass.edu Doina Precup D...

Richard S. Sutton, Doina Precup, Satinder P. Singh

claim paper

Read More »

click to vote

FOCS
2007
IEEE

174views Theoretical Computer Science» more FOCS 2007»

On the Complexity of Nash Equilibria and Other Fixed Points (Extended Abstract)

14 years 4 months ago

Download homepages.inf.ed.ac.uk

d Abstract) Kousha Etessami LFCS, School of Informatics University of Edinburgh Mihalis Yannakakis Department of Computer Science Columbia University We reexamine what it means to...

Kousha Etessami, Mihalis Yannakakis

claim paper

Read More »

click to vote

ICMLA
2009

185views Machine Learning» more ICMLA 2009»

Automatic Feature Selection for Model-Based Reinforcement Learning in Factored MDPs

13 years 7 months ago

Download staff.science.uva.nl

Abstract--Feature selection is an important challenge in machine learning. Unfortunately, most methods for automating feature selection are designed for supervised learning tasks a...

Mark Kroon, Shimon Whiteson

claim paper

Read More »

click to vote

GECCO
2004
Springer

142views Optimization» more GECCO 2004»

Improving MACS Thanks to a Comparison with 2TBNs

14 years 3 months ago

Download www.cs.york.ac.uk

Abstract. Factored Markov Decision Processes is the theoretical framework underlying multi-step Learning Classiﬁer Systems research. This framework is mostly used in the context ...

Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuil...

claim paper

Read More »

click to vote

ECSQARU
2001
Springer

118views Automated Reasoning» more ECSQARU 2001»

Space-Progressive Value Iteration: An Anytime Algorithm for a Class of POMDPs

14 years 2 months ago

Download www.cs.ust.hk

Abstract. Finding optimal policies for general partially observable Markov decision processes (POMDPs) is computationally difﬁcult primarily due to the need to perform dynamic-pr...

Nevin Lianwen Zhang, Weihong Zhang

claim paper

Read More »

« Prev « First page 9 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers