Search Sciweavers | Sciweavers

802 search results - page 78 / 161

» Experts in a Markov Decision Process

157

click to vote

AMAI
2006
Springer

123views Artificial Intelligence» more AMAI 2006»

Symmetric approximate linear programming for factored MDPs with application to constrained problems

15 years 5 months ago

Download ai.stanford.edu

A weakness of classical Markov decision processes (MDPs) is that they scale very poorly due to the flat state-space representation. Factored MDPs address this representational pro...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

219

click to vote

AAAI
2011

246views Intelligent Agents» more AAAI 2011»

An Online Spectral Learning Algorithm for Partially Observable Nonlinear Dynamical Systems

14 years 5 months ago

Download www.cs.cmu.edu

Recently, a number of researchers have proposed spectral algorithms for learning models of dynamical systems—for example, Hidden Markov Models (HMMs), Partially Observable Marko...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

342

click to vote

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

14 years 4 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

128

click to vote

GECCO
2005
Springer

130views Optimization» more GECCO 2005»

ATNoSFERES revisited

15 years 11 months ago

Download www.cs.bham.ac.uk

ATNoSFERES is a Pittsburgh style Learning Classiﬁer System (LCS) in which the rules are represented as edges of an Augmented Transition Network. Genotypes are strings of tokens ...

Samuel Landau, Olivier Sigaud, Marc Schoenauer

claim paper

Read More »

163

click to vote

NIPS
2001

192views Information Technology» more NIPS 2001»

Predictive Representations of State

15 years 6 months ago

Download www.eecs.umich.edu

We show that states of a dynamical system can be usefully represented by multi-step, action-conditional predictions of future observations. State representations that are grounded...

Michael L. Littman, Richard S. Sutton, Satinder P....

claim paper

Read More »

« Prev « First page 78 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers