Search Sciweavers | Sciweavers

67 search results - page 7 / 14

» Learning predictive state representations using non-blind po...

214

click to vote

NIPS
2004

94views Information Technology» more NIPS 2004»

Schema Learning: Experience-Based Construction of Predictive Action Models

15 years 8 months ago

Download books.nips.cc

Schema learning is a way to discover probabilistic, constructivist, predictive action models (schemas) from experience. It includes methods for finding and using hidden state to m...

Michael P. Holmes, Charles Lee Isbell Jr.

claim paper

Read More »

231

Voted

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 8 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

221

click to vote

RECOMB
2004
Springer

166views Computational Biology» more RECOMB 2004»

Learning Regulatory Network Models that Represent Regulator States and Roles

16 years 7 months ago

Download pages.cs.wisc.edu

Abstract. We present an approach to inferring probabilistic models of generegulatory networks that is intended to provide a more mechanistic representation of transcriptional regul...

Keith Noto, Mark Craven

claim paper

Read More »

202

Voted

AI
2000
Springer

154views Artificial Intelligence» more AI 2000»

Stochastic dynamic programming with factored representations

15 years 7 months ago

Download www.cs.tufts.edu

Markov decisionprocesses(MDPs) haveproven to be popular models for decision-theoretic planning, but standard dynamic programming algorithms for solving MDPs rely on explicit, stat...

Craig Boutilier, Richard Dearden, Moisés Go...

claim paper

Read More »

253

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 4 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

« Prev « First page 7 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers