Search Sciweavers | Sciweavers

495 search results - page 64 / 99

» Constructing States for Reinforcement Learning

199

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

15 years 1 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

157

click to vote

ICML
2001
IEEE

159views Machine Learning» more ICML 2001»

Direct Policy Search using Paired Statistical Tests

16 years 4 months ago

Download www.autonlab.org

Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...

Malcolm J. A. Strens, Andrew W. Moore

claim paper

Read More »

135

click to vote

ICCS
1993
Springer

99views Applied Computing» more ICCS 1993»

Towards Domain-Independent Machine Intelligence

15 years 8 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS), is a learning system framework, which given little initial domain knowledge, increases its decision-making abilities in complex problems domains....

Robert Levinson

claim paper

Read More »

163

click to vote

CI
2005

106views more CI 2005»

Incremental Learning of Procedural Planning Knowledge in Challenging Environments

15 years 4 months ago

Download www.sunnyhome.org

Autonomous agents that learn about their environment can be divided into two broad classes. One class of existing learners, reinforcement learners, typically employ weak learning ...

Douglas J. Pearson, John E. Laird

claim paper

Read More »

140

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

14 years 11 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

« Prev « First page 64 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers