Search Sciweavers | Sciweavers

181 search results - page 28 / 37

» State Space Reduction For Hierarchical Reinforcement Learnin...

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

13 years 5 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

click to vote

NIPS
2008

125views Information Technology» more NIPS 2008»

Multi-resolution Exploration in Continuous Spaces

13 years 9 months ago

Download www.research.rutgers.edu

The essence of exploration is acting to try to decrease uncertainty. We propose a new methodology for representing uncertainty in continuous-state control problems. Our approach, ...

Ali Nouri, Michael L. Littman

claim paper

Read More »

click to vote

FLAIRS
2008

132views Artificial Intelligence» more FLAIRS 2008»

Learning Continuous Action Models in a Real-Time Strategy Environment

13 years 9 months ago

Download www.knexusresearch.com

Although several researchers have integrated methods for reinforcement learning (RL) with case-based reasoning (CBR) to model continuous action spaces, existing integrations typic...

Matthew Molineaux, David W. Aha, Philip Moore

claim paper

Read More »

click to vote

LION
2007
Springer

192views Optimization» more LION 2007»

Learning While Optimizing an Unknown Fitness Surface

14 years 1 months ago

Download www.science.unitn.it

This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...

Roberto Battiti, Mauro Brunato, Paolo Campigotto

claim paper

Read More »

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

14 years 4 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

« Prev « First page 28 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers