Search Sciweavers | Sciweavers

113 search results - page 15 / 23

» Model Approximation for HEXQ Hierarchical Reinforcement Lear...

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

13 years 9 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

click to vote

ICML
2007
IEEE

139views Machine Learning» more ICML 2007»

Learning state-action basis functions for hierarchical MDPs

14 years 8 months ago

Download www.machinelearning.org

This paper introduces a new approach to actionvalue function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...

Sarah Osentoski, Sridhar Mahadevan

claim paper

Read More »

click to vote

PKDD
2009
Springer

152views Data Mining» more PKDD 2009»

Feature Selection for Value Function Approximation Using Bayesian Model Selection

14 years 2 months ago

Download userweb.cs.utexas.edu

Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...

Tobias Jung, Peter Stone

claim paper

Read More »

click to vote

GECCO
2006
Springer

195views Optimization» more GECCO 2006»

Studying XCS/BOA learning in Boolean functions: structure encoding and random Boolean functions

13 years 11 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Recently, studies with the XCS classifier system on Boolean functions have shown that in certain types of functions simple crossover operators can lead to disruption and, conseque...

Martin V. Butz, Martin Pelikan

claim paper

Read More »

click to vote

ICML
2005
IEEE

123views Machine Learning» more ICML 2005»

A model for handling approximate, noisy or incomplete labeling in text classification

14 years 8 months ago

Download www.cse.iitb.ac.in

We introduce a Bayesian model, BayesANIL, that is capable of estimating uncertainties associated with the labeling process. Given a labeled or partially labeled training corpus of...

Ganesh Ramakrishnan, Krishna Prasad Chitrapura, Ra...

claim paper

Read More »

« Prev « First page 15 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers