Search Sciweavers | Sciweavers

272 search results - page 13 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

click to vote

ICML
2009
IEEE

123views Machine Learning» more ICML 2009»

Constraint relaxation in approximate linear programs

14 years 8 months ago

Download anytime.cs.umass.edu

Approximate Linear Programming (ALP) is a reinforcement learning technique with nice theoretical properties, but it often performs poorly in practice. We identify some reasons for...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

14 years 8 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

click to vote

ICML
2004
IEEE

214views Machine Learning» more ICML 2004»

Apprenticeship learning via inverse reinforcement learning

14 years 8 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

click to vote

GECCO
2004
Springer

122views Optimization» more GECCO 2004»

Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems

14 years 26 days ago

Download www.cs.york.ac.uk

This paper introduces a gradient-based reward prediction update mechanism to the XCS classiﬁer system as applied in neuralnetwork type learning and function approximation mechani...

Martin V. Butz, David E. Goldberg, Pier Luca Lanzi

claim paper

Read More »

click to vote

ICRA
2009
IEEE

259views Robotics» more ICRA 2009»

Constructing action set from basis functions for reinforcement learning of robot control

14 years 2 months ago

Download robotics.aist-nara.ac.jp

Abstract— Continuous action sets are used in many reinforcement learning (RL) applications in robot control since the control input is continuous. However, discrete action sets a...

Akihiko Yamaguchi, Jun Takamatsu, Tsukasa Ogasawar...

claim paper

Read More »

« Prev « First page 13 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers