Search Sciweavers | Sciweavers

161 search results - page 24 / 33

» Iterative learning fuzzy control

130

Voted

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

14 years 9 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

claim paper

Read More »

150

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

15 years 8 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

100

click to vote

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

15 years 4 days ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

104

click to vote

IJCNN
2008
IEEE

113views Neural Networks» more IJCNN 2008»

Uncertainty propagation for quality assurance in Reinforcement Learning

15 years 8 months ago

Download www.inb.uni-luebeck.de

— In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...

Daniel Schneegaß, Steffen Udluft, Thomas Mar...

claim paper

Read More »

127

click to vote

IJCNLP
2005
Springer

146views Natural Language Processing» more IJCNLP 2005»

Heuristic Methods for Reducing Errors of Geographic Named Entities Learned by Bootstrapping

15 years 8 months ago

Download isoft.postech.ac.kr

Abstract. One of issues in the bootstrapping for named entity recognition is how to control annotation errors introduced at every iteration. In this paper, we present several heuri...

Seungwoo Lee, Gary Geunbae Lee

claim paper

Read More »

« Prev « First page 24 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers