Search Sciweavers | Sciweavers

15 search results - page 3 / 3

» On the Worst-Case Analysis of Temporal-Difference Learning A...

STACS
1999
Springer

117 views, Theoretical Computer Science

A Complete and Tight Average-Case Analysis of Learning Monomials

13 years 11 months ago

Download www-alg.ist.hokudai.ac.jp

Abstract. We advocate to analyze the average complexity of learning problems. An appropriate framework for this purpose is introduced. Based on it we consider the problem of learni...

Rüdiger Reischuk, Thomas Zeugmann

ICONIP
2009

107 views, Information Technology

Tracking in Reinforcement Learning

13 years 5 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

ICML
2004
IEEE

172 views, Machine Learning

Large margin hierarchical classification

14 years 8 months ago

Download web.mac.com

We present an algorithmic framework for supervised classification learning where the set of labels is organized in a predefined hierarchical structure. This structure is encoded b...

Ofer Dekel, Joseph Keshet, Yoram Singer

GECCO
2010
Springer

155 views, Optimization

Negative selection algorithms without generating detectors

14 years 9 days ago

Download www.tcs.uni-luebeck.de

Negative selection algorithms are immune-inspired classifiers that are trained on negative examples only. Classification is performed by generating detectors that match none of ...

Maciej Liskiewicz, Johannes Textor

ATAL
2008
Springer

176 views, Intelligent Agents

Analysis of an evolutionary reinforcement learning method in a multiagent domain

13 years 9 months ago

Download www.aamas-conference.org

Many multiagent problems comprise subtasks which can be considered as reinforcement learning (RL) problems. In addition to classical temporal difference methods, evolutionary algo...

Jan Hendrik Metzen, Mark Edgington, Yohannes Kassa...
