Search Sciweavers | Sciweavers

813 search results - page 19 / 163

» Ensemble Algorithms in Reinforcement Learning

click to vote

ESANN
2004

117views Neural Networks» more ESANN 2004»

Online policy adaptation for ensemble classifiers

13 years 8 months ago

Download eprints.pascal-network.org

Ensemble algorithms can improve the performance of a given learning algorithm through the combination of multiple base classifiers into an ensemble. In this paper, the idea of usin...

Christos Dimitrakakis, Samy Bengio

posted by olethros

Read More »

click to vote

ATAL
2008
Springer

99views Intelligent Agents» more ATAL 2008»

Non-linear dynamics in multiagent reinforcement learning algorithms

13 years 9 months ago

Download www.aamas-conference.org

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Only a subset of these MARL algorithms both do not require agent...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

click to vote

ATAL
2011
Springer

199views Intelligent Agents» more ATAL 2011»

Metric learning for reinforcement learning agents

12 years 7 months ago

Download www.eecs.berkeley.edu

A key component of any reinforcement learning algorithm is the underlying representation used by the agent. While reinforcement learning (RL) agents have typically relied on hand-...

Matthew E. Taylor, Brian Kulis, Fei Sha

claim paper

Read More »

click to vote

AAAI
2007

68views Intelligent Agents» more AAAI 2007»

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

13 years 9 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

claim paper

Read More »

click to vote

AUSAI
2001
Springer

105views Artificial Intelligence» more AUSAI 2001»

Wrapping Boosters against Noise

13 years 12 months ago

Download www.cs.waikato.ac.nz

Abstract. Wrappers have recently been used to obtain parameter optimizations for learning algorithms. In this paper we investigate the use of a wrapper for estimating the correct n...

Bernhard Pfahringer, Geoffrey Holmes, Gabi Schmidb...

claim paper

Read More »

« Prev « First page 19 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers