Search Sciweavers | Sciweavers

3381 search results - page 93 / 677

» LEO - DB2's LEarning Optimizer

135

click to vote

TSMC
2002

102views more TSMC 2002»

Generalized pursuit learning schemes: new families of continuous and discretized learning automata

15 years 4 months ago

Download ce.sharif.edu

The fastest learning automata (LA) algorithms currently available fall in the family of estimator algorithms introduced by Thathachar and Sastry [24]. The pioneering work of these ...

M. Agache, B. John Oommen

claim paper

Read More »

110

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Margin Distribution and Learning

16 years 5 months ago

Download l2r.cs.uiuc.edu

Recent theoretical results have shown that improved bounds on generalization error of classifiers can be obtained by explicitly taking the observed margin distribution of the trai...

Ashutosh Garg, Dan Roth

claim paper

Read More »

131

click to vote

ICML
2001
IEEE

145views Machine Learning» more ICML 2001»

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

16 years 5 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

claim paper

Read More »

123

click to vote

ALT
2008
Springer

171views Machine Learning» more ALT 2008»

Active Learning in Multi-armed Bandits

16 years 1 months ago

Download www.sztaki.hu

In this paper we consider the problem of actively learning the mean values of distributions associated with a ﬁnite number of options (arms). The algorithms can select which opti...

András Antos, Varun Grover, Csaba Szepesv&a...

claim paper

Read More »

127

click to vote

IJCAI
2003

130views Artificial Intelligence» more IJCAI 2003»

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

15 years 5 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for ﬁnding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

claim paper

Read More »

« Prev « First page 93 / 677 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers