Sciweavers

36 search results - page 2 / 8
» jmlr 2002
Sort
View
JMLR
2002
100views more  JMLR 2002»
13 years 10 months ago
On the Convergence of Optimistic Policy Iteration
We consider a finite-state Markov decision problem and establish the convergence of a special case of optimistic policy iteration that involves Monte Carlo estimation of Q-values,...
John N. Tsitsiklis
JMLR
2002
75views more  JMLR 2002»
13 years 10 months ago
Stability and Generalization
We define notions of stability for learning algorithms and show how to use these notions to derive generalization error bounds based on the empirical error and the leave-one-out e...
Olivier Bousquet, André Elisseeff
JMLR
2002
73views more  JMLR 2002»
13 years 10 months ago
Variational Learning of Clusters of Undercomplete Nonsymmetric Independent Components
We apply a variational method to automatically determine the number of mixtures of independent components in high-dimensional datasets, in which the sources may be nonsymmetricall...
Kwokleung Chan, Te-Won Lee, Terrence J. Sejnowski
JMLR
2002
102views more  JMLR 2002»
13 years 10 months ago
Efficient Algorithms for Decision Tree Cross-validation
Cross-validation is a useful and generally applicable technique often employed in machine learning, including decision tree induction. An important disadvantage of straightforward...
Hendrik Blockeel, Jan Struyf
JMLR
2002
74views more  JMLR 2002»
13 years 10 months ago
The Representational Power of Discrete Bayesian Networks
One of the most important fundamental properties of Bayesian networks is the representational power, reflecting what kind of functions they can or cannot represent. In this paper,...
Charles X. Ling, Huajie Zhang