Sciweavers

908 search results - page 46 / 182
» Stochastic Finite Learning
Sort
View
ICML
2010
IEEE
13 years 9 months ago
Convergence of Least Squares Temporal Difference Methods Under General Conditions
We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...
Huizhen Yu
ICML
2005
IEEE
14 years 9 months ago
Finite time bounds for sampling based fitted value iteration
In this paper we consider sampling based fitted value iteration for discounted, large (possibly infinite) state space, finite action Markovian Decision Problems where only a gener...
Csaba Szepesvári, Rémi Munos
LFCS
1992
Springer
14 years 25 days ago
Machine Learning of Higher Order Programs
A generator program for a computable function (by definition) generates an infinite sequence of programs all but finitely many of which compute that function. Machine learning of ...
Ganesh Baliga, John Case, Sanjay Jain, Mandayam Su...
NAACL
1994
13 years 10 months ago
A Report of Recent Progress in Transformation-Based Error-Driven Learning
Most recent research in trainable part of speech taggers has explored stochastic tagging. While these taggers obtain high accuracy, linguistic information is captured indirectly, ...
Eric Brill
NIPS
2004
13 years 10 months ago
A Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning
We consider the situation in semi-supervised learning, where the "label sampling" mechanism stochastically depends on the true response (as well as potentially on the fe...
Saharon Rosset, Ji Zhu, Hui Zou, Trevor Hastie