Sciweavers

1476 search results - page 92 / 296
» Robust constraint-consistent learning
Sort
View
ICML
2000
IEEE
14 years 10 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
ICML
2007
IEEE
14 years 10 months ago
Beamforming using the relevance vector machine
Beamformers are spatial filters that pass source signals in particular focused locations while suppressing interference from elsewhere. The widely-used minimum variance adaptive b...
David P. Wipf, Srikantan S. Nagarajan
ALT
1999
Springer
14 years 1 months ago
On the Uniform Learnability of Approximations to Non-Recursive Functions
Abstract. Blum and Blum (1975) showed that a class B of suitable recursive approximations to the halting problem is reliably EX-learnable. These investigations are carried on by sh...
Frank Stephan, Thomas Zeugmann
ICPR
2008
IEEE
14 years 11 months ago
A discrete-time parallel update algorithm for distributed learning
We present a distributed machine learning framework based on support vector machines that allows classification problems to be solved iteratively through parallel update algorithm...
Christian Bauckhage, Tansu Alpcan
IROS
2009
IEEE
120views Robotics» more  IROS 2009»
14 years 4 months ago
Interactive learning of visually symmetric objects
— This paper describes a robotic system that learns visual models of symmetric objects autonomously. Our robot learns by physically interacting with an object using its end effec...
Wai Ho Li, Lindsay Kleeman