Sciweavers

704 search results - page 45 / 141
» Improved Learning of AC0 Functions
Sort
View
ICML
2000
IEEE
14 years 9 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
IADIS
2008
13 years 10 months ago
An Adaptive Learning Management System With Support For 3d Collaboration
This paper describes the development of an Adaptive Learning Management System with Support for 3D Collaboration (ALMaS-3D) and presents an evaluation of its main functionalities....
Claudio Kirner, Clodonil H. Trigo, Tereza G. Kirne...
ICIP
2008
IEEE
14 years 10 months ago
Long term learning for image retrieval over networks
In this paper, we present a long term learning system for content based image retrieval over a network. Relevant feedback is used among different sessions to learn both the simila...
David Picard, Arnaud Revel, Matthieu Cord
NIPS
2008
13 years 10 months ago
Multi-task Gaussian Process Learning of Robot Inverse Dynamics
The inverse dynamics problem for a robotic manipulator is to compute the torques needed at the joints to drive it along a given trajectory; it is beneficial to be able to learn th...
Kian Ming Adam Chai, Christopher K. I. Williams, S...
PKDD
2010
Springer
160views Data Mining» more  PKDD 2010»
13 years 7 months ago
Entropy and Margin Maximization for Structured Output Learning
Abstract. We consider the problem of training discriminative structured output predictors, such as conditional random fields (CRFs) and structured support vector machines (SSVMs)....
Patrick Pletscher, Cheng Soon Ong, Joachim M. Buhm...