Sciweavers

582 search results - page 93 / 117
» Gaussian Processes in Reinforcement Learning
Sort
View
ATAL
2006
Springer
14 years 1 months ago
Scalable and reliable data delivery in mobile ad hoc sensor networks
This paper studies scalable data delivery algorithms in mobile ad hoc sensor networks with node and link failures. Many algorithms have been developed for data delivery and fusion...
Bin Yu, Paul Scerri, Katia P. Sycara, Yang Xu, Mic...
PKDD
2010
Springer
122views Data Mining» more  PKDD 2010»
13 years 8 months ago
Exploration in Relational Worlds
Abstract. One of the key problems in model-based reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large relational domains, in wh...
Tobias Lang, Marc Toussaint, Kristian Kersting
NIPS
1993
13 years 11 months ago
Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming
Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...
Christopher G. Atkeson
ICASSP
2011
IEEE
13 years 1 months ago
An acoustically-motivated spatial prior for under-determined reverberant source separation
We consider the task of under-determined reverberant audio source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a ze...
Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gr...
SDM
2012
SIAM
294views Data Mining» more  SDM 2012»
12 years 2 days ago
Kernelized Probabilistic Matrix Factorization: Exploiting Graphs and Side Information
We propose a new matrix completion algorithm— Kernelized Probabilistic Matrix Factorization (KPMF), which effectively incorporates external side information into the matrix fac...
Tinghui Zhou, Hanhuai Shan, Arindam Banerjee, Guil...