Sciweavers

80 search results - page 4 / 16
» Efficient Reinforcement Learning Using Recursive Least-Squar...
Sort
View
ACL
2008
13 years 9 months ago
Semi-Supervised Convex Training for Dependency Parsing
We present a novel semi-supervised training algorithm for learning dependency parsers. By combining a supervised large margin loss with an unsupervised least squares loss, a discr...
Qin Iris Wang, Dale Schuurmans, Dekang Lin
ATAL
2009
Springer
14 years 2 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
ICASSP
2011
IEEE
12 years 11 months ago
A sliding-window online fast variational sparse Bayesian learning algorithm
In this work a new online learning algorithm that uses automatic relevance determination (ARD) is proposed for fast adaptive nonlinear filtering. A sequential decision rule for i...
Thomas Buchgraber, Dmitriy Shutin, H. Vincent Poor
IJON
2006
112views more  IJON 2006»
13 years 7 months ago
Palmprint recognition using FastICA algorithm and radial basis probabilistic neural network
This paper proposes a novel and successful method for recognizing palmprint based on radial basis probabilistic neural network (RBPNN) proposed by us. The RBPNN is trained by the ...
Li Shang, De-Shuang Huang, Ji-Xiang Du, Chun-Hou Z...
ICASSP
2011
IEEE
12 years 11 months ago
Adaptive modelling with tunable RBF network using multi-innovation RLS algorithm assisted by swarm intelligence
— In this paper, we propose a new on-line learning algorithm for the non-linear system identification: the swarm intelligence aided multi-innovation recursive least squares (SIM...
Hao Chen, Yu Gong, Xia Hong