Sciweavers

» A Better Update Policy
UAI
2008
13 years 9 months ago
Improving Gradient Estimation by Incorporating Sensor Data
An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...
Gregory Lawrence, Stuart J. Russell
ICASSP
2008
IEEE
14 years 2 months ago
Bayesian update of dialogue state for robust dialogue systems
This paper presents a new framework for accumulating beliefs in spoken dialogue systems. The technique is based on updating a Bayesian Network that represents the underlying state...
Blaise Thomson, Jost Schatzmann, Steve Young
ICIP
2002
IEEE
14 years 9 months ago
Building adaptive 2D wavelet decompositions by update lifting
This paper discusses a method for the construction of nonlinear 2D wavelet decompositions using an adaptive update lifting scheme. A very interesting aspect is that the decomposit...
Béatrice Pesquet-Popescu, Gemma Piella, Hen...
GECCO
2006
Springer
13 years 11 months ago
Prediction update algorithms for XCSF: RLS, Kalman filter, and gain adaptation
We study how different prediction update algorithms influence the performance of XCSF. We consider three classical parameter estimation algorithms (NLMS, RLS, and Kalman filter) a...
Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...
ECSQARU
2001
Springer
14 years 1 days ago
Space-Progressive Value Iteration: An Anytime Algorithm for a Class of POMDPs
Finding optimal policies for general partially observable Markov decision processes (POMDPs) is computationally difficult primarily due to the need to perform dynamic-pr...
Nevin Lianwen Zhang, Weihong Zhang