Sciweavers

» Calculi of Approximation Spaces
ML
2008
ACM
Machine Learning
15 years 4 months ago
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...
András Antos, Csaba Szepesvári, Ré...
NCA
2008
IEEE
15 years 4 months ago
Neurodynamic programming: a case study of the traveling salesman problem
The paper focuses on the study of solving the large-scale traveling salesman problem (TSP) based on neurodynamic programming. From this perspective, two methods, temporal differenc...
Jia Ma, Tao Yang, Zeng-Guang Hou, Min Tan, Derong ...
MCSS
2006
Springer
15 years 4 months ago
Global complete observability and output-to-state stability imply the existence of a globally convergent observer
In this paper we consider systems which are globally completely observable and output-to-state stable. The former property guarantees the existence of coordinates such that the dyna...
Alessandro Astolfi, Laurent Praly
FOCM
2007
15 years 4 months ago
Integration and Optimization of Multivariate Polynomials by Restriction onto a Random Subspace
Abstract. We consider the problem of efficient integration of an n-variate polynomial with respect to the Gaussian measure in R^n and related problems of complex integration and opt...
Alexander I. Barvinok
PAMI
2006
15 years 4 months ago
Model-Based Hand Tracking Using a Hierarchical Bayesian Filter
This paper sets out a tracking framework, which is applied to the recovery of three-dimensional hand motion from an image sequence. The method handles the issues of initialization,...
Björn Stenger, Arasanathan Thayananthan, Phil...