Sciweavers

227 search results - page 29 / 46
» Linearly Parameterized Bandits
ESANN
2007
13 years 10 months ago
Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning
In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists of actor updates which are achieved using natur...
Jan Peters, Stefan Schaal
AUTOMATICA
2008
13 years 8 months ago
Local stability analysis using simulations and sum-of-squares programming
The problem of computing bounds on the region-of-attraction for systems with polynomial vector fields is considered. Invariant subsets of the region-of-attraction are characterize...
Ufuk Topcu, Andrew K. Packard, Peter Seiler
ADCM
2006
13 years 8 months ago
Convex combination maps over triangulations, tilings, and tetrahedral meshes
In a recent paper by the first author, a simple proof was given of a result by Tutte on the validity of barycentric mappings, recast in terms of the injectivity of piecewise line...
Michael S. Floater, Valérie Pham-Trong
AUTOMATICA
2004
13 years 8 months ago
Drift-free attitude estimation for accelerated rigid bodies
In this paper we study the attitude estimation problem for an accelerated rigid body using gyros and accelerometers. The application in mind is that of a walking robot and particu...
Henrik Rehbinder, Xiaoming Hu
ML
2002
ACM
13 years 8 months ago
On Average Versus Discounted Reward Temporal-Difference Learning
We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...
John N. Tsitsiklis, Benjamin Van Roy