Search Sciweavers | Sciweavers

227 search results - page 29 / 46

» Linearly Parameterized Bandits

ESANN
2007

Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning

In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists of actor updates which are achieved using natur...

Jan Peters, Stefan Schaal

AUTOMATICA
2008

Local stability analysis using simulations and sum-of-squares programming

The problem of computing bounds on the region-of-attraction for systems with polynomial vector fields is considered. Invariant subsets of the region-of-attraction are characterize...

Ufuk Topcu, Andrew K. Packard, Peter Seiler

ADCM
2006

Convex combination maps over triangulations, tilings, and tetrahedral meshes

In a recent paper by the first author, a simple proof was given of a result by Tutte on the validity of barycentric mappings, recast in terms of the injectivity of piecewise line...

Michael S. Floater, Valérie Pham-Trong

AUTOMATICA
2004

Drift-free attitude estimation for accelerated rigid bodies

In this paper we study the attitude estimation problem for an accelerated rigid body using gyros and accelerometers. The application in mind is that of a walking robot and particu...

Henrik Rehbinder, Xiaoming Hu

ML
2002
ACM

On Average Versus Discounted Reward Temporal-Difference Learning

We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...

John N. Tsitsiklis, Benjamin Van Roy
