Search Sciweavers | Sciweavers

236 search results - page 38 / 48

» Bias and Variance Approximation in Value Function Estimates

241

click to vote

JAIR
2002

163views more JAIR 2002»

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

15 years 7 months ago

Download www.jair.org

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is main...

Xin Xu, Hangen He, Dewen Hu

claim paper

Read More »

210

Voted

ICML
2008
IEEE

117views Machine Learning» more ICML 2008»

Sample-based learning and search with permanent and transient memories

16 years 8 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller 0003, Richard S. ...

claim paper

Read More »

255

click to vote

SIAMCOMP
2010

174views more SIAMCOMP 2010»

On the Complexity of Nash Equilibria and Other Fixed Points

15 years 5 months ago

Download homepages.inf.ed.ac.uk

We reexamine what it means to compute Nash equilibria and, more generally, what it means to compute a ﬁxed point of a given Brouwer function, and we investigate the complexity o...

Kousha Etessami, Mihalis Yannakakis

claim paper

Read More »

262

Voted

EC
2011

240views ECommerce» more EC 2011»

HypE: An Algorithm for Fast Hypervolume-Based Many-Objective Optimization

15 years 2 months ago

Download www.tik.ee.ethz.ch

Abstract—In the ﬁeld of evolutionary multi-criterion optimization, the hypervolume indicator is the only single set quality measure that is known to be strictly monotonic with ...

Johannes Bader, Eckart Zitzler

claim paper

Read More »

201

click to vote

ICIP
2000
IEEE

155views Image Processing» more ICIP 2000»

Statistical Threshold Design for the Two-State Signal-Dependent Rank Order Mean Filter

16 years 9 months ago

Download vision.ece.ucsb.edu

The signal-dependent rank order mean (SD-ROM) ?lter is effective at removing high levels of impulse noise from 2D scalar-valued signals. Excellent results have been presented for ...

Michael S. Moore, Sanjit K. Mitra

claim paper

Read More »

« Prev « First page 38 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers