Search Sciweavers | Sciweavers

21

ICPR
2006
IEEE

260views computer vision» more ICPR 2006»

Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

14 years 10 months ago

To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...

Siwei Luo, Yu Zheng, Ziang Lv

claim paper

Read More »

19

click to vote

ICML
2003
IEEE

137views Machine Learning» more ICML 2003»

BL-WoLF: A Framework For Loss-Bounded Learnability In Zero-Sum Games

14 years 9 months ago

Download www.hpl.hp.com

We present BL-WoLF, a framework for learnability in repeated zero-sum games where the cost of learning is measured by the losses the learning agent accrues (rather than the number...

Vincent Conitzer, Tuomas Sandholm

claim paper

Read More »

29

click to vote

ICCAD
2004
IEEE

107views Hardware» more ICCAD 2004»

Computation of signal threshold crossing times directly from higher order moments

14 years 5 months ago

Download www.eecs.northwestern.edu

—This paper introduces a simple method for calculating the times at which any signal crosses a prespecified threshold voltage (e.g., 10%, 20%, 50%, etc.) directly from the moment...

Yehea I. Ismail, Chirayu S. Amin

claim paper

Read More »

38

click to vote

ICRA
2008
IEEE

134views Robotics» more ICRA 2008»

An optimal filtering algorithm for non-parametric observation models in robot localization

14 years 3 months ago

Download babel.isa.uma.es

— The lack of a parameterized observation model in robot localization using occupancy grids requires the application of sampling-based methods, or particle ﬁlters. This work ad...

Jose-Luis Blanco, Javier Gonzalez, Juan-Antonio Fe...

claim paper

Read More »

23

click to vote

3DPVT
2006
IEEE

155views Visualization» more 3DPVT 2006»

The Reverse Projection Correlation Principle for Depth from Defocus

14 years 3 months ago

Download www.cim.mcgill.ca

In this paper, we address the problem of ﬁnding depth from defocus in a fundamentally new way. Most previous methods have used an approximate model in which blurring is shift in...

Scott McCloskey, Michael S. Langer, Kaleem Siddiqi

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers