Search Sciweavers | Sciweavers

369 search results - page 11 / 74

» Global Optimization for Value Function Approximation

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

14 years 25 days ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

click to vote

CVPR
2008
IEEE

181views Computer Vision» more CVPR 2008»

Globally optimal bilinear programming for computer vision applications

14 years 10 months ago

Download vision.ucsd.edu

We present a practical algorithm that provably achieves the global optimum for a class of bilinear programs commonly arising in computer vision applications. Our approach relies o...

Manmohan Krishna Chandraker, David J. Kriegman

claim paper

Read More »

click to vote

AAAI
2008

151views Intelligent Agents» more AAAI 2008»

Generalized Point Based Value Iteration for Interactive POMDPs

13 years 11 months ago

Download www.aaai.org

We develop a point based method for solving finitely nested interactive POMDPs approximately. Analogously to point based value iteration (PBVI) in POMDPs, we maintain a set of bel...

Prashant Doshi, Dennis Perez

claim paper

Read More »

click to vote

NA
2007

120views more NA 2007»

On choosing "optimal" shape parameters for RBF approximation

13 years 8 months ago

Download amadeus.math.iit.edu

Many radial basis function (RBF) methods contain a free shape parameter that plays an important role for the accuracy of the method. In most papers the authors end up choosing this...

Gregory E. Fasshauer, Jack G. Zhang

claim paper

Read More »

click to vote

DEDS
2010

97views more DEDS 2010»

On Regression-Based Stopping Times

13 years 8 months ago

Download www.stanford.edu

We study approaches that fit a linear combination of basis functions to the continuation value function of an optimal stopping problem and then employ a greedy policy based on the...

Benjamin Van Roy

claim paper

Read More »

« Prev « First page 11 / 74 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers