Search Sciweavers | Sciweavers

91 search results - page 8 / 19

» Optimality of affine policies in multi-stage robust optimiza...

ICMLA
2010

211 views, Machine Learning

Ensembles of Neural Networks for Robust Reinforcement Learning

13 years 5 months ago

Download ahans.de

Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, their traini...

Alexander Hans, Steffen Udluft

ICC
2008
IEEE

169 views, Communications

Optimality of Myopic Sensing in Multi-Channel Opportunistic Access

14 years 2 months ago

Download www.ece.ucdavis.edu

We consider opportunistic communications over multiple channels where the state (“good” or “bad”) of each channel evolves as independent and identically distributed Mark...

Tara Javidi, Bhaskar Krishnamachari, Qing Zhao, Mi...

CORR
2010
Springer

170 views, Education

Global Optimization for Value Function Approximation

13 years 7 months ago

Download www.cs.umass.edu

Existing value function approximation methods have been successfully used in many applications, but they often lack useful a priori error bounds. We propose a new approximate bili...

Marek Petrik, Shlomo Zilberstein

ATAL
2007
Springer

141 views, Intelligent Agents

Commitment-driven distributed joint policy search

14 years 1 months ago

Download www-personal.umich.edu

Decentralized MDPs provide powerful models of interactions in multi-agent environments, but are often very difficult or even computationally infeasible to solve optimally. Here we...

Stefan J. Witwicki, Edmund H. Durfee

ECCV
2002
Springer

171 views, Computer Vision

Robust Parameterized Component Analysis

14 years 9 months ago

Download www.cs.brown.edu

Principal Component Analysis (PCA) has been successfully applied to construct linear models of shape, graylevel, and motion. In particular, PCA has been widely used to model the var...

Fernando De la Torre, Michael J. Black
