Search Sciweavers | Sciweavers

UAI
2001

Vector-space Analysis of Belief-state Approximation for POMDPs

We propose a new approach to value-directed belief state approximation for POMDPs. The value-directed model allows one to choose approximation methods for belief state monitoring tha...

Pascal Poupart, Craig Boutilier

UAI
2001

Policy Improvement for POMDPs Using Normalized Importance Sampling

Download www.cs.ucr.edu

We present a new method for estimating the expected return of a POMDP from experience. The estimator does not assume any knowledge of the POMDP, can estimate the returns for finit...

Christian R. Shelton

UAI
2001

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

UAI
2004

Evidence-invariant Sensitivity Bounds

Download uai.sis.pitt.edu

The sensitivities revealed by a sensitivity analysis of a probabilistic network typically depend on the entered evidence. For a real-life network therefore, the analysis is perfor...

Silja Renooij, Linda C. van der Gaag

UAI
2004

The Minimum Information Principle for Discriminative Learning

Download eprints.pascal-network.org

Exponential models of distributions are widely used in machine learning for classification and modelling. It is well known that they can be interpreted as maximum entropy models u...

Amir Globerson, Naftali Tishby
