Sciweavers

UAI 2001

Policy Improvement for POMDPs Using Normalized Importance Sampling

We present a new method for estimating the expected return of a POMDP from experience. The estimator does not assume any knowledge of the POMDP, can estimate the returns for finit...

Christian R. Shelton
