Sciweavers


CORR
2002
Springer


Self-Optimizing and Pareto-Optimal Policies in General Environments based on Bayes-Mixtures

15 years 6 months ago

The problem of making sequential decisions in unknown probabilistic environments is studied. In cycle t, action y_t results in perception x_t and reward r_t, where all quantities in g...

Marcus Hutter

