Search Sciweavers | Sciweavers

121 search results - page 1 / 25

» Learning Decision Theoretic Utilities through Reinforcement ...

131

click to vote

NIPS
1996

89views Information Technology» more NIPS 1996»

Learning Decision Theoretic Utilities through Reinforcement Learning

15 years 8 months ago

Download papers.cnl.salk.edu

Magnus Stensmo, Terrence J. Sejnowski

claim paper

Read More »

371

Voted

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

14 years 5 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

192

click to vote

FLAIRS
2010

148views Artificial Intelligence» more FLAIRS 2010»

Decision-Theoretic Simulated Annealing

15 years 4 months ago

Download cs.gettysburg.edu

The choice of a good annealing schedule is necessary for good performance of simulated annealing for combinatorial optimization problems. In this paper, we pose the simulated anne...

Todd W. Neller, Christopher J. La Pilla

claim paper

Read More »

174

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

16 years 1 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

188

click to vote

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

15 years 10 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

« Prev « First page 1 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers