Search Sciweavers | Sciweavers


NIPS
1998

137 views, Information Technology

15 years 4 months ago

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch


ICML
2008
IEEE

135 views, Machine Learning

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 3 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy


ECAL
2001
Springer

110 views, Artificial Intelligence

Evolution of Reinforcement Learning in Uncertain Environments: Emergence of Risk-Aversion and Matching

15 years 7 months ago

Download gandalf.psych.umn.edu

Reinforcement learning (RL) is a fundamental process by which organisms learn to achieve a goal from interactions with the environment. Using Artificial Life techniques we derive ...

Yael Niv, Daphna Joel, Isaac Meilijson, Eytan Rupp...


ALT
2006
Springer

86 views, Machine Learning

Risk-Sensitive Online Learning

16 years 2 days ago

Download www.cis.upenn.edu

We consider the problem of online learning in settings in which we want to compete not simply with the rewards of the best expert or stock, but with the best trade-off between rew...

Eyal Even-Dar, Michael J. Kearns, Jennifer Wortman


ICML
1996
IEEE

162 views, Machine Learning

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 3 months ago

Download reference.kfupm.edu.sa

Research in reinforcement learning (RL) has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the average reward frame...

Sridhar Mahadevan
