Search Sciweavers | Sciweavers

128 search results - page 14 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

ECAL
2001
Springer

110 views, Artificial Intelligence

Evolution of Reinforcement Learning in Uncertain Environments: Emergence of Risk-Aversion and Matching

15 years 8 months ago

Download gandalf.psych.umn.edu

Reinforcement learning (RL) is a fundamental process by which organisms learn to achieve a goal from interactions with the environment. Using Artificial Life techniques we derive ...

Yael Niv, Daphna Joel, Isaac Meilijson, Eytan Rupp...

ATAL
2007
Springer

130 views, Intelligent Agents

Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective

15 years 10 months ago

Download www.aamas-conference.org

This paper presents the dynamics of multiple reinforcement learning agents from an Evolutionary Game Theoretic (EGT) perspective. We provide a Replicator Dynamics model for tradit...

Liviu Panait, Karl Tuyls

JAIR
2000

131 views

An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email

15 years 3 months ago

Download www.jair.org

This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal dialogue strategy from its experience interacting with human users. The method...

Marilyn A. Walker

ICML
2006
IEEE

101 views, Machine Learning

Qualitative reinforcement learning

16 years 4 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

IROS
2009
IEEE

206 views, Robotics

Bayesian reinforcement learning in continuous POMDPs with Gaussian processes

15 years 10 months ago

Download www.cs.cmu.edu

Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle real-world sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...
