Search Sciweavers | Sciweavers

81 search results - page 9 / 17

» The Optimal Reward Baseline for Gradient-Based Reinforcement...

click to vote

ATAL
2007
Springer

130views Intelligent Agents» more ATAL 2007»

Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective

14 years 1 months ago

Download www.aamas-conference.org

This paper presents the dynamics of multiple reinforcement learning agents from an Evolutionary Game Theoretic (EGT) perspective. We provide a Replicator Dynamics model for tradit...

Liviu Panait, Karl Tuyls

claim paper

Read More »

click to vote

JAIR
2000

131views more JAIR 2000»

An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email

13 years 7 months ago

Download www.jair.org

This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal dialogue strategy from its experience interacting with human users. The method...

Marilyn A. Walker

claim paper

Read More »

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

14 years 8 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

click to vote

IROS
2009
IEEE

206views Robotics» more IROS 2009»

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

14 years 2 months ago

Download www.cs.cmu.edu

— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

claim paper

Read More »

click to vote

AAMAS
2005
Springer

126views Intelligent Agents» more AAMAS 2005»

Learning to Coordinate Using Commitment Sequences in Cooperative Multi-agent Systems

14 years 1 months ago

Download como.vub.ac.be

We report on an investigation of the learning of coordination in cooperative multi-agent systems. Speciﬁcally, we study solutions that are applicable to independent agents i.e. ...

Spiros Kapetanakis, Daniel Kudenko, Malcolm J. A. ...

claim paper

Read More »

« Prev « First page 9 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers