Search Sciweavers | Sciweavers

771 search results - page 83 / 155

» Markov Decision Processes with Arbitrary Reward Processes

126

click to vote

ATAL
2007
Springer

110views Intelligent Agents» more ATAL 2007»

Autonomous nondeterministic tour guides: improving quality of experience with TTD-MDPs

15 years 10 months ago

Download andrewcantino.com

In this paper, we address the problem of building a system of autonomous agents for a complex environment, in our case, a museum with many visitors. Visitors may have varying pref...

Andrew S. Cantino, David L. Roberts, Charles L. Is...

claim paper

Read More »

158

click to vote

PRICAI
2000
Springer

193views Artificial Intelligence» more PRICAI 2000»

Generating Hierarchical Structure in Reinforcement Learning from State Variables

15 years 7 months ago

Download www.csee.umbc.edu

This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...

Bernhard Hengst

claim paper

Read More »

141

click to vote

ATAL
2010
Springer

136views Intelligent Agents» more ATAL 2010»

Quasi deterministic POMDPs and DecPOMDPs

15 years 5 months ago

Download www.damas.ift.ulaval.ca

In this paper, we study a particular subclass of partially observable models, called quasi-deterministic partially observable Markov decision processes (QDET-POMDPs), characterize...

Camille Besse, Brahim Chaib-draa

claim paper

Read More »

210

click to vote

CLIMA
2011

238views Intelligent Agents» more CLIMA 2011»

Verifying Team Formation Protocols with Probabilistic Model Checking

14 years 3 months ago

Download www.veriware.org

Multi-agent systems are an increasingly important software paradigm and in many of its applications agents cooperate to achieve a particular goal. This requires the design of efﬁ...

Taolue Chen, Marta Z. Kwiatkowska, David Parker, A...

claim paper

Read More »

155

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 4 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

« Prev « First page 83 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers