Search Sciweavers | Sciweavers

683 search results - page 51 / 137

» Coarticulation in Markov Decision Processes

click to vote

AIPS
2004

105views Artificial Intelligence» more AIPS 2004»

Decision-Theoretic Military Operations Planning

13 years 11 months ago

Download eprints.pascal-network.org

Military operations planning involves concurrent actions, resource assignment, and conflicting costs. Individual tasks sometimes fail with a known probability, promoting a decisio...

Douglas Aberdeen, Sylvie Thiébaux, Lin Zhan...

claim paper

Read More »

click to vote

AIPS
2000

129views Artificial Intelligence» more AIPS 2000»

Representations of Decision-Theoretic Planning Tasks

13 years 11 months ago

Download www.aaai.org

Goal-directed Markov Decision Process models (GDMDPs) are good models for many decision-theoretic planning tasks. They have been used in conjunction with two different reward stru...

Sven Koenig, Yaxin Liu

claim paper

Read More »

click to vote

AAAI
2008

144views Intelligent Agents» more AAAI 2008»

A Variance Analysis for POMDP Policy Evaluation

14 years 12 days ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes have been studied widely as a model for decision making under uncertainty, and a number of methods have been developed to find the s...

Mahdi Milani Fard, Joelle Pineau, Peng Sun

claim paper

Read More »

click to vote

ALT
2006
Springer

146views Machine Learning» more ALT 2006»

Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning

14 years 7 months ago

Download www.logos.t.u-tokyo.ac.jp

Abstract. Recently, some non-regular subclasses of context-free grammars have been found to be eﬃciently learnable from positive data. In order to use these eﬃcient algorithms ...

Takeshi Shibata, Ryo Yoshinaka, Takashi Chikayama

claim paper

Read More »

click to vote

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

14 years 11 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

« Prev « First page 51 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers