Search Sciweavers | Sciweavers

683 search results - page 90 / 137

» Coarticulation in Markov Decision Processes

click to vote

IJCAI
2003

130views Artificial Intelligence» more IJCAI 2003»

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

13 years 10 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for ﬁnding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

claim paper

Read More »

click to vote

AUTOMATICA
2008

74views more AUTOMATICA 2008»

Policy iteration based feedback control

13 years 9 months ago

Download www.cfins.au.tsinghua.edu.cn

It is well known that stochastic control systems can be viewed as Markov decision processes (MDPs) with continuous state spaces. In this paper, we propose to apply the policy iter...

Kan-Jian Zhang, Yan-Kai Xu, Xi Chen, Xi-Ren Cao

claim paper

Read More »

click to vote

CORR
2006
Springer

100views Education» more CORR 2006»

Capacity of Cooperative Fusion in the Presence of Byzantine Sensors

13 years 9 months ago

Download www.truststc.org

Abstract-- The problem of cooperative fusion in the presence of both Byzantine sensors and misinformed sensors is considered. An information theoretic formulation is used to charac...

Oliver Kosut, Lang Tong

claim paper

Read More »

click to vote

CORR
2000
Springer

129views Education» more CORR 2000»

Prosody-Based Automatic Segmentation of Speech into Sentences and Topics

13 years 8 months ago

Download www.speech.sri.com

A crucial step in processing speech audio data for information extraction, topic detection, or browsing/playback is to segment the input into sentence and topic units. Speech segm...

Elizabeth Shriberg, Andreas Stolcke, Dilek Z. Hakk...

claim paper

Read More »

click to vote

ISAAC
2010
Springer

243views Algorithms» more ISAAC 2010»

Lower Bounds for Howard's Algorithm for Finding Minimum Mean-Cost Cycles

13 years 7 months ago

Download www.daimi.au.dk

Howard's policy iteration algorithm is one of the most widely used algorithms for finding optimal policies for controlling Markov Decision Processes (MDPs). When applied to we...

Thomas Dueholm Hansen, Uri Zwick

claim paper

Read More »

« Prev « First page 90 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers