Search Sciweavers | Sciweavers

683 search results - page 91 / 137

» Coarticulation in Markov Decision Processes

click to vote

CDC
2009
IEEE

147views Control Systems» more CDC 2009»

A probabilistic approach for control of a stochastic system from LTL specifications

13 years 6 months ago

Download people.bu.edu

We consider the problem of controlling a continuous-time linear stochastic system from a specification given as a Linear Temporal Logic (LTL) formula over a set of linear predicate...

Morteza Lahijanian, Sean B. Andersson, Calin Belta

claim paper

Read More »

click to vote

ICMCS
2009
IEEE

149views Multimedia» more ICMCS 2009»

A multi-agent framework for a hybrid dialog management system

13 years 6 months ago

Download www.mmk.ei.tum.de

The importance of dialog management systems has increased in recent years. Dialog systems are created for domain specific applications, so that a high demand for a flexible dialog...

Stefan Schwärzler, Joachim Schenk, Günth...

claim paper

Read More »

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

13 years 3 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

click to vote

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

13 years 3 months ago

Download sugiyama-www.cs.titech.ac.jp

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

claim paper

Read More »

click to vote

ICML
2007
IEEE

204views Machine Learning» more ICML 2007»

Constructing basis functions from directed graphs for value function approximation

14 years 9 months ago

Download www.machinelearning.org

Basis functions derived from an undirected graph connecting nearby samples from a Markov decision process (MDP) have proven useful for approximating value functions. The success o...

Jeffrey Johns, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 91 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers