Search Sciweavers | Sciweavers

50 search results - page 3 / 10

» Approximate state estimation in multiagent settings with con...

click to vote

ECML
2006
Springer

212views Machine Learning» more ECML 2006»

Deconvolutive Clustering of Markov States

13 years 11 months ago

Download www.cs.bham.ac.uk

In this paper we formulate the problem of grouping the states of a discrete Markov chain of arbitrary order simultaneously with deconvolving its transition probabilities. As the na...

Ata Kabán, Xin Wang

claim paper

Read More »

click to vote

AAAI
2008

123views Intelligent Agents» more AAAI 2008»

Towards Faster Planning with Continuous Resources in Stochastic Domains

13 years 10 months ago

Download www.aaai.org

Agents often have to construct plans that obey resource limits for continuous resources whose consumption can only be characterized by probability distributions. While Markov Deci...

Janusz Marecki, Milind Tambe

claim paper

Read More »

click to vote

AMAI
2004
Springer

164views Artificial Intelligence» more AMAI 2004»

A Framework for Sequential Planning in Multi-Agent Settings

14 years 1 months ago

Download www.jair.org

This paper extends the framework of partially observable Markov decision processes (POMDPs) to multi-agent settings by incorporating the notion of agent models into the state spac...

Piotr J. Gmytrasiewicz, Prashant Doshi

claim paper

Read More »

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

13 years 7 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

click to vote

ICRA
2009
IEEE

259views Robotics» more ICRA 2009»

Constructing action set from basis functions for reinforcement learning of robot control

14 years 2 months ago

Download robotics.aist-nara.ac.jp

Abstract— Continuous action sets are used in many reinforcement learning (RL) applications in robot control since the control input is continuous. However, discrete action sets a...

Akihiko Yamaguchi, Jun Takamatsu, Tsukasa Ogasawar...

claim paper

Read More »

« Prev « First page 3 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers