Search Sciweavers | Sciweavers

62 search results - page 5 / 13

» Probabilistic inference for solving discrete and continuous ...

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

13 years 11 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

click to vote

ICASSP
2011
IEEE

167views Signal Processing» more ICASSP 2011»

A unified approach to real time audio-to-score and audio-to-audio alignment using sequential Montecarlo inference techniques

12 years 11 months ago

Download articles.ircam.fr

We present a methodology for the real time alignment of music signals using sequential Montecarlo inference techniques. The alignment problem is formulated as the state tracking o...

Nicola Montecchio, Arshia Cont

claim paper

Read More »

click to vote

ATAL
2008
Springer

116views Intelligent Agents» more ATAL 2008»

Controlling deliberation in a Markov decision process-based agent

13 years 9 months ago

Download coitweb.uncc.edu

Meta-level control manages the allocation of limited resources to deliberative actions. This paper discusses efforts in adding meta-level control capabilities to a Markov Decision...

George Alexander, Anita Raja, David J. Musliner

claim paper

Read More »

click to vote

AAAI
2007

88views Intelligent Agents» more AAAI 2007»

Continuous State POMDPs for Object Manipulation Tasks

13 years 9 months ago

Download www.aaai.org

My research focus is on using continuous state partially observable Markov decision processes (POMDPs) to perform object manipulation tasks using a robotic arm. During object mani...

Emma Brunskill

claim paper

Read More »

click to vote

QEST
2005
IEEE

137views Modeling and Simulation» more QEST 2005»

iLTLChecker: A Probabilistic Model Checker for Multiple DTMCs

14 years 1 months ago

Download osl.cs.uiuc.edu

iLTL is a probabilistic temporal logic that can specify properties of multiple discrete time Markov chains (DTMCs). In this paper, we describe two related tools: MarkovEstimator a...

YoungMin Kwon, Gul A. Agha

claim paper

Read More »

« Prev « First page 5 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers