Sciweavers

2415 search results - page 328 / 483
» Markov Processes on Curves
JMLR
2010
Continuous Time Bayesian Network Reasoning and Learning Engine
We present a continuous time Bayesian network reasoning and learning engine (CTBN-RLE). A continuous time Bayesian network (CTBN) provides a compact (factored) description of a co...
Christian R. Shelton, Yu Fan, William Lam, Joon Le...
JMLR
2010
Variational methods for Reinforcement Learning
We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...
Thomas Furmston, David Barber
JMLR
2010
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
NN
2010
Springer
Efficient exploration through active learning for value function approximation in reinforcement learning
Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...
Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...
TITB
2010
Sleep staging based on signals acquired through bed sensor
We describe a system for the evaluation of the sleep macrostructure on the basis of Emfit sensor foils placed into bed mattress and of advanced signal processing. The signals on wh...
Juha M. Kortelainen, Martin O. Mendez, Anna M. Bia...