Sciweavers

236 search results - page 42 / 48
» Bias and Variance Approximation in Value Function Estimates
Sort
View
BMCBI
2008
177views more  BMCBI 2008»
13 years 8 months ago
Baseline Correction for NMR Spectroscopic Metabolomics Data Analysis
Background: We propose a statistically principled baseline correction method, derived from a parametric smoothing model. It uses a score function to describe the key features of b...
Yuanxin Xi, David M. Rocke
ECML
2005
Springer
14 years 2 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
BMCBI
2008
127views more  BMCBI 2008»
13 years 8 months ago
Gene and pathway identification with Lp penalized Bayesian logistic regression
Background: Identifying genes and pathways associated with diseases such as cancer has been a subject of considerable research in recent years in the area of bioinformatics and co...
Zhenqiu Liu, Ronald B. Gartenhaus, Ming Tan, Feng ...
JMLR
2010
137views more  JMLR 2010»
13 years 3 months ago
Importance Sampling for Continuous Time Bayesian Networks
A continuous time Bayesian network (CTBN) uses a structured representation to describe a dynamic system with a finite number of states which evolves in continuous time. Exact infe...
Yu Fan, Jing Xu, Christian R. Shelton
NIPS
2007
13 years 10 months ago
Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods
Learning in real-world domains often requires to deal with continuous state and action spaces. Although many solutions have been proposed to apply Reinforcement Learning algorithm...
Alessandro Lazaric, Marcello Restelli, Andrea Bona...