Search Sciweavers | Sciweavers

236 search results - page 42 / 48

» Bias and Variance Approximation in Value Function Estimates

211

click to vote

BMCBI
2008

177views more BMCBI 2008»

Baseline Correction for NMR Spectroscopic Metabolomics Data Analysis

15 years 7 months ago

Download www.biomedcentral.com

Background: We propose a statistically principled baseline correction method, derived from a parametric smoothing model. It uses a score function to describe the key features of b...

Yuanxin Xi, David M. Rocke

claim paper

Read More »

229

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

16 years 1 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

230

click to vote

BMCBI
2008

127views more BMCBI 2008»

Gene and pathway identification with Lp penalized Bayesian logistic regression

15 years 7 months ago

Download www.biomedcentral.com

Background: Identifying genes and pathways associated with diseases such as cancer has been a subject of considerable research in recent years in the area of bioinformatics and co...

Zhenqiu Liu, Ronald B. Gartenhaus, Ming Tan, Feng ...

claim paper

Read More »

222

click to vote

JMLR
2010

137views more JMLR 2010»

Importance Sampling for Continuous Time Bayesian Networks

15 years 2 months ago

Download jmlr.csail.mit.edu

A continuous time Bayesian network (CTBN) uses a structured representation to describe a dynamic system with a finite number of states which evolves in continuous time. Exact infe...

Yu Fan, Jing Xu, Christian R. Shelton

claim paper

Read More »

217

click to vote

NIPS
2007

158views Information Technology» more NIPS 2007»

Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods

15 years 9 months ago

Download books.nips.cc

Learning in real-world domains often requires to deal with continuous state and action spaces. Although many solutions have been proposed to apply Reinforcement Learning algorithm...

Alessandro Lazaric, Marcello Restelli, Andrea Bona...

claim paper

Read More »

« Prev « First page 42 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers