Search Sciweavers | Sciweavers

1234 search results - page 210 / 247

» Multi-criteria Reinforcement Learning

167

click to vote

EUROGP
2009
Springer

130views Optimization» more EUROGP 2009»

One-Class Genetic Programming

15 years 10 months ago

Download users.cs.dal.ca

One-class classiﬁcation naturally only provides one-class of exemplars, the target class, from which to construct the classiﬁcation model. The one-class approach is constructed...

Robert Curry, Malcolm I. Heywood

claim paper

Read More »

138

click to vote

BMEI
2008
IEEE

153views Biomedical Imaging» more BMEI 2008»

A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy

15 years 10 months ago

Download eprints.lancs.ac.uk

Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...

Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...

claim paper

Read More »

144

click to vote

SASO
2008
IEEE

125views Control Systems» more SASO 2008»

Self-Adaptive Dissemination of Data in Dynamic Sensor Networks

15 years 10 months ago

Download www.datafusionlab.org

The distribution of data in large dynamic wireless sensor networks presents a difﬁcult problem due to node mobility, link failures, and trafﬁc congestion. In this paper, we pr...

David Dorsey, Bjorn Jay Carandang, Moshe Kam, Chri...

claim paper

Read More »

144

Voted

ICRA
2007
IEEE

155views Robotics» more ICRA 2007»

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

15 years 10 months ago

Download sugiyama-www.cs.titech.ac.jp

— The least squares approach works efﬁciently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

claim paper

Read More »

109

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 10 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

« Prev « First page 210 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers