Search Sciweavers | Sciweavers

1512 search results - page 232 / 303

» Qualitative reinforcement learning

142

click to vote

BMEI
2008
IEEE

153views Biomedical Imaging» more BMEI 2008»

A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy

15 years 11 months ago

Download eprints.lancs.ac.uk

Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...

Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...

claim paper

Read More »

146

click to vote

SASO
2008
IEEE

125views Control Systems» more SASO 2008»

Self-Adaptive Dissemination of Data in Dynamic Sensor Networks

15 years 11 months ago

Download www.datafusionlab.org

The distribution of data in large dynamic wireless sensor networks presents a difﬁcult problem due to node mobility, link failures, and trafﬁc congestion. In this paper, we pr...

David Dorsey, Bjorn Jay Carandang, Moshe Kam, Chri...

claim paper

Read More »

147

click to vote

ICRA
2007
IEEE

155views Robotics» more ICRA 2007»

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

15 years 10 months ago

Download sugiyama-www.cs.titech.ac.jp

— The least squares approach works efﬁciently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

claim paper

Read More »

111

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 10 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

135

click to vote

CIMCA
2006
IEEE

164views Intelligent Agents» more CIMCA 2006»

Multi-Agent Coalition Formation for Long-Term Task or Mobile Network

15 years 10 months ago

Download digital.cs.usu.edu

Coalition formation is a process to form a group and solve a problem via cooperation. Because of the rising of network, each computing device can communicate through network. We c...

Hsiu-Hui Lee, Chung-Hsien Chen

claim paper

Read More »

« Prev « First page 232 / 303 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers