Search Sciweavers | Sciweavers

154

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

14 years 10 months ago

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

claim paper

Read More »

140

click to vote

AAAI
2011

144views Intelligent Agents» more AAAI 2011»

Differential Eligibility Vectors for Advantage Updating and Gradient Methods

14 years 3 months ago

Download gaips.inesc-id.pt

In this paper we propose differential eligibility vectors (DEV) for temporal-difference (TD) learning, a new class of eligibility vectors designed to bring out the contribution of...

Francisco S. Melo

claim paper

Read More »

131

click to vote

IWAN
2000
Springer

85views Computer Networks» more IWAN 2000»

Two Rule-Based Building-Block Architectures for Policy-Based Network Control

15 years 7 months ago

Download www.kanadas.com

Policy-based networks can be customized by users by injecting programs called policies into the network nodes. So if general-purpose functions can be specified in a policy-based ne...

Yasusi Kanada

claim paper

Read More »

130

click to vote

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

16 years 4 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

114

click to vote

APSEC
2004
IEEE

86views Software Engineering» more APSEC 2004»

Partitioning of Java Applications to Support Dynamic Updates

15 years 7 months ago

Download www.diku.dk

The requirement for 24/7 availability of distributed applications complicates their maintenance and evolution as shutting down such applications to perform updates may not be an a...

Robert Pawel Bialek, Eric Jul, Jean-Guy Schneider,...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers