Search Sciweavers | Sciweavers

91 search results - page 2 / 19

» Event-based control using quadratic approximate value functi...

145

click to vote

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

14 years 9 months ago

Download sugiyama-www.cs.titech.ac.jp

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

claim paper

Read More »

142

click to vote

CDC
2010
IEEE

89views Control Systems» more CDC 2010»

Stochastic approximation for consensus with general time-varying weight matrices

14 years 10 months ago

Download mathstat.carleton.ca

This paper considers consensus problems with delayed noisy measurements, and stochastic approximation is used to achieve mean square consensus. For stochastic approximation based c...

Minyi Huang

claim paper

Read More »

111

click to vote

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 5 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

140

click to vote

SIAMCO
2002

121views more SIAMCO 2002»

Consistent Approximations and Approximate Functions and Gradients in Optimal Control

15 years 2 months ago

Download www.ann.jussieu.fr

As shown in [7], optimal control problems with either ODE or PDE dynamics can be solved efficiently using a setting of consistent approximations obtained by numerical discretizati...

Olivier Pironneau, Elijah Polak

claim paper

Read More »

124

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 3 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

« Prev « First page 2 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers