Search Sciweavers | Sciweavers

80 search results - page 5 / 16

» Efficient Reinforcement Learning Using Recursive Least-Squar...

100

click to vote

PRIMA
2009
Springer

102views Intelligent Agents» more PRIMA 2009»

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

15 years 9 months ago

Download teamcore.usc.edu

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...

Itsuki Noda

claim paper

Read More »

127

click to vote

ICRA
2007
IEEE

155views Robotics» more ICRA 2007»

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

15 years 8 months ago

Download sugiyama-www.cs.titech.ac.jp

— The least squares approach works efﬁciently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

claim paper

Read More »

137

click to vote

LION
2007
Springer

192views Optimization» more LION 2007»

Learning While Optimizing an Unknown Fitness Surface

15 years 8 months ago

Download www.science.unitn.it

This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...

Roberto Battiti, Mauro Brunato, Paolo Campigotto

claim paper

Read More »

108

click to vote

INFOCOM
1994
IEEE

128views Communications» more INFOCOM 1994»

Dynamic Bandwidth Allocation for Efficient Transport of Real-Time VBR Video over ATM

15 years 6 months ago

Download koasas.kaist.ac.kr

This pape? presents a novel approach to dynamic transmission bandwidth allocation for transport of real-time variable-bit-rate video in ATM networks. Describe video traffic in the...

Song Chong, San-qi Li, Joydeep Ghosh

claim paper

Read More »

203

Voted

ICAART
2010
INSTICC

509views Intelligent Agents» more ICAART 2010»

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

15 years 11 months ago

Download arxiv.org

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

Read More »

« Prev « First page 5 / 16 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers