Search Sciweavers | Sciweavers

651 search results - page 24 / 131

» Algorithms for Inverse Reinforcement Learning

click to vote

SIAMCO
2000

117views more SIAMCO 2000»

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

13 years 7 months ago

Download eprints.iisc.ernet.in

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

claim paper

Read More »

click to vote

BROADNETS
2007
IEEE

119views Computer Networks» more BROADNETS 2007»

Reinforcement learning based routing in all-optical networks with physical impairments

13 years 11 months ago

Download www.tsp.ece.mcgill.ca

Abstract-- We present and evaluate a reinforcement learningbased RWA algorithm for all-optical networks subject to physical impairments. The technique is suitable for decentralized...

Yvan Pointurier, Fariba Heidari

claim paper

Read More »

click to vote

AAAI
1997

107views Intelligent Agents» more AAAI 1997»

Reinforcement Learning with Time

13 years 9 months ago

Download www.aaai.org

This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...

Daishi Harada

claim paper

Read More »

click to vote

COLT
2003
Springer

104views Machine Learning» more COLT 2003»

Learning Random Log-Depth Decision Trees under the Uniform Distribution

14 years 1 months ago

Download www.cs.columbia.edu

We consider three natural models of random logarithmic depth decision trees over Boolean variables. We give an eﬃcient algorithm that for each of these models learns all but an ...

Jeffrey C. Jackson, Rocco A. Servedio

claim paper

Read More »

click to vote

SBIA
2004
Springer

137views Artificial Intelligence» more SBIA 2004»

Heuristically Accelerated Q-Learning: A New Approach to Speed Up Reinforcement Learning

14 years 1 months ago

Download www.fei.edu.br

This work presents a new algorithm, called Heuristically Accelerated Q–Learning (HAQL), that allows the use of heuristics to speed up the well-known Reinforcement Learning algori...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

claim paper

Read More »

« Prev « First page 24 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers