Search Sciweavers | Sciweavers

495 search results - page 58 / 99

» Constructing States for Reinforcement Learning

142

click to vote

NIPS
1993

134views Information Technology» more NIPS 1993»

Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming

15 years 5 months ago

Download www.cs.cmu.edu

Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...

Christopher G. Atkeson

claim paper

Read More »

128

Voted

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 5 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

135

click to vote

AVSS
2007
IEEE

161views Signal Processing» more AVSS 2007»

Vehicular traffic density estimation via statistical methods with automated state learning

15 years 10 months ago

Download www.cse.unsw.edu.au

This paper proposes a novel approach of combining an unsupervised clustering scheme called AutoClass with Hidden Markov Models (HMMs) to determine the traffic density state in a R...

Evan Tan, Jing Chen

claim paper

Read More »

139

click to vote

ICRA
2009
IEEE

121views Robotics» more ICRA 2009»

Learning sequential visual attention control through dynamic state space discretization

15 years 10 months ago

Download ilab.usc.edu

² Similar to humans and primates, artificial creatures like robots are limited in terms of allocation of their resources to huge sensory and perceptual information. Serial process...

Ali Borji, Majid Nili Ahmadabadi, Babak Nadjar Ara...

claim paper

Read More »

133

click to vote

ICRA
2010
IEEE

143views Robotics» more ICRA 2010»

Apprenticeship learning via soft local homomorphisms

15 years 2 months ago

Download damas.ift.ulaval.ca

Abstract— We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IR...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

« Prev « First page 58 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers