Search Sciweavers | Sciweavers

286 search results - page 57 / 58

» Using inaccurate models in reinforcement learning

SIGIR
2005
ACM

Information Technology

Orthogonal locality preserving indexing

16 years 1 month ago

Download www.cs.uiuc.edu

We consider the problem of document indexing and representation. Recently, Locality Preserving Indexing (LPI) was proposed for learning a compact document subspace. Different from...

Deng Cai, Xiaofei He

ICRA
2003
IEEE

Robotics

Multi-robot task-allocation through vacancy chains

16 years 21 days ago

Download www-robotics.usc.edu

Existing task allocation algorithms generally do not consider the effects of task interaction, such as interference, but instead assume that tasks are independent. That assumptio...

Torbjørn S. Dahl, Maja J. Mataric, Gaurav S...

JMLR
2006

Policy Gradient in Continuous Time

15 years 7 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

BC
2006

Motor-maps, navigation and implicit space representation in the hippocampus

15 years 7 months ago

Download ece.ut.ac.ir

Multiple sensory-motor maps located in the brainstem and the cortex are involved in spatial orientation. Guiding movements of eyes, head, neck and arms, they provide an app...

Alexander Kaske, Gösta Winberg, Joakim Cö...

NIPS
1998

Information Technology

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 8 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh
