Search Sciweavers | Sciweavers

495 search results - page 66 / 99

» Constructing States for Reinforcement Learning

IJRR
2011

159 views

Learning visual representations for perception-action systems

14 years 11 months ago

Download robot-learning.de

We discuss vision as a sensory modality for systems that effect actions in response to perceptions. While the internal representations informed by vision may be arbitrarily compl...

Justus H. Piater, Sébastien Jodogne, Renaud...

JAIR
2008

148 views

Learning Partially Observable Deterministic Action Models

15 years 4 months ago

Download www.jair.org

We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...

Eyal Amir, Allen Chang

SIGDIAL
2010

158 views, Natural Language Processing

Sparse Approximate Dynamic Programming for Dialog Management

15 years 1 months ago

Download www.sigdial.org

Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet, there is still a clear mismatch between the comp...

Senthilkumar Chandramohan, Matthieu Geist, Olivier...

NIPS
1998

164 views, Information Technology

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 5 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

ICML
1999
IEEE

152 views, Machine Learning

Distributed Value Functions

16 years 4 months ago

Download www.ri.cmu.edu

Many interesting problems, such as power grids, network switches, and traffic flow, that are candidates for solving with reinforcement learning (RL), also have properties that make distri...

Jeff G. Schneider, Weng-Keen Wong, Andrew W. Moore...
