Search Sciweavers | Sciweavers

495 search results - page 61 / 99

» Constructing States for Reinforcement Learning

ECML
2006
Springer

Machine Learning

Efficient Non-linear Control Through Neuroevolution

15 years 7 months ago

Download www.idsia.ch

Abstract. Many complex control problems are not amenable to traditional controller design. Not only is it difficult to model real systems, but often it is unclear what kind of beha...

Faustino J. Gomez, Jürgen Schmidhuber, Risto ...

IJCNN
2008
IEEE

Neural Networks

Learning to select relevant perspective in a dynamic environment

15 years 10 months ago

Download www.cs.qub.ac.uk

When an agent observes its environment, there are two important characteristics of the perceived information. One is the relevance of information and the other is redundancy. T...

Zhihui Luo, David A. Bell, Barry McCollum, Qingxia...

NIPS
1997

Information Technology

Generalized Prioritized Sweeping

15 years 5 months ago

Download www.cs.huji.ac.il

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...

David Andre, Nir Friedman, Ronald Parr

SIGDIAL
2010

Natural Language Processing

Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy

15 years 1 month ago

Download mastarpj.nict.go.jp

This paper presents a spoken dialogue framework that helps users in making decisions. Users often do not have a definite goal or criteria for selecting from a list of alternatives...

Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chi...

NIPS
2003

Information Technology

Policy Search by Dynamic Programming

15 years 5 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...
