Search Sciweavers | Sciweavers

827 search results - page 69 / 166

» Variational methods for Reinforcement Learning

200

click to vote

NIPS
1996

112views Information Technology» more NIPS 1996»

Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning

15 years 8 months ago

Download www.ri.cmu.edu

Model learning combined with dynamic programming has been shown to be e ective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...

Jeff G. Schneider

claim paper

Read More »

194

click to vote

ML
1998
ACM

136views Machine Learning» more ML 1998»

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 6 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

332

click to vote

ACCV
2010
Springer

460views Computer Vision» more ACCV 2010»

Abstraction and Generalization of 3D structure for recognition in large intra-class variation

15 years 8 months ago

Download www.eecis.udel.edu

Humans have abstract models for object classes which helps recognize previously unseen instances, despite large intra-class variations. Also objects are grouped into classes based...

Gowri Somanath, Chandra Kambhamettu

posted by gowri

Read More »

205

Voted

FBIT
2007
IEEE

142views Information Technology» more FBIT 2007»

Learning to Drive a Real Car in 20 Minutes

16 years 1 months ago

Download www.ni.uos.de

The paper describes our ﬁrst experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...

Martin Riedmiller, Michael Montemerlo, Hendrik Dah...

claim paper

Read More »

189

click to vote

ACL
2010

135views Computational Linguistics» more ACL 2010»

Reading between the Lines: Learning to Map High-Level Instructions to Commands

15 years 5 months ago

Download ai.cs.washington.edu

In this paper, we address the task of mapping high-level instructions to sequences of commands in an external environment. Processing these instructions is challenging--they posit...

S. R. K. Branavan, Luke S. Zettlemoyer, Regina Bar...

claim paper

Read More »

« Prev « First page 69 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers