Search Sciweavers | Sciweavers

ML
2000
ACM

126 views, Machine Learning

Learning to Play Chess Using Temporal Differences

13 years 7 months ago

In this paper we present TDLEAF(λ), a variation on the TD(λ) algorithm that enables it to be used in conjunction with game-tree search. We present some experiments in which our che...

Jonathan Baxter, Andrew Tridgell, Lex Weaver

SIGIR
2011
ACM

277 views, Information Technology

Collaborative competitive filtering: learning recommender using context of user choice

12 years 10 months ago

Download www.cc.gatech.edu

While a user’s preference is directly reflected in the interactive choice process between her and the recommender, this wealth of information was not fully exploited for learni...

Shuang-Hong Yang, Bo Long, Alexander J. Smola, Hon...

AAAI
2011

217 views, Intelligent Agents

Fast Newton-CG Method for Batch Learning of Conditional Random Fields

12 years 7 months ago

Download 2boy.org

We propose a fast batch learning method for linear-chain Conditional Random Fields (CRFs) based on Newton-CG methods. Newton-CG methods are a variant of the Newton method for high-dime...

Yuta Tsuboi, Yuya Unno, Hisashi Kashima, Naoaki Ok...

ICRA
2009
IEEE

170 views, Robotics

Imitation learning with generalized task descriptions

14 years 2 months ago

Download www.informatik.uni-freiburg.de

In this paper, we present an approach that allows a robot to observe, generalize, and reproduce tasks observed from multiple demonstrations. Motion capture data is recorded in ...

Clemens Eppner, Jürgen Sturm, Maren Bennewitz...

ICRA
2008
IEEE

173 views, Robotics

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

14 years 2 months ago

Download www.cs.cmu.edu

We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...
