Search Sciweavers | Sciweavers

4544 search results - page 190 / 909

» Reinforcement Learning with Time

IROS
2006
IEEE

107 views, Robotics

Heterogeneous and Hierarchical Cooperative Learning via Combining Decision Trees

16 years 1 month ago

Download birg2.epfl.ch

Decision trees, being human readable and hierarchically structured, provide a suitable means to derive state-space abstraction and simplify the inclusion of the availabl...

Masoud Asadpour, Majid Nili Ahmadabadi, Roland Sie...

AIPS
2008

95 views, Artificial Intelligence

Learning Heuristic Functions through Approximate Linear Programming

15 years 9 months ago

Download anytime.cs.umass.edu

Planning problems are often formulated as heuristic search. The choice of the heuristic function plays a significant role in the performance of planning systems, but a good heuris...

Marek Petrik, Shlomo Zilberstein

AAAI
2010

171 views, Intelligent Agents

Multi-Agent Learning with Policy Prediction

15 years 9 months ago

Download www.cs.umass.edu

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

ICGA
2008

100 views, Optimization

Learning the Piece Values for Three Chess Variants

15 years 7 months ago

Download www.ke.tu-darmstadt.de

A set of experiments for learning the values of chess pieces is described for the popular chess variants Crazyhouse Chess, Suicide Chess, and Atomic Chess. We follow an establishe...

Sacha Droste, Johannes Fürnkranz

ACL
2010

135 views, Computational Linguistics

Reading between the Lines: Learning to Map High-Level Instructions to Commands

15 years 5 months ago

Download ai.cs.washington.edu

In this paper, we address the task of mapping high-level instructions to sequences of commands in an external environment. Processing these instructions is challenging--they posit...

S. R. K. Branavan, Luke S. Zettlemoyer, Regina Bar...
