Search Sciweavers | Sciweavers

28

ROMAN
2007
IEEE

150views Robotics» more ROMAN 2007»

Asymmetric Interpretations of Positive and Negative Human Feedback for a Social Learning Agent

14 years 3 months ago

— The ability for people to interact with robots and teach them new skills will be crucial to the successful application of robots in everyday human environments. In order to des...

Andrea Lockerd Thomaz, Cynthia Breazeal

claim paper

Read More »

25

click to vote

IROS
2006
IEEE

107views Robotics» more IROS 2006»

Heterogeneous and Hierarchical Cooperative Learning via Combining Decision Trees

14 years 3 months ago

Download birg2.epfl.ch

Abstract— Decision trees, being human readable and hierarchically structured, provide a suitable mean to derive state-space abstraction and simplify the inclusion of the availabl...

Masoud Asadpour, Majid Nili Ahmadabadi, Roland Sie...

claim paper

Read More »

23

click to vote

AIPS
2008

95views Artificial Intelligence» more AIPS 2008»

Learning Heuristic Functions through Approximate Linear Programming

13 years 11 months ago

Download anytime.cs.umass.edu

Planning problems are often formulated as heuristic search. The choice of the heuristic function plays a significant role in the performance of planning systems, but a good heuris...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

28

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Multi-Agent Learning with Policy Prediction

13 years 10 months ago

Download www.cs.umass.edu

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

32

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

13 years 10 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers