Sciweavers

651 search results - page 70 / 131
» Algorithms for Inverse Reinforcement Learning
Sort
View
ECAI
2008
Springer
13 years 12 months ago
Exploiting locality of interactions using a policy-gradient approach in multiagent learning
In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...
Francisco S. Melo
CSREAEEE
2008
199views Business» more  CSREAEEE 2008»
13 years 11 months ago
Progranimate - A Web Enabled Algorithmic Problem Solving Application
- This paper proposes the use of an interactive web based problem solving application that utilises flowchart based programming and code generation to address the issues faced by n...
Andrew Scott, Mike Watkins, Duncan McPhee
GECCO
2005
Springer
111views Optimization» more  GECCO 2005»
14 years 3 months ago
XCS with eligibility traces
The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...
Jan Drugowitsch, Alwyn Barry
RECOMB
2006
Springer
14 years 10 months ago
Simple and Fast Inverse Alignment
For as long as biologists have been computing alignments of sequences, the question of what values to use for scoring substitutions and gaps has persisted. While some choices for s...
John D. Kececioglu, Eagu Kim
SOCROB
2010
126views Robotics» more  SOCROB 2010»
13 years 8 months ago
Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief
Abstract. In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...
Antoine Hiolle, Lola Cañamero, Pierre Andry...