Search Sciweavers | Sciweavers

651 search results - page 70 / 131

» Algorithms for Inverse Reinforcement Learning

ECAI 2008, Springer

124 views, Artificial Intelligence

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 8 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

CSREAEEE 2008

199 views, Business

Progranimate - A Web Enabled Algorithmic Problem Solving Application

15 years 7 months ago

Download www.comp.glam.ac.uk

This paper proposes the use of an interactive web based problem solving application that utilises flowchart based programming and code generation to address the issues faced by n...

Andrew Scott, Mike Watkins, Duncan McPhee

GECCO 2005, Springer

111 views, Optimization

XCS with eligibility traces

15 years 11 months ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

RECOMB 2006, Springer

112 views, Computational Biology

Simple and Fast Inverse Alignment

16 years 6 months ago

Download www.cs.arizona.edu

For as long as biologists have been computing alignments of sequences, the question of what values to use for scoring substitutions and gaps has persisted. While some choices for s...

John D. Kececioglu, Eagu Kim

SOCROB 2010

126 views, Robotics

Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief

15 years 4 months ago

Download fostsvn.uopnet.plymouth.ac.uk

Abstract. In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...

Antoine Hiolle, Lola Cañamero, Pierre Andry...
