Search Sciweavers | Sciweavers

236 search results - page 28 / 48

» Non-linear dynamics in multiagent reinforcement learning alg...

151

click to vote

ICRA
2007
IEEE

110views Robotics» more ICRA 2007»

A Reinforcement Learning Approach to Lift Generation in Flapping MAVs: Experimental Results

16 years 1 months ago

Download groups.csail.mit.edu

— In [17] we proposed an RL framework for control of ﬂapping-wing MAVs. The algorithm has been discussed and simulation results using a quasi-steady model showed initial promis...

Mehran Motamed, Joseph Yan

claim paper

Read More »

212

click to vote

AAAI
2006

116views Intelligent Agents» more AAAI 2006»

Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping

15 years 9 months ago

Download www.cs.utexas.edu

Transfer learning concerns applying knowledge learned in one task (the source) to improve learning another related task (the target). In this paper, we use structure mapping, a ps...

Yaxin Liu, Peter Stone

claim paper

Read More »

202

click to vote

ATAL
2004
Springer

197views Intelligent Agents» more ATAL 2004»

Adaptive, Distributed Control of Constrained Multi-Agent Systems

16 years 27 days ago

Download collectives.stanford.edu

Product Distribution (PD) theory was recently developed as a framework for analyzing and optimizing distributed systems. In this paper we demonstrate its use for adaptive distribu...

Stefan Bieniawski, David Wolpert

claim paper

Read More »

231

click to vote

ATAL
2008
Springer

160views Intelligent Agents» more ATAL 2008»

Sequential decision making in repeated coalition formation under uncertainty

15 years 9 months ago

Download www.aamas-conference.org

The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learnin...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

222

Voted

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

« Prev « First page 28 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers