Search Sciweavers | Sciweavers

87 search results - page 14 / 18

» Direct Policy Search Reinforcement Learning for Robot Contro...

177

click to vote

AGENTS
1999
Springer

105views Security Privacy» more AGENTS 1999»

Team-Partitioned, Opaque-Transition Reinforcement Learning

15 years 11 months ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso

claim paper

Read More »

205

click to vote

SMC
2007
IEEE

102views Control Systems» more SMC 2007»

An improved immune Q-learning algorithm

16 years 1 months ago

Download web2.uwindsor.ca

—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

claim paper

Read More »

187

click to vote

ISCAS
2006
IEEE

103views Hardware» more ISCAS 2006»

Towards autonomous adaptive behavior in a bio-inspired CNN-controlled robot

16 years 1 months ago

Download web.mit.edu

— This paper describes a general approach for the unsupervised learning of behaviors in a behavior-based robot. The key idea is to formalize a behavior produced by a Motor Map dr...

Paolo Arena, Luigi Fortuna, Mattia Frasca, Luca Pa...

claim paper

Read More »

201

click to vote

ICANN
2010
Springer

166views Neural Networks» more ICANN 2010»

Exploring Continuous Action Spaces with Diffusion Trees for Reinforcement Learning

15 years 8 months ago

Download www.tu-ilmenau.de

We propose a new approach for reinforcement learning in problems with continuous actions. Actions are sampled by means of a diffusion tree, which generates samples in the continuou...

Christian Vollmer, Erik Schaffernicht, Horst-Micha...

claim paper

Read More »

217

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

16 years 1 months ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

« Prev « First page 14 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers