Search Sciweavers | Sciweavers

3274 search results - page 47 / 655

» Using Learning in a Control Agent

224

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

16 years 15 days ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

214

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

179

click to vote

AI
1999
Springer

91views Artificial Intelligence» more AI 1999»

Building Agent Teams Using an Explicit Teamwork Model and Learning

15 years 6 months ago

Download www.isi.edu

Milind Tambe, Jafar Adibi, Y. Alonaizon, Ali Erdem...

claim paper

Read More »

179

click to vote

IROS
2008
IEEE

125views Robotics» more IROS 2008»

Dynamic correlation matrix based multi-Q learning for a multi-robot system

16 years 1 months ago

Download www.ece.stevens-tech.edu

—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

claim paper

Read More »

174

click to vote

DAGSTUHL
2003

95views Software Engineering» more DAGSTUHL 2003»

Toward a Cognitive System Algebra: Application to Facial Expression Learning and Imitation

15 years 8 months ago

Download publi-etis.ensea.fr

In this paper, we try to demonstrate the capability of a very simple architecture to learn to recognize and reproduce facial expressions without the innate capability to recognize ...

Philippe Gaussier, Ken Prepin, Jacqueline Nadel

claim paper

Read More »

« Prev « First page 47 / 655 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers