Search Sciweavers | Sciweavers

4544 search results - page 209 / 909

» Reinforcement Learning with Time


IROS
2008
IEEE

125 views, Robotics

Dynamic correlation matrix based multi-Q learning for a multi-robot system

15 years 9 months ago

Download www.ece.stevens-tech.edu

Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng


ECML
2003
Springer

87 views, Machine Learning

Self-evaluated Learning Agent in Multiple State Games

15 years 8 months ago

Download www.ai.sanken.osaka-u.ac.jp

Abstract. Most of multi-agent reinforcement learning algorithms aim to converge to a Nash equilibrium, but a Nash equilibrium does not necessarily mean a desirable result. On the o...

Koichi Moriyama, Masayuki Numao


ECCV
2010
Springer

251 views, Computer Vision

Discriminative Tracking by Metric Learning

15 years 7 months ago

Download www.eecs.northwestern.edu

We present a discriminative model that casts appearance modeling and visual matching into a single objective for visual tracking. Most previous discriminative models for visual tra...


ECAI
2006
Springer

245 views, Artificial Intelligence

Least Squares SVM for Least Squares TD Learning

15 years 6 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani


ATAL
2008
Springer

145 views, Intelligent Agents

Artificial agents learning human fairness

15 years 4 months ago

Download www.sce.carleton.ca

Recent advances in technology allow multi-agent systems to be deployed in cooperation with or as a service for humans. Typically, those systems are designed assuming individually ...

Steven de Jong, Karl Tuyls, Katja Verbeeck
