Search Sciweavers | Sciweavers

135 search results - page 22 / 27

» Using Reinforcement Learning to Coordinate Better

211

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 2 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

174

click to vote

BMEI
2008
IEEE

153views Biomedical Imaging» more BMEI 2008»

A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy

16 years 1 months ago

Download eprints.lancs.ac.uk

Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...

Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...

claim paper

Read More »

206

click to vote

ATAL
2010
Springer

129views Intelligent Agents» more ATAL 2010»

Learning multi-agent state space representations

15 years 8 months ago

Download como.vub.ac.be

This paper describes an algorithm, called CQ-learning, which learns to adapt the state representation for multi-agent systems in order to coordinate with other agents. We propose ...

Yann-Michaël De Hauwere, Peter Vrancx, Ann No...

claim paper

Read More »

221

click to vote

ATAL
2006
Springer

177views Intelligent Agents» more ATAL 2006»

Convergence analysis for collective vocabulary development

15 years 11 months ago

Download www.isrl.illinois.edu

We study how decentralized agents can develop a shared vocabulary without global coordination. Answering this question can help us understand the emergence of many communication s...

Jun Wang, Les Gasser, Jim Houk

claim paper

Read More »

201

click to vote

ATAL
2009
Springer

172views Intelligent Agents» more ATAL 2009»

Integrating organizational control into multi-agent learning

16 years 1 months ago

Download www.aamas-conference.org

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in largescale systems. In this work, we develop an organization-b...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

« Prev « First page 22 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers