Search Sciweavers | Sciweavers

1262 search results - page 197 / 253

» Reinforcement Learning: An Introduction

135

click to vote

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 7 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

claim paper

Read More »

115

Voted

ATAL
2008
Springer

145views Intelligent Agents» more ATAL 2008»

Artificial agents learning human fairness

15 years 5 months ago

Download www.sce.carleton.ca

Recent advances in technology allow multi-agent systems to be deployed in cooperation with or as a service for humans. Typically, those systems are designed assuming individually ...

Steven de Jong, Karl Tuyls, Katja Verbeeck

claim paper

Read More »

113

click to vote

AIIDE
2009

221views Artificial Intelligence» more AIIDE 2009»

Learning Character Behaviors Using Agent Modeling in Games

15 years 4 months ago

Download webdocs.cs.ualberta.ca

Our goal is to provide learning mechanisms to game agents so they are capable of adapting to new behaviors based on the actions of other agents. We introduce a new on-line reinfor...

Richard Zhao, Duane Szafron

claim paper

Read More »

168

click to vote

GLOBECOM
2008
IEEE

169views Communications» more GLOBECOM 2008»

Autonomous Network Management Using Cooperative Learning for Network-Wide Load Balancing in Heterogeneous Networks

15 years 3 months ago

Download sierra.ece.ucdavis.edu

Traditional hop-by-hop dynamic routing makes inefficient use of network resources as it forwards packets along already congested shortest paths while uncongested longer paths may b...

Minsoo Lee, Xiaohui Ye, Dan Marconett, Samuel John...

claim paper

Read More »

128

Voted

ICRA
2010
IEEE

143views Robotics» more ICRA 2010»

Apprenticeship learning via soft local homomorphisms

15 years 1 months ago

Download damas.ift.ulaval.ca

Abstract— We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IR...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

« Prev « First page 197 / 253 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers