Sciweavers

1262 search results - page 118 / 253

» Reinforcement Learning: An Introduction

83

ICAART
2010
INSTICC

136views Intelligent Agents» more ICAART 2010»

A Reinforcement Learning Approach for Multiagent Navigation

16 years 24 days ago

A Reinforcement Learning Approach for Multiagent Navigation

Download scalab.uc3m.es

Francisco Martinez-Gil, Fernando Barber, Miguel Lo...

claim paper

Read More »

124

ICAART
2010
INSTICC

222views Intelligent Agents» more ICAART 2010»

Exploiting Similarity Information in Reinforcement Learning - Similarity Models for Multi-Armed Bandits and MDPs

16 years 24 days ago

Exploiting Similarity Information in Reinforcement Learning - Similarity Models for Multi-Armed Bandits and MDPs

Download personal.unileoben.ac.at

Ronald Ortner

claim paper

Read More »

95

ICAART
2010
INSTICC

288views Intelligent Agents» more ICAART 2010»

A Cautious Approach to Generalization in Reinforcement Learning

16 years 24 days ago

A Cautious Approach to Generalization in Reinforcement Learning

Download www.montefiore.ulg.ac.be

Raphael Fonteneau, Susan A. Murphy, Louis Wehenkel...

claim paper

Read More »

130

IUI
2009
ACM

110views Software Engineering» more IUI 2009»

A bayesian reinforcement learning approach for customizing human-robot interfaces

15 years 10 months ago

A bayesian reinforcement learning approach for customizing human-robot interfaces

Download www.cs.mcgill.ca

Amin Atrash, Joelle Pineau

claim paper

Read More »

144

ISDA
2009
IEEE

144views Operating System» more ISDA 2009»

Postponed Updates for Temporal-Difference Reinforcement Learning

15 years 10 months ago

Postponed Updates for Temporal-Difference Reinforcement Learning

Download www.science.uva.nl

This paper presents postponed updates, a new strategy for TD methods that can improve sample efﬁciency without incurring the computational and space requirements of model-based ...

Harm van Seijen, Shimon Whiteson

claim paper

Read More »

« Prev « First page 118 / 253 Last » Next »