Search Sciweavers | Sciweavers

651 search results - page 33 / 131

» Algorithms for Inverse Reinforcement Learning

162

click to vote

NIPS
2007

149views Information Technology» more NIPS 2007»

Online Linear Regression and Its Application to Model-Based Reinforcement Learning

15 years 7 months ago

Download books.nips.cc

We provide a provably efﬁcient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Speciﬁcally, we take a mo...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

194

Voted

AAMAS
2007
Springer

210views Intelligent Agents» more AAMAS 2007»

Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game

16 years 9 days ago

Download sequel.futurs.inria.fr

Abstract. The application of reinforcement learning algorithms to multiagent domains may cause complex non-convergent dynamics. The replicator dynamics, commonly used in evolutiona...

Alessandro Lazaric, Jose Enrique Munoz de Cote, Fa...

claim paper

Read More »

158

Voted

TSMC
2008

229views more TSMC 2008»

A Comprehensive Survey of Multiagent Reinforcement Learning

15 years 6 months ago

Download www.dcsc.tudelft.nl

Multiagent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, and economics. The complexity of many task...

Lucian Busoniu, Robert Babuska, Bart De Schutter

claim paper

Read More »

164

click to vote

ICML
2001
IEEE

172views Machine Learning» more ICML 2001»

Continuous-Time Hierarchical Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

196

click to vote

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

15 years 11 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

« Prev « First page 33 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers