Search Sciweavers | Sciweavers

71 search results - page 3 / 15



CORR
2012
Springer

196 views, Education

PAC-Bayesian Policy Evaluation for Reinforcement Learning

12 years 3 months ago

Download www.cs.mcgill.ca

Bayesian priors offer a compact yet general means of incorporating domain knowledge into many learning tasks. The correctness of the Bayesian analysis and inference, however, lar...

Mahdi Milani Fard, Joelle Pineau, Csaba Szepesv...


AAAI
1993

107 views, Intelligent Agents

Complexity Analysis of Real-Time Reinforcement Learning

13 years 9 months ago

Download www.ri.cmu.edu

This paper analyzes the complexity of on-line reinforcement learning algorithms, namely asynchronous real-time versions of Q-learning and value-iteration, applied to the problem of...

Sven Koenig, Reid G. Simmons


GECCO
2011
Springer

276 views, Optimization

Evolution of reward functions for reinforcement learning

12 years 11 months ago

Download hampshire.edu

The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...

Scott Niekum, Lee Spector, Andrew G. Barto


ICARCV
2006
IEEE

100 views, Robotics

Decentralized Reinforcement Learning Control of a Robotic Manipulator

14 years 1 month ago

Download www.dcsc.tudelft.nl

Multi-agent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, etc. Learning approaches to multi-ag...

Lucian Busoniu, Bart De Schutter, Robert Babuska


AAMAS
2007
Springer

210 views, Intelligent Agents

Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game

14 years 1 month ago

Download sequel.futurs.inria.fr

The application of reinforcement learning algorithms to multiagent domains may cause complex non-convergent dynamics. The replicator dynamics, commonly used in evolutiona...

Alessandro Lazaric, Jose Enrique Munoz de Cote, Fa...

