Search Sciweavers | Sciweavers

4544 search results - page 112 / 909

» Reinforcement Learning with Time

153

click to vote

ATAL
2007
Springer

130views Intelligent Agents» more ATAL 2007»

Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective

16 years 19 days ago

Download www.aamas-conference.org

This paper presents the dynamics of multiple reinforcement learning agents from an Evolutionary Game Theoretic (EGT) perspective. We provide a Replicator Dynamics model for tradit...

Liviu Panait, Karl Tuyls

claim paper

Read More »

157

click to vote

CIMCA
2006
IEEE

147views Intelligent Agents» more CIMCA 2006»

Model-driven Walks for Resource Discovery in Peer-to-Peer Networks

16 years 15 days ago

Download mbakhouya.free.fr

In this paper, a distributed and adaptive approach for resource discovery in peer-to-peer networks is presented. This approach is based on the mobile agent paradigm and the random...

Mohamed Bakhouya, Jaafar Gaber

claim paper

Read More »

213

Voted

NN
2007
Springer

105views Neural Networks» more NN 2007»

Guiding exploration by pre-existing knowledge without modifying reward

15 years 6 months ago

Download www.cs.hut.fi

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...

Kary Främling

claim paper

Read More »

159

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

166

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

16 years 7 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

« Prev « First page 112 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers