Search Sciweavers | Sciweavers

1630 search results - page 100 / 326

» Coordinated Reinforcement Learning

ICC
2007
IEEE

148 views, Communications

Improved Revenue and Radio Resource Usage through Inter-Operator Joint Radio Resource Management

15 years 10 months ago

Download www.prism.uvsq.fr

This paper proposes a two-layer Joint Radio Resource Management (JRRM) framework to improve the efficiency in multi-radio and multi-operator cellular scenarios. On the one hand...

Lorenza Giupponi, Ramón Agustí, Jord...

CIMCA
2006
IEEE

147 views, Intelligent Agents

Model-driven Walks for Resource Discovery in Peer-to-Peer Networks

15 years 9 months ago

Download mbakhouya.free.fr

In this paper, a distributed and adaptive approach for resource discovery in peer-to-peer networks is presented. This approach is based on the mobile agent paradigm and the random...

Mohamed Bakhouya, Jaafar Gaber

NN
2007
Springer

105 views, Neural Networks

Guiding exploration by pre-existing knowledge without modifying reward

15 years 3 months ago

Download www.cs.hut.fi

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...

Kary Främling

ICML
2006
IEEE

103 views, Machine Learning

Using inaccurate models in reinforcement learning

16 years 4 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

ICML
2004
IEEE

145 views, Machine Learning

Convergence of synchronous reinforcement learning with linear function approximation

16 years 4 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht
