Search Sciweavers | Sciweavers

458 search results - page 54 / 92

» Q-Decomposition for Reinforcement Learning Agents

215

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

15 years 5 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

143

click to vote

ICML
2005
IEEE

119views Machine Learning» more ICML 2005»

Dynamic preferences in multi-criteria reinforcement learning

16 years 6 months ago

Download www.machinelearning.org

The current framework of reinforcement learning is based on maximizing the expected returns based on scalar rewards. But in many real world situations, tradeoffs must be made amon...

Sriraam Natarajan, Prasad Tadepalli

claim paper

Read More »

202

click to vote

ECML
2007
Springer

167views Machine Learning» more ECML 2007»

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

15 years 9 months ago

Download www.igi.tugraz.at

Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...

Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass

claim paper

Read More »

133

click to vote

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

15 years 3 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

151

click to vote

EURONGI
2005
Springer

115views Computer Networks» more EURONGI 2005»

An Afterstates Reinforcement Learning Approach to Optimize Admission Control in Mobile Cellular Networks

15 years 11 months ago

Download jogiguz.webs.upv.es

We deploy a novel Reinforcement Learning optimization technique based on afterstates learning to determine the gain that can be achieved by incorporating movement prediction inform...

José Manuel Giménez-Guzmán, J...

claim paper

Read More »

« Prev « First page 54 / 92 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers