Search Sciweavers | Sciweavers

495 search results - page 27 / 99

» Constructing States for Reinforcement Learning

ATAL
2009
Springer

137 views, Intelligent Agents

Generalized model learning for reinforcement learning in factored domains

15 years 10 months ago

Download userweb.cs.utexas.edu

Improving the sample efficiency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...

Todd Hester, Peter Stone

ICMLA
2010

205 views, Machine Learning

Incremental Learning of Relational Action Rules

15 years 1 month ago

Download www-lipn.univ-paris13.fr

In the relational reinforcement learning framework, we propose an algorithm that learns an action model that makes it possible to predict the resulting state of each action in any give...

Christophe Rodrigues, Pierre Gérard, Cé...

ATAL
2008
Springer

127 views, Intelligent Agents

Autonomous transfer for reinforcement learning

15 years 6 months ago

Download www.cs.utexas.edu

Recent work in transfer learning has succeeded in making reinforcement learning algorithms more efficient by incorporating knowledge from previous tasks. However, such methods typ...

Matthew E. Taylor, Gregory Kuhlmann, Peter Stone

AAAI
2007

68 views, Intelligent Agents

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

15 years 6 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends a UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

BROADNETS
2004
IEEE

154 views, Computer Networks

Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning

15 years 8 months ago

Download www.ece.ubc.ca

The scarcity and large fluctuations of link bandwidth in wireless networks have motivated the development of adaptive multimedia services in mobile communication networks, where i...

Fei Yu, Vincent W. S. Wong, Victor C. M. Leung
