Search Sciweavers | Sciweavers

779 search results - page 23 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

BROADNETS
2004
IEEE

154 views, Computer Networks

Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning

15 years 11 months ago

Download www.ece.ubc.ca

The scarcity and large fluctuations of link bandwidth in wireless networks have motivated the development of adaptive multimedia services in mobile communication networks, where i...

Fei Yu, Vincent W. S. Wong, Victor C. M. Leung

IJCAI
2007

179 views, Artificial Intelligence

Heuristic Selection of Actions in Multiagent Reinforcement Learning

15 years 8 months ago

Download www.ijcai.org

This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the well-known Multiagent Reinforcement Learni...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

ICRA
2009
IEEE

132 views, Robotics

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

16 years 1 month ago

Download alumni.media.mit.edu

Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to find a sequence of actio...

Deepak Ramachandran, Rakesh Gupta

NIPS
1997

94 views, Information Technology

Reinforcement Learning with Hierarchies of Machines

15 years 8 months ago

Download www.cs.berkeley.edu

We present a new approach to reinforcement learning in which the policies considered by the learning process are constrained by hierarchies of partially specified machines. This ...

Ronald Parr, Stuart J. Russell

ACL
2008

213 views, Computational Linguistics

Generalized Expectation Criteria for Semi-Supervised Learning of Conditional Random Fields

15 years 8 months ago

Download www.cs.umass.edu

This paper presents a semi-supervised training method for linear-chain conditional random fields that makes use of labeled features rather than labeled instances. This is accompli...

Gideon S. Mann, Andrew McCallum
