Search Sciweavers | Sciweavers

495 search results - page 30 / 99

» Constructing States for Reinforcement Learning

231

click to vote

ATAL
2008
Springer

136views Intelligent Agents» more ATAL 2008»

Efficient multi-agent reinforcement learning through automated supervision

15 years 6 months ago

Download www.cs.umass.edu

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in large-scale systems. In this work, we develop a supervision fr...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

113

click to vote

ESANN
2007

125views Neural Networks» more ESANN 2007»

Replacing eligibility trace for action-value learning with function approximation

15 years 5 months ago

Download www.dice.ucl.ac.be

The eligibility trace is one of the most used mechanisms to speed up reinforcement learning. Earlier reported experiments seem to indicate that replacing eligibility traces would p...

Kary Främling

claim paper

Read More »

143

click to vote

ICRA
2009
IEEE

132views Robotics» more ICRA 2009»

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

15 years 11 months ago

Download alumni.media.mit.edu

— Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to ﬁnd a sequence of actio...

Deepak Ramachandran, Rakesh Gupta

claim paper

Read More »

103

click to vote

NIPS
1998

88views Information Technology» more NIPS 1998»

Scheduling Straight-Line Code Using Reinforcement Learning and Rollouts

15 years 5 months ago

Download www.cs.ou.edu

The execution order of a block of computer instructions can make a difference in its running time by a factor of two or more. In order to achieve the best possible speed, compiler...

Amy McGovern, J. Eliot B. Moss

claim paper

Read More »

144

click to vote

JAIR
2002

99views more JAIR 2002»

Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System

15 years 3 months ago

Download www.eecs.umich.edu

Designing the dialogue policy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing a di...

Satinder P. Singh, Diane J. Litman, Michael J. Kea...

claim paper

Read More »

« Prev « First page 30 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers