Search Sciweavers | Sciweavers

197 search results - page 3 / 40

» Using Reinforcement Learning to Spider the Web Efficiently

ICML
2004
IEEE

161 views, Machine Learning

Using relative novelty to identify useful temporal abstractions in reinforcement learning

16 years 7 months ago

Download www.cs.umass.edu

Using Relative Novelty to Identify Useful Temporal Abstractions in Reinforcement Learning. Özgür Şimşek ozgur@cs.umass.edu, Andrew G. Barto barto@cs.umass.edu, Department of Computer Scie...

Özgür Şimşek, Andrew G. Barto

ICML
2007
IEEE

141 views, Machine Learning

Reinforcement learning by reward-weighted regression for operational space control

16 years 7 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

LAMAS
2005
Springer

168 views, Intelligent Agents

Multi-agent Relational Reinforcement Learning

16 years 5 days ago

Download dtai.cs.kuleuven.be

In this paper we report on using a relational state space in multi-agent reinforcement learning. There is growing evidence in the Reinforcement Learning research community that a r...

Tom Croonenborghs, Karl Tuyls, Jan Ramon, Maurice ...

NAACL
2001

130 views, Computational Linguistics

Learning Optimal Dialogue Management Rules by Using Reinforcement Learning and Inductive Logic Programming

15 years 8 months ago

Download www.aclweb.org

Developing dialogue systems is a complex process. In particular, designing efficient dialogue management strategies is often difficult as there are no precise guidelines to develo...

Renaud Lecoeuche

ICAART
2010
INSTICC

509 views, Intelligent Agents

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

16 years 3 months ago

Download arxiv.org

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros
