Search Sciweavers | Sciweavers

148 search results - page 19 / 30

» Reinforcement Learning for P2P Searching

174

Voted

CIKM
2000
Springer

104views Information Technology» more CIKM 2000»

Relevance and Reinforcement in Interactive Browsing

15 years 11 months ago

Download ciir.cs.umass.edu

We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...

Anton Leuski

claim paper

Read More »

215

Voted

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

15 years 5 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

195

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

16 years 2 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

192

click to vote

ESANN
2003

152views Neural Networks» more ESANN 2003»

Improving iterative repair strategies for scheduling with the SVM

15 years 8 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

claim paper

Read More »

191

Voted

ECML
2003
Springer

129views Machine Learning» more ECML 2003»

Optimising Performance of Competing Search Engines in Heterogeneous Web Environments

16 years 20 days ago

Download userweb.port.ac.uk

Abstract. Distributed heterogeneous search environments are an emerging phenomenon in Web search, in which topic-speciﬁc search engines provide search services, and metasearchers...

Rinat Khoussainov, Nicholas Kushmerick

claim paper

Read More »

« Prev « First page 19 / 30 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers