Search Sciweavers | Sciweavers

463 search results - page 26 / 93

» Localizing Search in Reinforcement Learning

NIPS
2003

Policy Search by Dynamic Programming

15 years 8 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

CIKM
2000
Springer

Relevance and Reinforcement in Interactive Browsing

15 years 11 months ago

Download ciir.cs.umass.edu

We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...

Anton Leuski

ICTAI
2009
IEEE

Learning in Local Search

16 years 1 months ago

Download www.cril.univ-artois.fr

In this paper a learning based local search approach for propositional satisfiability is presented. It is based on an original adaptation of the conflict driven clause learning ...

Gilles Audemard, Jean-Marie Lagniez, Bertrand Mazu...

WOWMOM
2005
ACM

An Adaptive Routing Protocol for Ad Hoc Peer-to-Peer Networks

16 years 23 days ago

Download sixearch.org

Ad hoc networks represent a key factor in the evolution of wireless communications. These networks typically consist of equal nodes that communicate without central control, inter...

Luca Gatani, Giuseppe Lo Re, Salvatore Gaglio

ICMLA
2010

Multimodal Parameter-exploring Policy Gradients

15 years 5 months ago

Download www6.in.tum.de

Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...
