Search Sciweavers | Sciweavers

179 search results - page 33 / 36

» Learning Relational Navigation Policies

224

click to vote

WWW
2008
ACM

163views Internet Technology» more WWW 2008»

As we may perceive: finding the boundaries of compound documents on the web

16 years 7 months ago

Download www2008.org

This paper considers the problem of identifying on the Web compound documents (cDocs) ? groups of web pages that in aggregate constitute semantically coherent information entities...

Pavel Dmitriev

claim paper

Read More »

194

click to vote

COLT
2008
Springer

179views Machine Learning» more COLT 2008»

Adapting to a Changing Environment: the Brownian Restless Bandits

15 years 8 months ago

Download research.microsoft.com

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

claim paper

Read More »

170

click to vote

NIPS
2007

146views Information Technology» more NIPS 2007»

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

15 years 8 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett

claim paper

Read More »

167

click to vote

ENVSOFT
2007

126views more ENVSOFT 2007»

Uncertainty and precaution in environmental management: Insights from the UPEM conference

15 years 6 months ago

Download igitur-archive.library.uu.nl

Communication across the science-policy interface is complicated by uncertainty and ignorance associated with predictions on which to base policies. The international symposium �...

Jeroen P. van der Sluijs

claim paper

Read More »

211

click to vote

NIPS
2008

271views Information Technology» more NIPS 2008»

Goal-directed decision making in prefrontal cortex: a computational framework

15 years 8 months ago

Download www.princeton.edu

Research in animal learning and behavioral neuroscience has distinguished between two forms of action control: a habit-based form, which relies on stored action values, and a goal...

Matthew Botvinick, James An

claim paper

Read More »

« Prev « First page 33 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers