future discounted reward

ALT
2006
Springer

109views Machine Learning» more ALT 2006»

General Discounting Versus Average Reward

14 years 9 months ago

Consider an agent interacting with an environment in cycles. In every interaction cycle the agent is rewarded for its performance. We compare the average reward U from cycle 1 to ...

Marcus Hutter

claim paper

Read More »

click to vote

ICML
1999
IEEE

138views Machine Learning» more ICML 1999»

Using Reinforcement Learning to Spider the Web Efficiently

15 years 1 months ago

Download www.cs.iastate.edu

Consider the task of exploring the Web in order to find pages of a particular kind or on a particular topic. This task arises in the construction of search engines and Web knowled...

Jason Rennie, Andrew McCallum

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers