Search Sciweavers | Sciweavers

124

NIPS
2000

112views Information Technology» more NIPS 2000»

Balancing Multiple Sources of Reward in Reinforcement Learning

15 years 8 months ago

For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...

Christian R. Shelton

claim paper

Read More »

199

click to vote

NIPS
2000

155views Information Technology» more NIPS 2000»

The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity

15 years 8 months ago

Download www.cs.cmu.edu

We describe a joint probabilistic model for modeling the contents and inter-connectivity of document collections such as sets of web pages or research paper archives. The model is...

David A. Cohn, Thomas Hofmann

claim paper

Read More »

193

click to vote

NIPS
2000

161views Information Technology» more NIPS 2000»

From Margin to Sparsity

15 years 8 months ago

Download users.cecs.anu.edu.au

We present an improvement of Noviko 's perceptron convergence theorem. Reinterpreting this mistakebound as a margindependent sparsity guarantee allows us to give a PAC{style ...

Thore Graepel, Ralf Herbrich, Robert C. Williamson

claim paper

Read More »

176

click to vote

NIPS
2000

104views Information Technology» more NIPS 2000»

The Use of Classifiers in Sequential Inference

15 years 8 months ago

Download l2r.cs.uiuc.edu

We study the problem of combining the outcomes of several different classifiers in a way that provides a coherent inference that satisfies some constraints. In particular, we deve...

Vasin Punyakanok, Dan Roth

claim paper

Read More »

160

click to vote

NIPS
2000

150views Information Technology» more NIPS 2000»

Programmable Reinforcement Learning Agents

15 years 8 months ago

Download reference.kfupm.edu.sa

We present an expressive agent design language for reinforcement learning that allows the user to constrain the policies considered by the learning process.The language includes s...

David Andre, Stuart J. Russell

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers