Sciweavers

88 search results - page 6 / 18
» nips 2000
Sort
View
NIPS
2000
13 years 9 months ago
Balancing Multiple Sources of Reward in Reinforcement Learning
For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...
Christian R. Shelton
NIPS
2000
13 years 9 months ago
The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity
We describe a joint probabilistic model for modeling the contents and inter-connectivity of document collections such as sets of web pages or research paper archives. The model is...
David A. Cohn, Thomas Hofmann
NIPS
2000
13 years 9 months ago
From Margin to Sparsity
We present an improvement of Noviko 's perceptron convergence theorem. Reinterpreting this mistakebound as a margindependent sparsity guarantee allows us to give a PAC{style ...
Thore Graepel, Ralf Herbrich, Robert C. Williamson
NIPS
2000
13 years 9 months ago
The Use of Classifiers in Sequential Inference
We study the problem of combining the outcomes of several different classifiers in a way that provides a coherent inference that satisfies some constraints. In particular, we deve...
Vasin Punyakanok, Dan Roth
NIPS
2000
13 years 9 months ago
Programmable Reinforcement Learning Agents
We present an expressive agent design language for reinforcement learning that allows the user to constrain the policies considered by the learning process.The language includes s...
David Andre, Stuart J. Russell