Sciweavers

NIPS
2001
14 years 29 days ago
Active Information Retrieval
When a client interacts with an expert, e.g. a doctor, it falls upon the expert to ask questions that steer the process towards fulfilling the client's needs. This is most ef...
Tommi Jaakkola, Hava T. Siegelmann
NIPS
2001
14 years 29 days ago
Information Geometrical Framework for Analyzing Belief Propagation Decoder
The mystery of belief propagation (BP) decoder, especially of the turbo decoding, is studied from information geometrical viewpoint. The loopy belief network (BN) of turbo codes m...
Shiro Ikeda, Toshiyuki Tanaka, Shun-ichi Amari
NIPS
2001
14 years 29 days ago
Distribution of Mutual Information
Mutual information is widely used, in a descriptive way, to measure the stochastic dependence of categorical random variables. In order to address questions such as the reliabilit...
M. Hutter
NIPS
2001
14 years 29 days ago
The Method of Quantum Clustering
We propose a novel clustering method that is an extension of ideas inherent to scale-space clustering and support-vector clustering. Like the latter, it associates every data poin...
David Horn, Assaf Gottlieb
NIPS
2001
14 years 29 days ago
Algorithmic Luckiness
Classical statistical learning theory studies the generalisation performance of machine learning algorithms rather indirectly. One of the main detours is that algorithms are studi...
Ralf Herbrich, Robert C. Williamson
NIPS
2001
14 years 29 days ago
Categorization by Learning and Combining Object Parts
We describe an algorithm for automatically learning discriminative components of objects with SVM classifiers. It is based on growing image parts by minimizing theoretical bounds ...
Bernd Heisele, Thomas Serre, Massimiliano Pontil, ...
NIPS
2001
14 years 29 days ago
A theory of neural integration in the head-direction system
Integration in the head-direction system is a computation by which horizontal angular head velocity signals from the vestibular nuclei are integrated to yield a neural representat...
Richard H. R. Hahnloser, Xiaohui Xie, H. Sebastian...
NIPS
2001
14 years 29 days ago
Multiagent Planning with Factored MDPs
We present a principled and efficient planning algorithm for cooperative multiagent dynamic systems. A striking feature of our method is that the coordination and communication be...
Carlos Guestrin, Daphne Koller, Ronald Parr
NIPS
2001
14 years 29 days ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar
NIPS
2001
14 years 29 days ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...