Sciweavers

205 search results - page 30 / 41
» On Parameterized Approximability
Sort
View
NIPS
1998
13 years 8 months ago
Gradient Descent for General Reinforcement Learning
A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...
Leemon C. Baird III, Andrew W. Moore
VIS
2009
IEEE
199views Visualization» more  VIS 2009»
14 years 8 months ago
Multi-Scale Surface Descriptors
Local shape descriptors compactly characterize regions of a surface, and have been applied to tasks in visualization, shape matching, and analysis. Classically, curvature has be us...
Gregory Cipriano, George N. Phillips Jr., Michae...
ECML
2005
Springer
14 years 28 days ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
INFOCOM
2005
IEEE
14 years 1 months ago
Spatial energy balancing in large-scale wireless multihop networks
— In this paper we investigate the use of proactive multipath routing to achieve energy efficient operation of ad hoc wireless networks. The focus is on optimizing trade-offs be...
Seung Jun Baek, Gustavo de Veciana
KDD
2006
ACM
134views Data Mining» more  KDD 2006»
14 years 7 months ago
Learning to rank networked entities
Several algorithms have been proposed to learn to rank entities modeled as feature vectors, based on relevance feedback. However, these algorithms do not model network connections...
Alekh Agarwal, Soumen Chakrabarti, Sunny Aggarwal