Sciweavers

5331 search results - page 1000 / 1067
» Algebraic Methods for Optimization Problems
ICML
2010
IEEE
13 years 7 months ago
Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda
Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...
Carlton Downey, Scott Sanner
IPPS
2010
IEEE
13 years 6 months ago
Highly scalable parallel sorting
Sorting is a commonly used process with a wide breadth of applications in the high performance computing field. Early research in parallel processing has provided us with comprehen...
Edgar Solomonik, Laxmikant V. Kalé
MASS
2010
13 years 6 months ago
VAN: Vehicle-assisted shortest-time path navigation
Traffic congestion is a very serious problem in large cities. With the number of vehicles increasing rapidly, especially in cities whose economy is booming, the situation is gettin...
Wenping Chen, Sencun Zhu, Deying Li
PKDD
2010
Springer
13 years 6 months ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone
PRICAI
2010
Springer
13 years 6 months ago
Sparse Representation: Extract Adaptive Neighborhood for Multilabel Classification
Unlike traditional classification tasks, multilabel classification allows a sample to associate with more than one label. This generalization naturally arises the difficulty in cla...
Shuo Xiang, Songcan Chen, Lishan Qiao