Sciweavers

3274 search results - page 507 / 655
» Using Learning in a Control Agent
NIPS
2001
13 years 10 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
ICSTM
2000
164 views, Management
13 years 10 months ago
Building Sustainable Interest in Modelling in the Classroom
System Dynamics has had a tough time breaking into High Schools. Like all good ideas, the most difficult part is convincing those who would most benefit that this new approach is i...
Gordon Kubanek
ML
2002
ACM
178 views, Machine Learning
13 years 9 months ago
Metric-Based Methods for Adaptive Model Selection and Regularization
We present a general approach to model selection and regularization that exploits unlabeled data to adaptively control hypothesis complexity in supervised learning tasks. The idea ...
Dale Schuurmans, Finnegan Southey
ISCA
2007
IEEE
217 views, Hardware
13 years 9 months ago
Parallel Processing of High-Dimensional Remote Sensing Images Using Cluster Computer Architectures
Hyperspectral sensors represent the most advanced instruments currently available for remote sensing of the Earth. The high spatial and spectral resolution of the images supplied ...
David Valencia, Antonio Plaza, Pablo Martín...
IJDSN
2006
132 views
13 years 9 months ago
Using Misbehavior to Analyze Strategic versus Aggregate Energy Minimization in Wireless Sensor Networks
We present a novel formulation of the problem of energy misbehavior and develop an analytical framework for quantifying its impact on other nodes. Specifically, we formu...
Rajgopal Kannan, Shuangqing Wei, Vasu Chakravarthy...