Sciweavers

341 search results - page 19 / 69
» Numerical Approximation of a Control Problem for Advection-D...
Sort
View
PKDD
2010
Springer
179views Data Mining» more  PKDD 2010»
13 years 6 months ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone
ICC
2009
IEEE
129views Communications» more  ICC 2009»
14 years 3 months ago
Restless Watchdog: Monitoring Multiple Bands with Blind Period in Cognitive Radio Systems
— Spectrum sensing, which monitors the spectrum activity, is studied for cognitive radio systems using multiple frequency bands with non-negligible band switching time (blind per...
Husheng Li
CDC
2009
IEEE
159views Control Systems» more  CDC 2009»
14 years 1 months ago
A distributed machine learning framework
Abstract— A distributed online learning framework for support vector machines (SVMs) is presented and analyzed. First, the generic binary classification problem is decomposed in...
Tansu Alpcan, Christian Bauckhage
SIAMSC
2011
177views more  SIAMSC 2011»
13 years 3 months ago
Computing f(A)b via Least Squares Polynomial Approximations
Given a certain function f, various methods have been proposed in the past for addressing the important problem of computing the matrix-vector product f(A)b without explicitly comp...
Jie Chen, Mihai Anitescu, Yousef Saad
CDC
2009
IEEE
210views Control Systems» more  CDC 2009»
13 years 6 months ago
On maximum lifetime routing in Wireless Sensor Networks
Abstract-- Lifetime maximization is an important optimization problem specific to Wireless Sensor Networks (WSNs) since they operate with limited energy resources which are therefo...
Xu Ning, Christos G. Cassandras