Search Sciweavers | Sciweavers

224 search results - page 15 / 45

» Bounding Learning Time in XCS

210

click to vote

INFOCOM
2010
IEEE

207views Communications» more INFOCOM 2010»

Opportunistic Spectrum Access with Multiple Users: Learning under Competition

15 years 5 months ago

Download www.mit.edu

Abstract—The problem of cooperative allocation among multiple secondary users to maximize cognitive system throughput is considered. The channel availability statistics are initi...

Animashree Anandkumar, Nithin Michael, Ao Tang

claim paper

Read More »

206

click to vote

PKDD
2010
Springer

129views Data Mining» more PKDD 2010»

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

15 years 5 months ago

Download www.cs.mcgill.ca

Abstract. Bayesian reinforcement learning (RL) is aimed at making more efﬁcient use of data samples, but typically uses signiﬁcantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

228

click to vote

JMLR
2010

143views more JMLR 2010»

A Quasi-Newton Approach to Nonsmooth Convex Optimization Problems in Machine Learning

15 years 5 months ago

Download www.stat.purdue.edu

We extend the well-known BFGS quasi-Newton method and its memory-limited variant LBFGS to the optimization of nonsmooth convex objectives. This is done in a rigorous fashion by ge...

Jin Yu, S. V. N. Vishwanathan, Simon Günter, ...

claim paper

Read More »

227

click to vote

CIMCA
2008
IEEE

125views Intelligent Agents» more CIMCA 2008»

Tree Exploration for Bayesian RL Exploration

16 years 1 months ago

Download arxiv.org

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The ﬁrst employs a Bayesian framework, ...

Christos Dimitrakakis

posted by olethros

Read More »

186

click to vote

COLT
2001
Springer

100views Machine Learning» more COLT 2001»

Learning Additive Models Online with Fast Evaluating Kernels

15 years 11 months ago

Download www.cs.ucl.ac.uk

Abstract. We develop three new techniques to build on the recent advances in online learning with kernels. First, we show that an exponential speed-up in prediction time per trial ...

Mark Herbster

claim paper

Read More »

« Prev « First page 15 / 45 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers