Search Sciweavers | Sciweavers

50 search results - page 8 / 10

» Convergence and Divergence in Standard and Averaging Reinfor...

click to vote

GLOBECOM
2006
IEEE

160views Communications» more GLOBECOM 2006»

Adaptive Learning of Transmission Control Policies for MIMO Fading Channels under Delay Constraint

14 years 1 months ago

Download www.ece.ubc.ca

— This paper addresses learning based adaptive resource allocation for wireless MIMO channels with Markovian fading. The problem is posed as Constrained Markov Decision Process w...

Dejan V. Djonin, Vikram Krishnamurthy

claim paper

Read More »

click to vote

ATAL
2007
Springer

162views Intelligent Agents» more ATAL 2007»

Model-based function approximation in reinforcement learning

14 years 1 months ago

Download userweb.cs.utexas.edu

Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

click to vote

ICML
2007
IEEE

146views Machine Learning» more ICML 2007»

Best of both: a hybridized centroid-medoid clustering heuristic

14 years 8 months ago

Download www.machinelearning.org

Although each iteration of the popular kMeans clustering heuristic scales well to larger problem sizes, it often requires an unacceptably-high number of iterations to converge to ...

Nizar Grira, Michael E. Houle

claim paper

Read More »

click to vote

ATAL
2009
Springer

172views Intelligent Agents» more ATAL 2009»

Integrating organizational control into multi-agent learning

14 years 2 months ago

Download www.aamas-conference.org

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in largescale systems. In this work, we develop an organization-b...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

click to vote

PKDD
2010
Springer

169views Data Mining» more PKDD 2010»

Efficient and Numerically Stable Sparse Learning

13 years 5 months ago

Download www.cs.columbia.edu

We consider the problem of numerical stability and model density growth when training a sparse linear model from massive data. We focus on scalable algorithms that optimize certain...

Sihong Xie, Wei Fan, Olivier Verscheure, Jiangtao ...

claim paper

Read More »

« Prev « First page 8 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers