Search Sciweavers | Sciweavers

282 search results - page 21 / 57

» Online Learning of Approximate Dependency Parsing Algorithms

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

13 years 9 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

14 years 2 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

click to vote

JMLR
2008

137views more JMLR 2008»

Online Learning of Complex Prediction Problems Using Simultaneous Projections

13 years 7 months ago

Download jmlr.csail.mit.edu

We describe and analyze an algorithmic framework for online classification where each online trial consists of multiple prediction tasks that are tied together. We tackle the prob...

Yonatan Amit, Shai Shalev-Shwartz, Yoram Singer

claim paper

Read More »

click to vote

ALT
2007
Springer

116views Machine Learning» more ALT 2007»

Online Regression Competitive with Changing Predictors

14 years 1 months ago

Download www.clrc.rhul.ac.uk

This paper deals with the problem of making predictions in the online mode of learning where the dependence of the outcome yt on the signal xt can change with time. The Aggregating...

Steven Busuttil, Yuri Kalnishkan

claim paper

Read More »

click to vote

CORR
2008
Springer

173views Education» more CORR 2008»

Decomposition Principles and Online Learning in Cross-Layer Optimization for Delay-Sensitive Applications

13 years 7 months ago

Download documents.scribd.com

In this paper, we propose a general cross-layer optimization framework in which we explicitly consider both the heterogeneous and dynamically changing characteristics of delay-sens...

Fangwen Fu, Mihaela van der Schaar

claim paper

Read More »

« Prev « First page 21 / 57 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers