Search Sciweavers | Sciweavers

340 search results - page 41 / 68

ML
2008
ACM

152 views, Machine Learning

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 2 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, Ré...


ICAC
2006
IEEE

112 views, Applied Computing

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation

15 years 8 months ago

Download userweb.cs.utexas.edu

Reinforcement Learning (RL) provides a promising new approach to systems performance management that differs radically from standard queuing-theoretic approaches making use of ...

Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mo...


ALT
2004
Springer

112 views, Machine Learning

On Kernels, Margins, and Low-Dimensional Mappings

15 years 11 months ago

Download www.cs.cmu.edu

Kernel functions are typically viewed as providing an implicit mapping of points into a high-dimensional space, with the ability to gain much of the power of that space without inc...

Maria-Florina Balcan, Avrim Blum, Santosh Vempala


ICRA
1994
IEEE

105 views, Robotics

Harmonic Functions and Collision Probabilities

15 years 6 months ago

Download www.cs.cmu.edu

There is a close relationship between harmonic functions, which have recently been proposed for path planning, and hitting probabilities for random processes. The hitting probab...

Christopher I. Connolly


IJCNN
2006
IEEE

117 views, Neural Networks

Learning to Rank by Maximizing AUC with Linear Programming

15 years 8 months ago

Download dollar.biz.uiowa.edu

Area Under the ROC Curve (AUC) is often used to evaluate ranking performance in binary classification problems. Several researchers have approached AUC optimization by approxi...

Kaan Ataman, W. Nick Street, Yi Zhang
