Search Sciweavers | Sciweavers

154 search results - page 22 / 31

» Sample-Efficient Evolutionary Function Approximation for Rei...

188

click to vote

ATAL
2009
Springer

134views Intelligent Agents» more ATAL 2009»

Replicator Dynamics for Multi-agent Learning: An Orthogonal Approach

15 years 4 months ago

Download wwwis.win.tue.nl

Today's society is largely connected and many real life applications lend themselves to be modeled as multi-agent systems. Although such systems as well as their models are d...

Michael Kaisers, Karl Tuyls

claim paper

Read More »

214

click to vote

JAIR
2002

163views more JAIR 2002»

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

15 years 6 months ago

Download www.jair.org

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is main...

Xin Xu, Hangen He, Dewen Hu

claim paper

Read More »

210

click to vote

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 6 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

176

click to vote

ICAC
2006
IEEE

112views Applied Computing» more ICAC 2006»

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation

16 years 22 days ago

Download userweb.cs.utexas.edu

— Reinforcement Learning (RL) provides a promising new approach to systems performance management that differs radically from standard queuing-theoretic approaches making use of ...

Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mo...

claim paper

Read More »

196

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 6 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

« Prev « First page 22 / 31 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers