Search Sciweavers | Sciweavers

236 search results - page 35 / 48

» Non-linear dynamics in multiagent reinforcement learning alg...


SIGDIAL
2010

Natural Language Processing

Sparse Approximate Dynamic Programming for Dialog Management

15 years 5 months ago

Download www.sigdial.org

Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet, there is still a clear mismatch between the comp...

Senthilkumar Chandramohan, Matthieu Geist, Olivier...


JMLR
2008


Theoretical Advantages of Lenient Learners: An Evolutionary Game Theoretic Perspective

15 years 7 months ago

Download jmlr.csail.mit.edu

This paper presents the dynamics of multiple learning agents from an evolutionary game theoretic perspective. We provide replicator dynamics models for cooperative coevolutionary ...

Liviu Panait, Karl Tuyls, Sean Luke


NIPS
1993

Information Technology

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 8 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms, includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...


ADHOCNETS
2010
Springer

Computer Networks

DCLA: A Duty-Cycle Learning Algorithm for IEEE 802.15.4 Beacon-Enabled WSNs

15 years 4 months ago

Download www.aws.cit.ie

The current specification for IEEE 802.15.4 beacon-enabled networks does not define how active and sleep schedules should be configured in order to achieve the optimal network perf...

Rodolfo de Paz Alberola, Dirk Pesch


JCP
2007


Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 7 months ago

Download www.academypublisher.com

We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio
