Search Sciweavers | Sciweavers

254

ABIALS
2008
Springer

255views Artificial Intelligence» more ABIALS 2008»

Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning

15 years 9 months ago

Abstract. In order to establish autonomous behavior for technical systems, the well known trade-off between reactive control and deliberative planning has to be considered. Within ...

Matthias Rungger, Hao Ding, Olaf Stursberg

claim paper

Read More »

161

click to vote

PIMRC
2008
IEEE

101views Communications» more PIMRC 2008»

A game theoretic framework for decentralized power allocation in IDMA systems

16 years 1 months ago

Download www.lss.supelec.fr

Abstract—In this contribution we present a decentralized power allocation algorithm for the uplink interleave division multiple access (IDMA) channel. Within the proposed optimal...

Samir Medina Perlaza, Laura Cottatellucci, M&eacut...

claim paper

Read More »

182

click to vote

EMSOFT
2005
Springer

142views Software Engineering» more EMSOFT 2005»

Communication strategies for shared-bus embedded multiprocessors

16 years 18 days ago

Download www.ece.umd.edu

Abstract— This paper explores the problem of efﬁciently ordering interprocessor communication operations in both statically and dynamically-scheduled multiprocessors for iterat...

Neal K. Bambha, Shuvra S. Bhattacharyya

claim paper

Read More »

206

click to vote

ECML
2004
Springer

112views Machine Learning» more ECML 2004»

Convergence and Divergence in Standard and Averaging Reinforcement Learning

16 years 13 days ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering

claim paper

Read More »

167

click to vote

ATAL
2006
Springer

118views Intelligent Agents» more ATAL 2006»

Exact solutions of interactive POMDPs using behavioral equivalence

15 years 10 months ago

Download www.cs.uic.edu

We present a method for transforming the infinite interactive state space of interactive POMDPs (I-POMDPs) into a finite one, thereby enabling the computation of exact solutions. ...

Bharaneedharan Rathnasabapathy, Prashant Doshi, Pi...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers