Search Sciweavers | Sciweavers

» Point-Based Policy Iteration

PIMRC 2008, IEEE (Communications)

A game theoretic framework for decentralized power allocation in IDMA systems

Download www.lss.supelec.fr

Abstract—In this contribution we present a decentralized power allocation algorithm for the uplink interleave division multiple access (IDMA) channel. Within the proposed optimal...

Samir Medina Perlaza, Laura Cottatellucci, Mé...

EMSOFT 2005, Springer (Software Engineering)

Communication strategies for shared-bus embedded multiprocessors

Download www.ece.umd.edu

Abstract—This paper explores the problem of efficiently ordering interprocessor communication operations in both statically and dynamically-scheduled multiprocessors for iterat...

Neal K. Bambha, Shuvra S. Bhattacharyya

ECML 2004, Springer (Machine Learning)

Convergence and Divergence in Standard and Averaging Reinforcement Learning

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering

ATAL 2006, Springer (Intelligent Agents)

Exact solutions of interactive POMDPs using behavioral equivalence

Download www.cs.uic.edu

We present a method for transforming the infinite interactive state space of interactive POMDPs (I-POMDPs) into a finite one, thereby enabling the computation of exact solutions. ...

Bharaneedharan Rathnasabapathy, Prashant Doshi, Pi...

NIPS 2003 (Information Technology)

Gaussian Processes in Reinforcement Learning

Download books.nips.cc

We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP mod...

Carl Edward Rasmussen, Malte Kuss
