Search Sciweavers | Sciweavers

200 search results - page 22 / 40

» Point-Based Policy Iteration

192

click to vote

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 6 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

178

click to vote

ISLPED
1999
ACM

91views Hardware» more ISLPED 1999»

Stochastic modeling of a power-managed system: construction and optimization

15 years 11 months ago

Download hydrogen.ws.binghamton.edu

-- The goal of a dynamic power management policy is to reduce the power consumption of an electronic system by putting system components into different states, each representing ce...

Qinru Qiu, Qing Wu, Massoud Pedram

claim paper

Read More »

166

click to vote

ECAI
2006
Springer

194views Artificial Intelligence» more ECAI 2006»

Strategic Foresighted Learning in Competitive Multi-Agent Games

15 years 10 months ago

Download homepages.cwi.nl

We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-agent games. We make the observation that in a competitive setting with adaptive...

Pieter Jan't Hoen, Sander M. Bohte, Han La Poutr&e...

claim paper

Read More »

171

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 6 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

179

Voted

JSAC
2007

98views more JSAC 2007»

Optimum Power Allocation for Single-User MIMO and Multi-User MIMO-MAC with Partial CSI

15 years 6 months ago

Download www.ece.umd.edu

Abstract— We consider both the single-user and the multiuser power allocation problems in MIMO systems, where the receiver side has the perfect channel state information (CSI), a...

Alper Soysal, Sennur Ulukus

claim paper

Read More »

« Prev « First page 22 / 40 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers