Search Sciweavers | Sciweavers

113 search results - page 15 / 23

» Learning Representation and Control in Continuous Markov Dec...

JMLR
2006

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 5 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

CDC
2008
IEEE

Approximate abstractions of discrete-time controlled stochastic hybrid systems

16 years 6 days ago

Download hybrid.stanford.edu

This work proposes a procedure to c...

Alessandro D'Innocenzo, Alessandro Abate, Maria Do...

IIS
2001

The Development of the AQ20 Learning System and Initial Experiments

15 years 7 months ago

Download cs.gmu.edu

Research on a new system implementing the AQ learning methodology, called AQ20, is briefly described, and illustrated by initial results from an experimental version. Like its pr...

Guido Cervone, Liviu Panait, Ryszard S. Michalski

GLOBECOM
2006
IEEE

Adaptive Learning of Transmission Control Policies for MIMO Fading Channels under Delay Constraint

15 years 11 months ago

Download www.ece.ubc.ca

This paper addresses learning-based adaptive resource allocation for wireless MIMO channels with Markovian fading. The problem is posed as a Constrained Markov Decision Process w...

Dejan V. Djonin, Vikram Krishnamurthy

AGENTS
1997
Springer

Integrating Communicative Action, Conversations and Decision Theory to Coordinate Agents

15 years 10 months ago

Download sigart.acm.org

The coordination problem in multi-agent systems is the problem of managing dependencies between the activities of autonomous agents, in conditions of incomplete knowledge about th...

Mihai Barbuceanu, Mark S. Fox
