Search Sciweavers | Sciweavers

802 search results - page 134 / 161

» Experts in a Markov Decision Process

164

click to vote

GLOBECOM
2007
IEEE

116views Communications» more GLOBECOM 2007»

Cross-Layer Call Admission Control for a CDMA Uplink Employing a Base-Station Antenna Array

16 years 2 days ago

Download post.queensu.ca

— A novel cross-layer call admission control policy is proposed for a general CDMA beamforming system. In contrast to previously proposed call admission control (CAC) policies wh...

Wei Sheng, Steven D. Blostein

claim paper

Read More »

156

click to vote

GLOBECOM
2007
IEEE

156views Communications» more GLOBECOM 2007»

Constrained Stochastic Games in Wireless Networks

16 years 2 days ago

Download www.supelec.fr

—We consider the situation where N nodes share a common access point. With each node i there is an associated buffer and channel state that change in time. Node i dynamically cho...

Eitan Altaian, Konstantin Avrachenkov, Nicolas Bon...

claim paper

Read More »

153

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Combinatorial resource scheduling for multiagent MDPs

15 years 12 months ago

Download ai.stanford.edu

Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial ...

Dmitri A. Dolgov, Michael R. James, Michael E. Sam...

claim paper

Read More »

158

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

15 years 12 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

186

click to vote

ROBOCUP
2007
Springer

99views Robotics» more ROBOCUP 2007»

Instance-Based Action Models for Fast Action Planning

15 years 12 months ago

Download userweb.cs.utexas.edu

Abstract. Two main challenges of robot action planning in real domains are uncertain action eﬀects and dynamic environments. In this paper, an instance-based action model is lear...

Mazda Ahmadi, Peter Stone

claim paper

Read More »

« Prev « First page 134 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers