Sciweavers

672 search results for "Policy Search by Dynamic Programming" - page 55 / 135
AAAI
2004
13 years 9 months ago
Stochastic Local Search for POMDP Controllers
The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their ...
Darius Braziunas, Craig Boutilier
INFOCOM
2007
IEEE
14 years 2 months ago
Cost and Collision Minimizing Forwarding Schemes for Wireless Sensor Networks
The paper presents a novel integrated MAC/routing scheme for wireless sensor networking. Our design objective is to elect the next hop for data forwarding by minimizing the numb...
Michele Rossi, Nicola Bui, Michele Zorzi
SWAT
2004
Springer
14 years 1 months ago
Railway Delay Management: Exploring Its Algorithmic Complexity
We consider delay management in railway systems. Given delayed trains, we want to find a waiting policy for the connecting trains minimizing the weighted total passenger delay. If...
Michael Gatto, Björn Glaus, Riko Jacob, Leon ...
INFOCOM
2002
IEEE
14 years 23 days ago
Optimal Energy Allocation and Admission Control for Communications Satellites
We address the issue of optimal energy allocation and admission control for communications satellites in earth orbit. Such satellites receive requests for transmission as they o...
Alvin Fu, Eytan Modiano, John N. Tsitsiklis
ICML
2000
IEEE
14 years 6 days ago
A Bayesian Framework for Reinforcement Learning
The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...
Malcolm J. A. Strens