Sciweavers

238 search results - page 13 / 48
» Value-Function Approximations for Partially Observable Marko...
Sort
View
153
Voted
AAAI
2012
13 years 5 months ago
Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication
Planning under uncertainty is an important and challenging problem in multiagent systems. Multiagent Partially Observable Markov Decision Processes (MPOMDPs) provide a powerful fr...
Frans Adriaan Oliehoek, Matthijs T. J. Spaan
116
Voted
IJCAI
2007
15 years 4 months ago
The Value of Observation for Monitoring Dynamic Systems
We consider the fundamental problem of monitoring (i.e. tracking) the belief state in a dynamic system, when the model is only approximately correct and when the initial belief st...
Eyal Even-Dar, Sham M. Kakade, Yishay Mansour
160
Voted
ICML
1999
IEEE
16 years 4 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
153
Voted
ICML
1995
IEEE
16 years 4 months ago
Learning Policies for Partially Observable Environments: Scaling Up
Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...
Michael L. Littman, Anthony R. Cassandra, Leslie P...
129
Voted
GLOBECOM
2010
IEEE
15 years 1 months ago
Cooperative Relay Scheduling under Partial State Information in Energy Harvesting Sensor Networks
Abstract--Sensors equipped with energy harvesting and cooperative communication capabilities are a viable solution to the power limitations of Wireless Sensor Networks (WSNs) assoc...
Huijiang Li, Neeraj Jaggi, Biplab Sikdar