Sciweavers

231 search results - page 41 / 47
» Active Learning in Partially Observable Markov Decision Proc...
Sort
View
149
Voted
CORR
2012
Springer
229views Education» more  CORR 2012»
13 years 11 months ago
Cops and Invisible Robbers: the Cost of Drunkenness
We examine a version of the Cops and Robber (CR) game in which the robber is invisible, i.e., the cops do not know his location until they capture him. Apparently this game (CiR) h...
Athanasios Kehagias, Dieter Mitsche, Pawel Pralat
160
Voted
ICML
1999
IEEE
16 years 4 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
115
Voted
TR
2010
126views Hardware» more  TR 2010»
14 years 10 months ago
Optimal Maintenance Strategies for Wind Turbine Systems Under Stochastic Weather Conditions
Abstract--We examine optimal repair strategies for wind turbines operated under stochastic weather conditions. In-situ sensors installed at wind turbines produce useful information...
Eunshin Byon, Lewis Ntaimo, Yu Ding
158
Voted
ICML
2007
IEEE
16 years 4 months ago
Multi-task reinforcement learning: a hierarchical Bayesian approach
We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...
Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...
167
Voted
JMM2
2006
138views more  JMM2 2006»
15 years 3 months ago
A Framework for Constructing Real-time Immersive Environments for Training Physical Activities
Abstract-- This paper describes a framework for constructing a three-dimensional immersive environment that can be used for training physical activities. The system is designed to ...
Sang-Hack Jung, Ruzena Bajcsy