Sciweavers

UAI
2008
15 years 5 months ago
Partitioned Linear Programming Approximations for MDPs
Approximate linear programming (ALP) is an efficient approach to solving large factored Markov decision processes (MDPs). The main idea of the method is to approximate the optimal...
Branislav Kveton, Milos Hauskrecht
ISF
2007
15 years 3 months ago
Managing the false alarms: A framework for assurance and verification of surveillance monitoring
This article discusses methods to support assurance of surveillance monitoring and compliance verification knowledge management (CV-KM). The discussion includes aspects of primar...
Peter Goldschmidt
ICRA
2007
IEEE
15 years 10 months ago
Value Function Approximation on Non-Linear Manifolds for Robot Motor Control
The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...
Masashi Sugiyama, Hirotaka Hachiya, Christopher To...
ATAL
1997
Springer
15 years 8 months ago
Toward the Specification and Design of Industrial Synthetic Ecosystems
Many agent-based systems rely for their effectiveness on the intelligence of individual agents, and interaction among agents is required simply to coordinate these individually com...
H. Van Dyke Parunak, John A. Sauter, Steve Clark
JMLR
2010
14 years 10 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...