Sciweavers

1167 search results - page 95 / 234
» policy 2007
Sort
View
JSW
2007
191views more  JSW 2007»
15 years 4 months ago
Building Self-Configuring Data Centers with Cross Layer Coevolution
Abstract—This paper describes a biologically-inspired architecture, called SymbioticSphere, which allows data centers to autonomously adapt to dynamic environmental changes. Symb...
Paskorn Champrasert, Junichi Suzuki
NIPS
1998
15 years 5 months ago
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms
In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...
Michael J. Kearns, Satinder P. Singh
NETWORKING
2004
15 years 5 months ago
Hierarchical Routing with QoS Constraints in Optical Transport Networks
Abstract. Optical Transport Networks (OTN) with automatical switching capabilities are named ASON. Hierarchical routing is required in the ASON recommendations to achieve scalabili...
Xavier Masip-Bruin, Sergio Sánchez-Ló...
IJCAI
2003
15 years 5 months ago
Use of Off-line Dynamic Programming for Efficient Image Interpretation
An interpretation system finds the likely mappings from portions of an image to real-world objects. An interpretation policy specifies when to apply which imaging operator, to whi...
Ramana Isukapalli, Russell Greiner
NIPS
2007
15 years 5 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...