Sciweavers

3049 search results - page 416 / 610
» On the Convergence of Bound Optimization Algorithms
Sort
View
ICML
1998
IEEE
16 years 5 months ago
The MAXQ Method for Hierarchical Reinforcement Learning
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...
Thomas G. Dietterich
SCALESPACE
2009
Springer
15 years 11 months ago
On Level-Set Type Methods for Recovering Piecewise Constant Solutions of Ill-Posed Problems
We propose a regularization method for solving ill-posed problems, under the assumption that the solutions are piecewise constant functions with unknown level sets and unknown leve...
Adriano DeCezaro, Antonio Leitão, Xue-Cheng...
ICNSC
2007
IEEE
15 years 10 months ago
Analysis of a Simple Feedback Scheme for Error Correction over a Lossy Network
Abstract— In time varying packet-switched networks, delivering data with high reliability using a limited amount of network resources is highly desirable. To capture the trade-of...
Oscar Flardh, Carlo Fischione, Karl Henrik Johanss...
CSR
2007
Springer
15 years 10 months ago
Estimation of the Click Volume by Large Scale Regression Analysis
Abstract. How could one estimate the total number of clicks a new advertisement could potentially receive in the current market? This question, called the click volume estimation p...
Yury Lifshits, Dirk Nowotka
GECCO
2007
Springer
160views Optimization» more  GECCO 2007»
15 years 10 months ago
An analysis of constructive crossover and selection pressure in genetic programming
A common problem in genetic programming search algorithms is destructive crossover in which the offspring of good parents generally has worse performance than the parents. Design...
Huayang Xie, Mengjie Zhang, Peter Andreae