The empirical curve bounding problem is de ned as follows. Suppose data vectors X Y are presented such that E(Y i]) = f(X i]) where f(x) is an unknown function. The problem is to analyze X Y and obtain complexity bounds O(gu(x)) and (gl(x)) on the function f(x). As no algorithm for empirical curve bounding can be guaranteed correct, we consider heuristics. Five heuristic algorithmsare presented here, together with analyticalresults guaranteeing correctness for certain familiesof functions. Experimental evaluations of the correctness and tightness of bounds obtained by the rules for several constructed functions f(x) and real datasets are described. A hybrid method is shown to have very good performance on some kinds of functions, suggesting a general, iterative re nement procedure in which diagnostic features of the results of applying particular methods can be used to select additional methods.
Catherine C. McGeoch, Doina Precup, Paul R. Cohen