We consider the problem of learning mixtures of product distributions over discrete domains in the distribution learning framework introduced by Kearns et al. [18]. We give a poly(n/ ) time algorithm for learning a mixture of k arbitrary product distributions over the n-dimensional Boolean cube {0, 1}n to accuracy , for any constant k. Previous polynomial time algorithms could only achieve this for k = 2 product distributions; our result answers an open question stated independently in [8] and [14]. We further give evidence that no polynomial time algorithm can succeed when k is superconstant, by reduction from a notorious open problem in PAC learning. Finally, we generalize our poly(n/ ) time algorithm to learn any mixture of k = O(1) product distributions over {0, 1, . . . , b}n , for any b = O(1). ∗ Supported by an NSF Mathematical Sciences Postdoctoral Research Fellowship † Some of this work was done while at the Institute for Advanced Study, supported in part by the National ...
Jon Feldman, Ryan O'Donnell, Rocco A. Servedio