Clustering documents with an exponential-family approximation of the Dirichlet compound multinomial distribution