We analyze the capacity scaling laws of wireless networks where the spatial distribution of nodes over the network area exhibits a high degree of clustering. In particular we consider the presence of heterogeneous clusters, both in size and in population, which are common in many real systems. We completely characterize the scaling exponent of the resulting network capacity by providing upper and lower bounds which differ at most by a poly-logarithmic factor in the number of nodes.