Document page similarity based on layout visual saliency: Application to query by example and document classification