Abstract

Homepages usually describe important semantic information about conceptual or physical entities, and are hence the main targets for searching and browsing. To facilitate semantic-based information retrieval (IR) at a Web site, homepages can be identified and classified under pre-defined concepts, and these concepts can then be used as query or browsing criteria, e.g., finding professor homepages containing "information retrieval". In some Web sites, relationships may also exist among homepages. These relationship instances (also known as homepage relationships) enrich our knowledge about these Web sites and allow more expressive semantic-based IR. In this paper, we investigate the features to be used in mining homepage relationships. We systematically develop different classes of inter-homepage features, namely, navigation, relative-location, and common-item features. We also propose deriving for each homepage a set of support pages so as to obtain richer and more co...