Sciweavers

WEBI
2009
Springer

Effective Keyword Search for Software Resources Installed in Large-Scale Grid Infrastructures

14 years 6 months ago
Effective Keyword Search for Software Resources Installed in Large-Scale Grid Infrastructures
—In this paper, we investigate the problem of supporting keyword-based searching for the discovery of software resources that are installed on the nodes of largescale, federated Grid computing infrastructures. We address a number of challenges that arise from the unstructured nature of software and the unavailability of software-related metadata on Grid sites. We present Minersoft, a Grid harvester that visits Grid sites, crawls their file-systems, identifies and classifies software resources, and discovers implicit associations between them. The results of Minersoft harvesting are encoded in a weighted, typed graph, named the Software Graph. A number of IR algorithms are used to enrich this graph with structural and content associations, to annotate software resources with keywords, and build inverted indexes to support keyword-based searching for software. Using a real testbed, we present an evaluation study of our approach, using data extracted from a production-quality Grid in...
George Pallis, Asterios Katsifodimos, Marios D. Di
Added 25 May 2010
Updated 25 May 2010
Type Conference
Year 2009
Where WEBI
Authors George Pallis, Asterios Katsifodimos, Marios D. Dikaiakos
Comments (0)