
DILS
2006
Springer

A Method for Similarity-Based Grouping of Biological Data

Similarity-based grouping of data entries in one or more data sources underlies many data management tasks, such as structuring search results, removing redundancy from databases, and data integration. In life science data sources it is not a trivial task, because the stored data is complex, highly correlated, and represented at different levels of granularity. The contribution of this paper is two-fold: 1) we propose a method for similarity-based grouping, and 2) we show results from test cases. The main steps of the method are specification of grouping rules, pairwise grouping between entries, actual grouping of similar entries, and evaluation and analysis of the results. Often, different strategies can be used in the different steps. The method enables exploration of the influence of these choices and supports evaluation of the results with respect to given classifications. The grouping method is illustrated by te...
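The four steps named in the abstract can be sketched roughly as follows. This is an illustration only, not the authors' implementation: the token-based Jaccard rule, the fixed threshold, the connected-components grouping strategy, and the pairwise precision/recall measure are all assumptions chosen for brevity, and every function name is hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Step 1 (assumed grouping rule): token-set Jaccard similarity of two strings."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def similar_pairs(entries, rule, threshold):
    """Step 2: compare every pair of entries and keep the pairs the rule accepts."""
    return [(i, j) for i, j in combinations(range(len(entries)), 2)
            if rule(entries[i], entries[j]) >= threshold]


def group(n, pairs):
    """Step 3 (assumed strategy): merge accepted pairs into connected components."""
    parent = list(range(n))

    def find(x):
        # Follow parent links to the root, shortening the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in pairs:
        parent[find(i)] = find(j)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


def pairwise_scores(groups, labels):
    """Step 4: pairwise precision and recall against a given classification."""
    predicted = {p for g in groups for p in combinations(g, 2)}
    expected = {(i, j) for i, j in combinations(range(len(labels)), 2)
                if labels[i] == labels[j]}
    tp = len(predicted & expected)
    precision = tp / len(predicted) if predicted else 1.0
    recall = tp / len(expected) if expected else 1.0
    return precision, recall


# Made-up entries and class labels, purely for illustration.
entries = ["heat shock protein 70", "heat shock protein 70 kDa",
           "insulin receptor", "insulin receptor precursor"]
labels = ["hsp", "hsp", "insr", "insr"]

pairs = similar_pairs(entries, jaccard, threshold=0.6)
groups = group(len(entries), pairs)
print(groups, pairwise_scores(groups, labels))
```

Swapping the rule, the threshold, or the grouping strategy and re-running the evaluation is one way to explore how the choices at each step influence the result.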
Vaida Jakoniene, David Rundqvist, Patrick Lambrix
Added 22 Aug 2010
Updated 22 Aug 2010
Type Conference
Year 2006
Where DILS
Authors Vaida Jakoniene, David Rundqvist, Patrick Lambrix