In some retrieval situations, a system must search across multiple collections. This task, referred to as federated search, occurs for example when searching a distributed index o...
Supervised learning is difficult with high dimensional input spaces and very small training sets, but accurate classification may be possible if the data lie on a low-dimensional ...
Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a function that can resolve when a pair of records refer t...
Background: The information from different data sets experimented under different conditions may be inconsistent even though they are performed with the same research objectives. ...
Ki-Yeol Kim, Dong Hyuk Ki, Hei-Cheul Jeung, Hyun C...
Abstract. To make effective use of distributed information, it is desirable to allow coordination and collaboration among various information sources. This paper deals with cluster...