The problem of identifying approximately duplicate objects in databases is an essential step for the information integration process. Most existing approaches have relied on gener...
In modern multimedia databases, objects can be represented by a large variety of feature representations. In order to employ all available information in a best possible way, a joi...
Hans-Peter Kriegel, Peter Kunath, Alexey Pryakhin,...
For dynamic sales dialogs in electronic commerce scenarios, approaches based on an information gain measure used for attribute selection have been suggested. These measures conside...
A variety of information extraction techniques rely on the fact that instances of the same relation are "distributionally similar," in that they tend to appear in simila...
Current web search engines focus on searching only the most recent snapshot of the web. In some cases, however, it would be desirable to search over collections that include many ...