Sciweavers

572 search results - page 74 / 115
» Winnowing-based text clustering
Sort
View
BMCBI
2006
153views more  BMCBI 2006»
13 years 10 months ago
Automatic document classification of biological literature
Background: Document classification is a wide-spread problem with many applications, from organizing search engine snippets to spam filtering. We previously described Textpresso, ...
David Chen, Hans-Michael Müller, Paul W. Ster...
SGAI
2007
Springer
14 years 4 months ago
Metrics for Mining Multisets
Abstract. We propose a new class of distance measures (metrics) designed for multisets, both of which are a recurrent theme in many data mining applications. One particular instanc...
Walter A. Kosters, Jeroen F. J. Laros
ACL
2008
13 years 11 months ago
Inferring Activity Time in News through Event Modeling
Many applications in NLP, such as questionanswering and summarization, either require or would greatly benefit from the knowledge of when an event occurred. Creating an effective ...
Vladimir Eidelman
SPIESR
2003
136views Database» more  SPIESR 2003»
13 years 11 months ago
Media segmentation using self-similarity decomposition
We present a framework for analyzing the structure of digital media streams. Though our methods work for video, text, and audio, we concentrate on detecting the structure of digit...
Jonathan Foote, Matthew L. Cooper
CORR
2008
Springer
87views Education» more  CORR 2008»
13 years 10 months ago
Visualization of association graphs for assisting the interpretation of classifications
Given a query on the PASCAL database maintained by the INIST, we design user interfaces to visualize and wo types of graphs extracted from abstracts: 1) the graph of all associati...
Eric SanJuan, Ivana Roche