Sciweavers

161 search results - page 4 / 33
» Improving Similarity Measures for Short Segments of Text
Sort
View
ACL
2007
13 years 8 months ago
Finding document topics for improving topic segmentation
Topic segmentation and identification are often tackled as separate problems whereas they are both part of topic analysis. In this article, we study how topic identification can...
Olivier Ferret
ICTAI
2007
IEEE
14 years 1 months ago
On Evaluation Methodologies for Text Segmentation Algorithms
The WindowDiff evaluation measure [12] is becoming the standard criterion for evaluating text segmentation methods. Nevertheless, this metric is really not fair with regard to the...
Sylvain Lamprier, Tassadit Amghar, Bernard Levrat,...
IPM
2008
141views more  IPM 2008»
13 years 7 months ago
Towards a unified approach to document similarity search using manifold-ranking of blocks
Document similarity search (i.e. query by example) aims to retrieve a ranked list of documents similar to a query document in a text corpus or on the Web. Most existing approaches...
Xiaojun Wan, Jianwu Yang, Jianguo Xiao
ICASSP
2008
IEEE
14 years 1 months ago
Using corpus and knowledge-based similarity measure in Maximum Marginal Relevance for meeting summarization
MMR (Maximum Marginal Relevance) is widely used in summarization for its simplicity and efficacy, and has been demonstrated to achieve comparable performance to other approaches ...
Shasha Xie, Yang Liu
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
14 years 7 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney