Sciweavers

57 search results - page 10 / 12
» Evaluation of Text Clustering Algorithms with N-Gram-Based D...
Sort
View
CIKM
2009
Springer
14 years 2 months ago
Leveraging a scalable row store to build a distributed text index
Many content-oriented applications require a scalable text index. Building such an index is challenging. In addition to the logic of inserting and searching documents, developers ...
Ning Li, Jun Rao, Eugene J. Shekita, Sandeep Tata
ECAI
2010
Springer
13 years 7 months ago
Learning to Author Text with textual CBR
Abstract. Textual reuse is an integral part of textual case-based reasoning (TCBR) which deals with solving new problems by reusing previous similar problem-solving experiences doc...
Ibrahim Adeyanju, Nirmalie Wiratunga, Juan A. Reci...
PKDD
2005
Springer
122views Data Mining» more  PKDD 2005»
14 years 29 days ago
A Probabilistic Clustering-Projection Model for Discrete Data
For discrete co-occurrence data like documents and words, calculating optimal projections and clustering are two different but related tasks. The goal of projection is to find a ...
Shipeng Yu, Kai Yu, Volker Tresp, Hans-Peter Krieg...
WWW
2010
ACM
14 years 2 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
COLING
2010
13 years 2 months ago
Topic-Based Bengali Opinion Summarization
In this paper the development of an opinion summarization system that works on Bengali News corpus has been described. The system identifies the sentiment information in each docu...
Amitava Das, Sivaji Bandyopadhyay