Sciweavers

450 search results - page 60 / 90
» Content Collection for the Labelling of Health-Related Web C...
Sort
View
ICDM
2006
IEEE
138views Data Mining» more  ICDM 2006»
14 years 2 months ago
Adding Semantics to Email Clustering
This paper presents a novel algorithm to cluster emails according to their contents and the sentence styles of their subject lines. In our algorithm, natural language processing t...
Hua Li, Dou Shen, Benyu Zhang, Zheng Chen, Qiang Y...
KDD
2002
ACM
147views Data Mining» more  KDD 2002»
14 years 9 months ago
A parallel learning algorithm for text classification
Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify te...
Canasai Kruengkrai, Chuleerat Jaruskulchai
TSD
2010
Springer
13 years 7 months ago
Evaluation of a Sentence Ranker for Text Summarization Based on Roget's Thesaurus
Abstract. Evaluation is one of the hardest tasks in automatic text summarization. It is perhaps even harder to determine how much a particular component of a summarization system c...
Alistair Kennedy, Stan Szpakowicz
WWW
2002
ACM
14 years 9 months ago
A machine learning based approach for table detection on the web
Table is a commonly used presentation scheme, especially for describing relational information. However, table understanding remains an open problem. In this paper, we consider th...
Yalin Wang, Jianying Hu
DSS
2006
184views more  DSS 2006»
13 years 8 months ago
Building a scientific knowledge web portal: The NanoPort experience
There has been a tremendous growth in the amount of information and resources on the World Wide Web that are useful to researchers and practitioners in science domains. While the ...
Michael Chau, Zan Huang, Jialun Qin, Yilu Zhou, Hs...