Sciweavers

266 search results - page 25 / 54
» Effective Summarization Method of Text Documents
SIGIR
2004
ACM
14 years 3 months ago
Focused named entity recognition using machine learning
In this paper we study the problem of finding the most topical named entities among all entities in a document, which we refer to as focused named entity recognition. We show that th...
Li Zhang, Yue Pan, Tong Zhang
DAS
2008
Springer
13 years 11 months ago
A Fast Preprocessing Method for Table Boundary Detection: Narrowing Down the Sparse Lines Using Solely Coordinate Information
With the rapid growth of PDF documents in digital libraries, recognizing the document structure and detecting specific document components are useful for document storage, classifica...
Ying Liu, Prasenjit Mitra, C. Lee Giles
WWW
2006
ACM
14 years 10 months ago
Large-scale text categorization by batch mode active learning
Large-scale text categorization is an important research topic for Web data mining. One of the challenges in large-scale text categorization is how to reduce the amount of human e...
Steven C. H. Hoi, Rong Jin, Michael R. Lyu
SIGIR
2010
ACM
13 years 8 months ago
EUSUM: extracting easy-to-understand English summaries for non-native readers
In this paper we investigate a novel and important problem in multi-document summarization, i.e., how to extract an easy-to-understand English summary for non-native readers. Exist...
Xiaojun Wan, Huiying Li, Jianguo Xiao
WEBI
2007
Springer
14 years 4 months ago
Pairwise Constraints-Guided Non-negative Matrix Factorization for Document Clustering
Nonnegative Matrix Factorization (NMF) has been proven to be effective in text mining. However, since NMF is a well-known unsupervised components analysis technique, the existing ...
Yujiu Yang, Bao-Gang Hu