Sciweavers

286 search results - page 14 / 58
» Learning taxonomic relations from a set of text documents
Sort
View
SAC
2009
ACM
14 years 2 months ago
Combining statistics and semantics via ensemble model for document clustering
Incorporating background knowledge into data mining algorithms is an important but challenging problem. Current approaches in semi-supervised learning require explicit knowledge p...
Samah Jamal Fodeh, William F. Punch, Pang-Ning Tan
WWW
2009
ACM
14 years 8 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth
LREC
2010
170views Education» more  LREC 2010»
13 years 9 months ago
Building a Domain-specific Document Collection for Evaluating Metadata Effects on Information Retrieval
This paper describes the development of a structured document collection containing user-generated text and numerical metadata for exploring the exploitation of metadata in inform...
Walid Magdy, Jinming Min, Johannes Leveling, Garet...
CIKM
2007
Springer
14 years 1 months ago
Developing learning strategies for topic-based summarization
Most up-to-date well-behaved topic-based summarization systems are built upon the extractive framework. They score the sentences based on the associated features by manually assig...
Ouyang You, Sujian Li, Wenjie Li
ECIR
2009
Springer
14 years 4 months ago
On Automatic Plagiarism Detection Based on n-Grams Comparison
Abstract. When automatic plagiarism detection is carried out considering a reference corpus, a suspicious text is compared to a set of original documents in order to relate the pla...
Alberto Barrón-Cedeño, Paolo Rosso