Sciweavers

317 search results - page 5 / 64
» Style-independent document labeling: design and performance ...
Sort
View
JSS
2006
76views more  JSS 2006»
13 years 7 months ago
Performance evaluation of peer-to-peer Web caching systems
Peer-to-peer Web caching has attracted a great attention from the research community recently, and is one of the potential peer-topeer applications. In this paper, we systematical...
Weisong Shi, Yonggen Mao
KDD
2003
ACM
157views Data Mining» more  KDD 2003»
14 years 8 months ago
Cross-training: learning probabilistic mappings between topics
Classification is a well-established operation in text mining. Given a set of labels A and a set DA of training documents tagged with these labels, a classifier learns to assign l...
Sunita Sarawagi, Soumen Chakrabarti, Shantanu Godb...
SIGIR
2002
ACM
13 years 7 months ago
Document clustering with cluster refinement and model selection capabilities
In this paper, we propose a document clustering method that strives to achieve: (1) a high accuracy of document clustering, and (2) the capability of estimating the number of clus...
Xin Liu, Yihong Gong, Wei Xu, Shenghuo Zhu
ICDAR
2011
IEEE
12 years 7 months ago
Continuous CRF with Multi-scale Quantization Feature Functions Application to Structure Extraction in Old Newspaper
—We introduce quantization feature functions to represent continuous or large range discrete data into the symbolic CRF data representation. We show that doing this convertion in...
David Hebert, Thierry Paquet, Stéphane Nico...
WEBI
2009
Springer
14 years 2 months ago
Social Semantics and Its Evaluation by Means of Semantic Relatedness and Open Topic Models
—This paper presents an approach using social semantics for the task of topic labelling by means of Open Topic Models. Our approach utilizes a social ontology to create an alignm...
Ulli Waltinger, Alexander Mehler