Sciweavers

967 search results - page 125 / 194
» Text Mining
ICDM
2007
IEEE
15 years 8 months ago
Semi-supervised Clustering Using Bayesian Regularization
Text clustering is most commonly treated as a fully automated task without user supervision. However, we can improve clustering performance using supervision in the form of pairwi...
Zuobing Xu, Ram Akella, Mike Ching, Renjie Tang
WWW
2005
ACM
16 years 3 months ago
The volume and evolution of web page templates
Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...
David Gibson, Kunal Punera, Andrew Tomkins
PAKDD
2009
ACM
15 years 9 months ago
Clustering Documents Using a Wikipedia-Based Concept Representation
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document clustering. We first create a concept-based document representation b...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
PKDD
2009
Springer
15 years 9 months ago
Enhanced Web Page Content Visualization with Firefox
This paper aims at presenting how natural language processing and machine learning techniques can help the internet surfer to get a better overview of the pages he is reading. The ...
Lorand Dali, Delia Rusu, Dunja Mladenic
PKDD
2009
Springer
15 years 9 months ago
Sparse Kernel SVMs via Cutting-Plane Training
We explore an algorithm for training SVMs with kernels that can represent the learned rule using arbitrary basis vectors, not just the support vectors (SVs) from the training set. ...
Thorsten Joachims, Chun-Nam John Yu