Sciweavers

Search results for "Clustering XML Documents by Structure" (700 results, page 130 of 140)
ECIR
2003
Springer
13 years 11 months ago
Representative Sampling for Text Classification Using Support Vector Machines
In order to reduce human efforts, there has been increasing interest in applying active learning for training text classifiers. This paper describes a straightforward active learni...
Zhao Xu, Kai Yu, Volker Tresp, Xiaowei Xu, Jizhi W...
DEXAW
1999
IEEE
14 years 2 months ago
Textual Similarities Based on a Distributional Approach
The design of efficient textual similarities is an important issue in the domain of textual data exploration. Textual similarities are for example central in document collection s...
Romaric Besançon, Martin Rajman, Jean-Cé...
CIKM
2008
Springer
13 years 11 months ago
Learning to link with Wikipedia
This paper describes how to automatically cross-reference documents with Wikipedia: the largest knowledge base ever known. It explains how machine learning can be used to identify...
David N. Milne, Ian H. Witten
FCSC
2010
13 years 7 months ago
Knowledge discovery through directed probabilistic topic models: a survey
Graphical models have become the basic framework for topic based probabilistic modeling. Especially models with latent variables have proved to be effective in capturing hidden str...
Ali Daud, Juanzi Li, Lizhu Zhou, Faqir Muhammad
WWW
2004
ACM
14 years 10 months ago
OntoMiner: bootstrapping ontologies from overlapping domain specific web sites
In this paper, we present automated techniques for bootstrapping and populating specialized domain ontologies by organizing and mining a set of relevant overlapping Web sites prov...
Hasan Davulcu, Srinivas Vadrevu, Saravanakumar Nag...