Sciweavers

638 search results - page 118 / 128
» Scalable Techniques for Clustering the Web
DASFAA
2004
IEEE
13 years 11 months ago
Semi-supervised Text Classification Using Partitioned EM
Text classification using a small labeled set and a large unlabeled data set is seen as a promising technique to reduce the labor-intensive and time-consuming effort of labeling traini...
Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu
WWW
2008
ACM
14 years 8 months ago
Efficient similarity joins for near duplicate detection
With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...
Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...
IAT
2008
IEEE
14 years 2 months ago
Collective User Behaviour and Tag Contextualisation in Folksonomies
Collaborative tagging systems have emerged in recent years to become popular tools for organising information on the Web. While collaborative tagging offers many advantages, they ...
Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbo...
MTSR
2007
Springer
14 years 1 months ago
Creating and Querying an Integrated Ontology for Molecular and Phenotypic Cereals Data
In this paper we describe the development of an ontology of molecular and phenotypic cereals data, realized by integrating existing public web databases with the database developed...
Sonia Bergamaschi, Antonio Sala 0002
IV
2006
IEEE
14 years 1 months ago
AlViz - A Tool for Visual Ontology Alignment
We introduce a multiple-view tool called AlViz, which supports the alignment of ontologies visually. Ontologies play an important role for interoperability between organizations a...
Monika Lanzenberger, Jennifer Sampson