Sciweavers

338 search results - page 13 / 68
» Unsupervised Semantic Similarity Computation between Terms U...
Sort
View
ICDM
2008
IEEE
147views Data Mining» more  ICDM 2008»
14 years 3 months ago
Clustering Documents with Active Learning Using Wikipedia
Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
WWW
2004
ACM
14 years 9 months ago
Combining link and content analysis to estimate semantic similarity
Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The correlations between similarity measures based on these cues and on semantic ass...
Filippo Menczer
WWW
2005
ACM
14 years 9 months ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
CIKM
2008
Springer
13 years 10 months ago
A language for manipulating clustered web documents results
We propose a novel conception language for exploring the results retrieved by several internet search services (like search engines) that cluster retrieved documents. The goal is ...
Gloria Bordogna, Alessandro Campi, Giuseppe Psaila...
ICDE
2010
IEEE
200views Database» more  ICDE 2010»
13 years 8 months ago
Towards better entity resolution techniques for Web document collections
— As person names are non-unique, the same name on different Web pages might or might not refer to the same real-world person. This entity identification problem is one of the m...
Surender Reddy Yerva, Zoltán Miklós,...