Sciweavers

WWW
2004
ACM

Combining link and content analysis to estimate semantic similarity

Download www.informatics.indiana.edu

Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The correlations between similarity measures based on these cues and on semantic associations between pages therefore crucially affect the performance of any search tool. Here I begin to quantitatively analyze the relationship between content, link, and semantic similarity measures across a massive number of Web page pairs. Maps of semantic similarity across textual and link similarity highlight the potential and limitations of lexical and link analysis for relevance approximation, and provide a way to study whether and how text-based and link-based measures should be combined.

Categories and Subject Descriptors: H.3.1 [Information Storage and Retrieval]: Content Analysis and Indexing; H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval

General Terms: Measurement
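
To make the idea of content and link similarity between a pair of pages concrete, here is a minimal Python sketch. The abstract does not specify the measures, so everything below is an assumption for illustration: content similarity is taken as the cosine between raw term-frequency vectors, link similarity as the Jaccard coefficient of the pages' link sets, and the combination as a simple weighted average. The function names and the `weight` parameter are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def content_similarity(text_a, text_b):
    """Cosine similarity between raw term-frequency vectors.

    Assumption: whitespace tokenization and lowercasing only; no stemming,
    stop-word removal, or TF-IDF weighting.
    """
    tf_a = Counter(text_a.lower().split())
    tf_b = Counter(text_b.lower().split())
    # Only terms present in both pages contribute to the dot product.
    dot = sum(tf_a[t] * tf_b[t] for t in tf_a.keys() & tf_b.keys())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def link_similarity(links_a, links_b):
    """Jaccard coefficient of two collections of linked URLs.

    Assumption: each page is represented by the set of URLs it links to
    or is linked from; how that set is built is left to the caller.
    """
    a, b = set(links_a), set(links_b)
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def combined_similarity(content_sim, link_sim, weight=0.5):
    """Weighted average of the two measures (one hypothetical combination)."""
    return weight * content_sim + (1.0 - weight) * link_sim


if __name__ == "__main__":
    c = content_similarity("web search ranking", "ranking web pages")
    l = link_similarity({"a.example", "b.example"}, {"b.example", "c.example"})
    print(c, l, combined_similarity(c, l))
```

A weighted average is only one possible way to combine the two signals. Whether and how they should be combined is the question the paper sets out to study, so this sketch should not be read as its answer.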

Filippo Menczer

Internet Technology | Link Based Measures | Link Similarity | Semantic Similarity Measures | WWW 2004

Related Content

» Combining Text and Link Analysis for Focused Crawling

» Algorithmic detection of semantic similarity

» Tracking news stories across different sources

» An Intrinsic Information Content Metric for Semantic Similarity in WordNet

» Terrorism and Crime Related Weblog Social Network Link Content Analysis and Information Vi...

» Local Probabilistic Models for Link Prediction

» Imagination Exploiting Link Analysis for Accurate Image Annotation

» Exploring Context and Content Links in Social Media A Latent Space Method

» TreePattern Similarity Estimation for Scalable Contentbased Routing

Post Info

Added	22 Nov 2009
Updated	22 Nov 2009
Type	Conference
Year	2004
Where	WWW
Authors	Filippo Menczer