Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

168

ICAPR
2005
Springer

130views Pattern Recognition» more ICAPR 2005»

Combining Text and Link Analysis for Focused Crawling

16 years 6 days ago

Combining Text and Link Analysis for Focused Crawling

Download poseidon.csd.auth.gr

The number of vertical search engines and portals has rapidly increased over the last years, making the importance of a topic-driven (focused) crawler evident. In this paper, we develop a latent semantic indexing classiﬁer that combines link analysis with text content in order to retrieve and index domain speciﬁc web documents. We compare its eﬃciency with other well-known web information retrieval techniques. Our implementation presents a diﬀerent approach to focused crawling and aims to overcome the limitations of the neccesity to provide initial training data while maintaining a high recall/precision ratio.

George Almpanidis, Constantine Kotropoulos

Real-time Traffic

ICAPR 2005 | Index Domain Speciﬁc | Latent Semantic Indexing | Vertical Search Engines |

claim paper

Related Content

» Focused Crawling Using Latent Semantic Indexing An Application for Vertical Search Engine...

» Geographically focused collaborative crawling

» Combining link and content analysis to estimate semantic similarity

» Focused Crawling in Depression Portal Search A Feasibility Study

» Accelerated focused crawling through online relevance feedback

» Improving Knowledge Discovery in Document Collections through Combining Text Retrieval and...

» Focused crawling for both topical relevance and quality of medical information

» Evaluation Methods for Focused Crawling

» Effective webscale crawling through website analysis

Post Info
More Details (n/a)

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ICAPR
Authors	George Almpanidis, Constantine Kotropoulos

Comments (0)