Sciweavers

211 search results - page 23 / 43
» Language Models for Searching in Web Corpora
ECIR
2010
Springer
13 years 8 months ago
Analyzing Information Retrieval Methods to Recover Broken Web Links
In this work we compare different techniques to automatically find candidate web pages to substitute broken links. We extract information from the anchor text, the content of the p...
Juan Martinez-Romo, Lourdes Araujo
SIGIR
2011
ACM
12 years 11 months ago
Parameterized concept weighting in verbose queries
The majority of the current information retrieval models weight the query concepts (e.g., terms or phrases) in an unsupervised manner, based solely on the collection statistics. I...
Michael Bendersky, Donald Metzler, W. Bruce Croft
WWW
2009
ACM
14 years 9 months ago
Combining anchor text categorization and graph analysis for paid link detection
In order to artificially boost the rank of commercial pages in search engine results, search engine optimizers pay for links to these pages on other websites. Identifying paid lin...
Kirill Nikolaev, Ekaterina Zudina, Andrey Gorshkov
SIGIR
2004
ACM
14 years 2 months ago
Web-a-where: geotagging web content
We describe Web-a-Where, a system for associating geography with Web pages. Web-a-Where locates mentions of places and determines the place each name refers to. In addition, it as...
Einat Amitay, Nadav Har'El, Ron Sivan, Aya Soffer
ADC
2000
Springer
14 years 29 days ago
Querying Databases of Annotated Speech
Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic ‘transcriptions’. Such databases are typically multidimensional, heterogeneou...
Steve Cassidy, Steven Bird