Search Sciweavers | Sciweavers

211 search results - page 23 / 43

» Language Models for Searching in Web Corpora

146

click to vote

ECIR
2010
Springer

187views Information Technology» more ECIR 2010»

Analyzing Information Retrieval Methods to Recover Broken Web Links

15 years 4 months ago

Download nlp.uned.es

In this work we compare different techniques to automatically find candidate web pages to substitute broken links. We extract information from the anchor text, the content of the p...

Juan Martinez-Romo, Lourdes Araujo

claim paper

Read More »

140

click to vote

SIGIR
2011
ACM

188views Information Technology» more SIGIR 2011»

Parameterized concept weighting in verbose queries

14 years 7 months ago

Download ciir.cs.umass.edu

The majority of the current information retrieval models weight the query concepts (e.g., terms or phrases) in an unsupervised manner, based solely on the collection statistics. I...

Michael Bendersky, Donald Metzler, W. Bruce Croft

claim paper

Read More »

149

click to vote

WWW
2009
ACM

181views Internet Technology» more WWW 2009»

Combining anchor text categorization and graph analysis for paid link detection

16 years 5 months ago

Download www2009.org

In order to artificially boost the rank of commercial pages in search engine results, search engine optimizers pay for links to these pages on other websites. Identifying paid lin...

Kirill Nikolaev, Ekaterina Zudina, Andrey Gorshkov

claim paper

Read More »

100

click to vote

SIGIR
2004
ACM

112views Information Technology» more SIGIR 2004»

Web-a-where: geotagging web content

15 years 10 months ago

Download einat.webir.org

We describe Web-a-Where, a system for associating geography with Web pages. Web-a-Where locates mentions of places and determines the place each name refers to. In addition, it as...

Einat Amitay, Nadav Har'El, Ron Sivan, Aya Soffer

claim paper

Read More »

119

click to vote

ADC
2000
Springer

82views Database» more ADC 2000»

Querying Databases of Annotated Speech

15 years 8 months ago

Download papers.ldc.upenn.edu

Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic ‘transcriptions’. Such databases are typically multidimensional, heterogeneou...

Steve Cassidy, Steven Bird

claim paper

Read More »

« Prev « First page 23 / 43 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers