Search Sciweavers | Sciweavers

2423 search results - page 87 / 485

» Hypertext Information Retrieval for the Web

EDBT
2006
ACM

Indexing Shared Content in Information Retrieval Systems

16 years 4 months ago

Download fontoura.org

Abstract. Modern document collections often contain groups of documents with overlapping or shared content. However, most information retrieval systems process each document separa...

Andrei Z. Broder, Nadav Eiron, Marcus Fontoura, Mi...

CIKM
2003
Springer

Extracting unstructured data from template generated web documents

15 years 9 months ago

Download www.ir.iit.edu

We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...

Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...

WWW
2008
ACM

Recrawl scheduling based on information longevity

16 years 5 months ago

Download www2008.org

It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...

Christopher Olston, Sandeep Pandey

LREC
2008

Experiments to Investigate the Connection between Case Distribution and Topical Relevance of Search Terms in an Information Retr

15 years 6 months ago

Download www.lrec-conf.org

We have performed a set of experiments made to investigate the utility of morphological analysis to improve retrieval of documents written in languages with relatively large morph...

Jussi Karlgren, Hercules Dalianis, Bart Jongejan

SAINT
2007
IEEE

A Generic API for Retrieving Human-Oriented Information from Social Network Services

15 years 10 months ago

Download iplab.aist-nara.ac.jp

A unique type of Web service, called a Social Network Service (SNS), first appeared in 2003. Some researches suggested a method to extract meaningful information from SNSs. Such ...

Teruaki Yokoyama, Shigeru Kashihara, Takeshi Okuda...
