Search Sciweavers | Sciweavers

» Identifying Content Blocks from Web Documents

HT
2010
ACM

170 views, Internet Technology

Is this a good title?

15 years 8 months ago

Download www.cs.odu.edu

Missing web pages, URIs that return the 404 “Page Not Found” error or the HTTP response code 200 but dereference unexpected content, are ubiquitous in today’s browsing exper...

Martin Klein, Jeffery L. Shipman, Michael L. Nelso...

SIGIR
2005
ACM

156 views, Information Technology

Title extraction from bodies of HTML documents and its application to web page retrieval

15 years 8 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

SIGIR
2003
ACM

176 views, Information Technology

Building a web thesaurus from web link structure

15 years 8 months ago

Download research.microsoft.com

Thesaurus has been widely used in many applications, including information retrieval, natural language processing, and question answering. In this paper, we propose a novel approa...

Zheng Chen, Shengping Liu, Liu Wenyin, Geguang Pu,...

HT
2003
ACM

102 views, Internet Technology

Untangling compound documents on the web

15 years 8 months ago

Download mccurley.org

Most text analysis is designed to deal with the concept of a “document”, namely a cohesive presentation of thought on a unifying subject. By contrast, individual nodes on the ...

Nadav Eiron, Kevin S. McCurley

ICWE
2010
Springer

97 views, Internet Technology

Linking Related Documents: Combining Tag Clouds and Search Queries

15 years 1 month ago

Download austria-lexikon.at

Nowadays, Web encyclopedias suffer from a high bounce rate. Typically, users come to an encyclopaedia from a search engine and upon reading the first page on the site they leave it...

Christoph Trattner, Denis Helic
