Search Sciweavers | Sciweavers

131 search results - page 8 / 27

» Ranking-Constrained Keyword Sequence Extraction from Web Doc...

192

click to vote

SIGIR
2002
ACM

130views Information Technology» more SIGIR 2002»

Finding relevant documents using top ranking sentences: an evaluation of two alternative schemes

15 years 7 months ago

Download research.microsoft.com

In this paper we present an evaluation of techniques that are designed to encourage web searchers to interact more with the results of a web search. Two specific techniques are ex...

Ryen White, Ian Ruthven, Joemon M. Jose

claim paper

Read More »

242

click to vote

WWW
2005
ACM

154views Internet Technology» more WWW 2005»

Thresher: automating the unwrapping of semantic content from the World Wide Web

16 years 8 months ago

Download www2005.org

We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...

Andrew Hogue, David R. Karger

claim paper

Read More »

189

Voted

WWW
2009
ACM

142views Internet Technology» more WWW 2009»

Estimating web site readability using content extraction

16 years 8 months ago

Download www2009.eprints.org

Nowadays, information is primarily searched on the WWW. From a user perspective, the readability is an important criterion for measuring the accessibility and thereby the quality ...

Thomas Gottron, Ludger Martin

claim paper

Read More »

239

click to vote

WWW
2009
ACM

189views Internet Technology» more WWW 2009»

Extracting data records from the web using tag path clustering

16 years 3 days ago

Download www2009.org

Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the ﬁrst step of this object extraction process, identiﬁes...

Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...

claim paper

Read More »

213

click to vote

WWW
2009
ACM

213views Internet Technology» more WWW 2009»

Extracting article text from the web with maximum subsequence segmentation

16 years 8 months ago

Download www2009.org

Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...

Jeff Pasternack, Dan Roth

claim paper

Read More »

« Prev « First page 8 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers