Search Sciweavers | Sciweavers

2337 search results - page 22 / 468

» Extracting Sequences from the Web

SIGMOD
2003
ACM

190 views, Database

Extracting Structured Data from Web Pages

15 years 11 months ago

Download infolab.stanford.edu

Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...

Arvind Arasu, Hector Garcia-Molina

RIAO
1997

350 views, Information Technology

Coupling information retrieval and information extraction: A new text technology for gathering information from the web

15 years 7 months ago

Download reference.kfupm.edu.sa

The techniques of information retrieval and information extraction are complementary, but to date there has been little concrete work aimed at integrating the two. We describe how...

Robert J. Gaizauskas, Alexander M. Robertson

WWW
2006
ACM

147 views, Internet Technology

POLYPHONET: an advanced social network extraction system from the web

16 years 6 months ago

Download www2006.org

Social networks play important roles in the Semantic Web: knowledge management, information retrieval, ubiquitous computing, and so on. We propose a social network extraction syst...

Hideaki Takeda, Junichiro Mori, Kôiti Hasida...

SIGIR
2005
ACM

156 views, Information Technology

Title extraction from bodies of HTML documents and its application to web page retrieval

15 years 11 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

LREC
2008

160 views, Education

Automatic Extraction of Textual Elements from News Web Pages

15 years 7 months ago

Download www.lrec-conf.org

In this paper we present an algorithm for automatic extraction of textual elements, namely titles and full text, associated with news stories in news web pages. We propose a super...

Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany
