Identifying Content Blocks from Web Documents

ISMIS
2005
Springer

Identifying Content Blocks from Web Documents

Download clgiles.ist.psu.edu

Intelligent information processing systems, such as digital libraries or search engines, index web pages according to their informative content. However, web pages contain several n...

Sandip Debnath, Prasenjit Mitra, C. Lee Giles

WWW
2011
ACM

Identifying primary content from web pages and its application to web search ranking

Download www.www2011india.com

Web pages are usually highly structured documents. In some documents, content with different functionality is laid out in blocks, some merely supporting the main discourse. In ot...

Srinivas Vadrevu, Emre Velipasaoglu

KDD
2002
ACM

Discovering informative content blocks from Web documents

Download www.cs.ualberta.ca

In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...

Shian-Hua Lin, Jan-Ming Ho

BIS
2006

Expected Utility of Content Blocks in Web Content Extraction

Download integror.net

In this paper we discuss the possible application of new concepts in web content extraction: utility assessment, utility annealing, and dynamic aggregated document generation. Aft...

Marek Kowalkiewicz

HT
2005
ACM

As we may perceive: inferring logical documents from hypertext

Download www.cs.cornell.edu

In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...

Pavel Dmitriev, Carl Lagoze, Boris Suchkov
