Search Sciweavers | Sciweavers

» Web Document Modeling

WWW 2002 (ACM)

Improvement of HITS-based algorithms on web documents

15 years 2 months ago

Download net.pku.edu.cn

In this paper, we present two ways to improve the precision of HITS-based algorithms on Web documents. First, by analyzing the limitations of current HITS-based algorithms, we pro...

Longzhuang Li, Yi Shang, Wei Zhang

CIKM 2003 (Springer)

Extracting unstructured data from template generated web documents

15 years 7 months ago

Download www.ir.iit.edu

We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...

Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...

EMNLP 2008

An Exploration of Document Impact on Graph-Based Multi-Document Summarization

15 years 4 months ago

Download www.aclweb.org

The graph-based ranking algorithm has recently been exploited for multi-document summarization by making use only of the sentence-to-sentence relationships in the documents, under...

Xiaojun Wan

LREC 2008

Unsupervised Relation Extraction From Web Documents

15 years 4 months ago

Download www.lrec-conf.org

The IDEX system is a prototype of an interactive dynamic Information Extraction (IE) system. A user of the system expresses an information request in the form of a topic descripti...

Kathrin Eichler, Holmer Hemsen, Günter Neuman...

WWW 2005 (ACM)

Extracting semantic structure of web documents using content and visual information

16 years 3 months ago

Download www2005.org

This work aims to provide a page segmentation algorithm which uses both visual and content information to extract the semantic structure of a web page. The visual information is u...

Rupesh R. Mehta, Pabitra Mitra, Harish Karnick
