Search Sciweavers | Sciweavers

1319 search results - page 13 / 264

» Using the Structure of HTML Documents to Improve Retrieval

193

click to vote

SIGMOD
2009
ACM

140views Database» more SIGMOD 2009»

Robust web extraction: an approach based on a probabilistic tree-edit model

16 years 26 days ago

Download www-rcf.usc.edu

On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to eﬀectively extract information of interest. Of course, the scripts and thus ...

Nilesh N. Dalvi, Philip Bohannon, Fei Sha

claim paper

Read More »

169

click to vote

APWEB
2003
Springer

148views Internet Technology» more APWEB 2003»

Extracting Content Structure for Web Pages Based on Visual Representation

15 years 11 months ago

Download www.dbs.ifi.lmu.de

Abstract. A new web content structure based on visual representation is proposed in this paper. Many web applications such as information retrieval, information extraction and auto...

Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma

claim paper

Read More »

179

click to vote

WWW
2002
ACM

130views Internet Technology» more WWW 2002»

Using web structure for classifying and describing web pages

16 years 6 months ago

Download dpennock.com

The structure of the web is increasingly being used to improve organization, search, and analysis of information on the web. For example, Google uses the text in citing documents ...

Eric J. Glover, Kostas Tsioutsiouliklis, Steve Law...

claim paper

Read More »

175

click to vote

IPM
2000

76views more IPM 2000»

Structured storage and retrieval of SGML documents using Grove

15 years 5 months ago

Download sclab.yonsei.ac.kr

SGML standardized in ISO 8879 [International Organization for Standardization (1986)] has been proliferated because it can provide various styles and transform documents on dieren...

Hak-Gyoon Kim, Sung-Bae Cho

claim paper

Read More »

158

click to vote

EWMF
2005
Springer

161views Internet Technology» more EWMF 2005»

Information Retrieval in Trust-Enhanced Document Networks

15 years 11 months ago

Download www.uni-bamberg.de

Abstract. To ﬁght the problem of information overload in huge information sources like large document repositories, e. g. citeseer, or internet websites you need a selection crit...

Klaus Stein, Claudia Hess

claim paper

Read More »

« Prev « First page 13 / 264 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers