Sciweavers

2137 search results - page 121 / 428
» Extraction of Structural Information from the Web
Sort
View
WISE
2005
Springer
15 years 10 months ago
Semantic Partitioning of Web Pages
In this paper we describe the semantic partitioner algorithm, that uses the structural and presentation regularities of the Web pages to automatically transform them into hierarchi...
Srinivas Vadrevu, Fatih Gelgi, Hasan Davulcu
WIDM
2005
ACM
15 years 10 months ago
Web path recommendations based on page ranking and Markov models
Markov models have been widely used for modelling users' navigational behaviour in the Web graph, using the transitional probabilities between web pages, as recorded in the w...
Magdalini Eirinaki, Michalis Vazirgiannis, Dimitri...
VLDB
2004
ACM
121views Database» more  VLDB 2004»
15 years 9 months ago
An Automatic Data Grabber for Large Web Sites
We demonstrate a system to automatically grab data from data intensive web sites. The system first infers a model that describes at the intensional level the web site as a collec...
Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...
ICDAR
2003
IEEE
15 years 9 months ago
Automated Detection and Segmentation of Table of Contents Page from Document Images
With an aim to extract the structural information from the table of contents (TOC) to help develop digital document library the requirement of identifying/segmenting the TOC page ...
S. Mandal, S. P. Chowdhury, Amit Kumar Das, Bhabat...
LREC
2010
183views Education» more  LREC 2010»
15 years 6 months ago
Extracting Lexico-conceptual Knowledge for Developing Persian WordNet
Semantic lexicons and lexical ontologies are some major resources in natural language processing. Developing such resources are time consuming tasks for which some automatic metho...
Mehrnoush Shamsfard, Hakimeh Fadaei, Elham Fekri