Sciweavers

» Extracting semantic structure of web documents using content...
WEBNET
2000
13 years 10 months ago
Models for User Access Patterns on the Web: Semantic Content versus Access History
This work focuses on clustering a site into groups of documents that are predictive of future user accesses. Two approaches have been developed and tested. The first approach use...
Arun Ross, Charles B. Owen, Aditya Vailaya
WWW
2006
ACM
14 years 9 months ago
Robust web content extraction
We present an empirical evaluation and comparison of two content extraction methods in HTML: absolute XPath expressions and relative XPath expressions. We argue that the relative ...
Marek Kowalkiewicz, Maria E. Orlowska, Tomasz Kacz...
EDBTW
2010
Springer
13 years 12 months ago
Text-to-query: dynamically building structured analytics to illustrate textual content
Successfully structuring information in databases, OLAP cubes, and XML is a crucial element in managing data nowadays. However, this process brought new challenges to usability. It...
Raphaël Thollot, Falk Brauer, Wojciech M. Bar...
CN
1999
13 years 8 months ago
Embedding Knowledge in Web Documents
The paper argues for the use of general and intuitive knowledge representation languages (and simpler notational variants, e.g. subsets of natural languages) for indexing the cont...
Philippe Martin, Peter W. Eklund
PRICAI
2000
Springer
14 years 7 days ago
Extracting Logical Schema from the Web
One of the main limitations when accessing the web is the lack of explicit structure, whose presence may help in understanding data semantics. Schema for web data can be constructe...
Vincenza Carchiolo, Alessandro Longheu, Michele Ma...