Sciweavers

» Robust web content extraction
SYRCODIS
2007
13 years 8 months ago
Recommender System Based on User-generated Content
Recommender systems apply statistical and knowledge discovery techniques to the problem of making recommendations during live user interaction. This paper describes a novel approa...
Denis Turdakov
WWW
2007
ACM
14 years 7 months ago
Towards domain-independent information extraction from web tables
Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...
Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...
DIS
2001
Springer
13 years 11 months ago
Eliminating Useless Parts in Semi-structured Documents Using Alternation Counts
We propose a preprocessing method for Web mining which, given semi-structured documents with the same structure and style, distinguishes useless parts and non-useless parts in each...
Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa
ISIWI
2000
13 years 8 months ago
Aiding Web Searches by Statistical Classification Tools
We describe an infrastructure for the collection and management of large amounts of text, and discuss the possibility of information extraction and visualisation from text corpora...
Gerhard Heyer, Uwe Quasthoff, Christian Wolff
WWW
2011
ACM
13 years 1 months ago
Web information extraction using Markov logic networks
In this paper, we consider the problem of extracting structured data from web pages taking into account both the content of individual attributes as well as the structure of pages...
Sandeepkumar Satpal, Sahely Bhadra, Sundararajan S...