Search Sciweavers | Sciweavers

498 search results - page 17 / 100

» Robust web content extraction

185

click to vote

SYRCODIS
2007

124views Database» more SYRCODIS 2007»

Recommender System Based on User-generated Content

15 years 7 months ago

Download sunsite.informatik.rwth-aachen.de

Recommender systems apply statistical and knowledge discovery techniques to the problem of making recommendations during live user interaction. This paper describes a novel approa...

Denis Turdakov

claim paper

Read More »

214

Voted

WWW
2007
ACM

144views Internet Technology» more WWW 2007»

Towards domain-independent information extraction from web tables

16 years 7 months ago

Download www2007.org

Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...

Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...

claim paper

Read More »

154

click to vote

DIS
2001
Springer

93views Theoretical Computer Science» more DIS 2001»

Eliminating Useless Parts in Semi-structured Documents Using Alternation Counts

15 years 11 months ago

Download www.i.kyushu-u.ac.jp

We propose a preprocessing method for Web mining which, given semi-structured documents with the same structure and style, distinguishes useless parts and non-useless parts in each...

Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa

claim paper

Read More »

186

click to vote

ISIWI
2000

126views Knowledge Management» more ISIWI 2000»

Aiding Web Searches by Statistical Classification Tools

15 years 8 months ago

Download www.informationswissenschaft.org

We describe an infrastructure for the collection and management of large amounts of text, and discuss the possibility of information extraction and visualisation from text corpora...

Gerhard Heyer, Uwe Quasthoff, Christian Wolff

claim paper

Read More »

228

click to vote

WWW
2011
ACM

293views Internet Technology» more WWW 2011»

Web information extraction using Markov logic networks

15 years 1 months ago

Download www.it.iitb.ac.in

In this paper, we consider the problem of extracting structured data from web pages taking into account both the content of individual attributes as well as the structure of pages...

Sandeepkumar Satpal, Sahely Bhadra, Sundararajan S...

claim paper

Read More »

« Prev « First page 17 / 100 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers