Search Sciweavers | Sciweavers

368 search results - page 13 / 74

» Template-Based Information Mining from HTML Documents

122

click to vote

PVLDB
2010

135views more PVLDB 2010»

SXPath - Extending XPath towards Spatial Querying on Web Documents

15 years 1 months ago

Download www.vldb.org

Querying data from presentation formats like HTML, for purposes such as information extraction, requires the consideration of tree structures as well as the consideration of spati...

Ermelinda Oro, Massimo Ruffolo, Steffen Staab

claim paper

Read More »

121

click to vote

AWIC
2003
Springer

140views Internet Technology» more AWIC 2003»

Web Page Classification: A Soft Computing Approach

15 years 8 months ago

Download gavab.escet.urjc.es

The Internet makes it possible to share and manipulate a vast quantity of information efficiently and effectively, but the rapid and chaotic growth experienced by the Net has gener...

Angela Ribeiro, Víctor Fresno, Maria C. Gar...

claim paper

Read More »

143

click to vote

PAKDD
2009
ACM

116views Data Mining» more PAKDD 2009»

Scalable Web Mining with Newistic

15 years 10 months ago

Download www.horatiumocian.com

Abstract. Newistic is a web mining platform that collects and analyses documents crawled from the Internet. Although it currently processes news articles, it can be easily adapted ...

Ovidiu Dan, Horatiu Mocian

claim paper

Read More »

144

click to vote

WWW
2007
ACM

144views Internet Technology» more WWW 2007»

Towards domain-independent information extraction from web tables

16 years 4 months ago

Download www2007.org

Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...

Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...

claim paper

Read More »

102

click to vote

DKE
1998

146views more DKE 1998»

A Case study of Automatic Authoring: From a Textbook to a Hyper-Textbook

15 years 3 months ago

Download personal.cis.strath.ac.uk

This paper presents a case-study of automatic construction of a hypertext from a large full-text document. The document we used as input of the automatic authoring process is a we...

Fabio Crestani, Massimo Melucci

claim paper

Read More »

« Prev « First page 13 / 74 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers