Search Sciweavers | Sciweavers

368 search results - page 16 / 74

» Template-Based Information Mining from HTML Documents

click to vote

SIGIR
2008
ACM

89views Information Technology» more SIGIR 2008»

XML-aided phrase indexing for hypertext documents

15 years 3 months ago

Download www.cs.helsinki.fi

We combine techniques of XML Mining and Text Mining for the benefit of Information Retrieval. By manipulating the word sequence according to the XML structure of the marked-up tex...

Miro Lehtonen, Antoine Doucet

claim paper

Read More »

120

click to vote

ADC
2006
Springer

139views Database» more ADC 2006»

Peer-to-peer form based web information systems

15 years 9 months ago

Download eprints.usq.edu.au

The World Wide Web revolutionized the use of forms in everyday private and business life by allowing a move away from paper forms to easily accessible digital forms. Data captured...

Stijn Dekeyser, Jan Hidders, Richard Watson, Ron A...

claim paper

Read More »

136

click to vote

ASP
2005
Springer

288views Automated Reasoning» more ASP 2005»

Exploiting ASP for Semantic Information Extraction

15 years 5 months ago

Download ftp.informatik.rwth-aachen.de

Abstract. The paper describes HıLεX, a new ASP-based system for the extraction of information from unstructured documents. Unlike previous systems, which are mainly syntactic, H�...

Massimo Ruffolo, Nicola Leone, Marco Manna, Domeni...

claim paper

Read More »

166

click to vote

JOT
2008

136views more JOT 2008»

The Stock Statistics Parser

15 years 3 months ago

Download www.jot.fm

This paper describes how use the HTMLEditorKit to perform web data mining on stock statistics for listed firms. Our focus is on making use of the web to get information about comp...

Douglas Lyon

claim paper

Read More »

130

Voted

DOCENG
2009
ACM

166views Document Analysis» more DOCENG 2009»

Object-level document analysis of PDF files

15 years 9 months ago

Download www.dbai.tuwien.ac.at

The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...

Tamir Hassan

claim paper

Read More »

« Prev « First page 16 / 74 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers