Sciweavers

72 search results - page 5 / 15
» Automatic Selection of Table Areas in Documents for Informat...
EDBT
2009
ACM
14 years 2 months ago
High-performance information extraction with AliBaba
A wealth of information is available only in web pages, patents, publications etc. Extracting information from such sources is challenging, both due to the typically complex langu...
Peter Palaga, Long Nguyen, Ulf Leser, Jörg Ha...
IUI
2000
ACM
13 years 11 months ago
Enhancing information retrieval by automatic acquisition of textual relations using genetic programming
We have explored a novel method to find textual relations in electronic documents using genetic programming and semantic networks. This can be used for enhancing information retri...
Agneta Bergström, Patricija Jaksetic, Peter N...
WEBI
2005
Springer
14 years 26 days ago
A Semi-Supervised Document Clustering Algorithm Based on EM
Document clustering is a very hard task in Automatic Text Processing since it requires extracting regular patterns from a document collection without a priori knowledge on the cat...
Leonardo Rigutini, Marco Maggini
MIR
2005
ACM
14 years 28 days ago
Extracting information from multimedia meeting collections
Multimedia meeting collections, composed of unedited audio and video streams, handwritten notes, slides, and electronic documents that jointly constitute a raw record of complex h...
Daniel Gatica-Perez, Dong Zhang, Samy Bengio
ICDAR
2003
IEEE
14 years 20 days ago
A Constraint-based Approach to Table Structure Derivation
This paper presents an approach to deriving an abstract geometric model of a table from a physical representation. The technique developed uses a graph of constraints between cells which ...
Matthew Hurst