Sciweavers

» Document structure analysis algorithms: a literature survey
DGO
2006
Multidimensional text analysis for eRulemaking
To support rule-writers, we are developing techniques to automatically analyze a large number of public comments on proposed regulations. A document is analyzed in various ways incl...
Namhee Kwon, Stuart W. Shulman, Eduard H. Hovy
CIKM
2008
Springer
Mapping enterprise entities to text segments
Today, valuable business information is increasingly stored as unstructured data (documents, emails, etc.). For example, documents exchanged between business partners capture info...
Falk Brauer, Alexander Löser, Hong-Hai Do
DAS
2008
Springer
On the Reading of Tables of Contents
This paper presents a framework for understanding tables of contents (TOC) of books, journals, and magazines. We propose a universal logical structure representation in terms of a...
Prateek Sarkar, Eric Saund
LREC
2008
Characterization of Scientific and Popular Science Discourse in French, Japanese and Russian
We aim to characterize the comparability of corpora. We address this issue in a trilingual context through the distinction between expert and non-expert documents. We work separately...
Lorraine Goeuriot, Natalia Grabar, Béatrice...
DOCENG
2005
ACM
Injecting information into atomic units of text
This paper presents a new approach to text processing, based on textemes. These are atomic text units generalising the concepts of character and glyph by merging them in a common ...
Yannis Haralambous, Gábor Bella