Abstract. This paper presents the evaluation methods and the preliminary results of a combined thematic segmentation of (a) meeting documents and (b) meeting speech transcripts. Our ...
We show that it is possible to extend a general-purpose programming language with a convenient high-level data-type for manipulating XML documents while permitting (1) precise stat...
Aske Simon Christensen, Christian Kirkegaard, Ande...
There is a significant need to objectively evaluate layout analysis (page segmentation and region classification) methods. This paper describes the Page Segmentation Competition (...
This paper describes an investigation of authorship gender attribution mining from e-mail text documents. We used an extended set of predominantly topic content-free e-mail docume...
Malcolm Corney, Olivier Y. de Vel, Alison Anderson...