This paper addresses the problem of extracting information from textual documents, either normal documents or web pages. A new approach for extracting complicate information from ...
Luo Xiao, Dieter Wissmann, Michael Brown, Stefan J...
Abstract. Our unique approach for learning English grapheme segmentation (LE-GS) rules using the Iterated Version Space Algorithm (IVSA) is presented. After de ning the problem and...
Jianna Jian Zhang, Howard J. Hamilton, Nick Cercon...
or untagged treebanks. ' When trained on an untagged This paper presents a method for constructing deterministic Prolog parsers from corpora of parsed sentences. Our approach ...
This paper presents a character segmentation algorithm for unconstrained cursive handwritten text. The transformation-based learning method and a simplified variation of it are us...
Paraphrase detection can be seen as the task of aligning sentences that convey the same information but yet are written in different forms. Such resources are important to automat...