Sciweavers

102 search results - page 19 / 21
» Named Entity Recognition for Digitised Historical Texts
Sort
View
FLAIRS
2003
13 years 10 months ago
Orthographic Case Restoration Using Supervised Learning Without Manual Annotation
One challenge in text processing is the treatment of case insensitive documents such as speech recognition results. The traditional approach is to re-train a language model exclud...
Cheng Niu, Wei Li 0003, Jihong Ding, Rohini K. Sri...
CSL
2008
Springer
13 years 8 months ago
A stopping criterion for active learning
Active learning (AL) is a framework that attempts to reduce the cost of annotating training material for statistical learning methods. While a lot of papers have been presented on...
Andreas Vlachos
DAS
2010
Springer
13 years 7 months ago
Automatic unsupervised parameter selection for character segmentation
A major difficulty for designing a document image segmentation methodology is the proper value selection for all involved parameters. This is usually done after experimentations o...
Georgios Vamvakas, Nikolaos Stamatopoulos, Basilio...
CSL
2004
Springer
13 years 8 months ago
Automatic capitalisation generation for speech input
Two different systems are proposed for the task of capitalisation generation. The first system is a slightly modified speech recogniser. In this system, every word in the vocabula...
Ji-Hwan Kim, Philip C. Woodland
CIKM
2009
Springer
14 years 3 months ago
Combining labeled and unlabeled data with word-class distribution learning
We describe a novel simple and highly scalable semi-supervised method called Word-Class Distribution Learning (WCDL), and apply it the task of information extraction (IE) by utili...
Yanjun Qi, Ronan Collobert, Pavel Kuksa, Koray Kav...