Sciweavers

563 search results - page 9 / 113
» Assessing the Quality of Natural Language Text Data
ACL
2001
13 years 9 months ago
Scaling to Very Very Large Corpora for Natural Language Disambiguation
The amount of readily available on-line text has reached hundreds of billions of words and continues to grow. Yet for most core natural language tasks, algorithms continue to be o...
Michele Banko, Eric Brill
EACL
2010
ACL Anthology
13 years 9 months ago
Probabilistic Approaches for Modeling Text Structure and Their Application to Text-to-Text Generation
Since the early days of generation research, it has been acknowledged that modeling the global structure of a document is crucial for producing coherent, readable output....
Regina Barzilay
ICPR
2004
IEEE
14 years 8 months ago
Off-line Handwritten Textline Recognition Using a Mixture of Natural and Synthetic Training Data
In this paper the problem of off-line handwritten cursive text recognition is considered. A method for expanding the set of available training textlines by applying random perturb...
Tamás Varga, Horst Bunke
CVPR
2010
IEEE
14 years 3 months ago
Detecting Text in Natural Scenes with Stroke Width Transform
We present a novel image operator that seeks to find the value of stroke width for each image pixel, and demonstrate its use on the task of text detection in natural images. The s...
Boris Epshtein, Eyal Ofek, Yonatan Wexler
IQ
2007
13 years 9 months ago
Quality Of Data, Information And Knowledge In Technology Foresight Processes
Futures research means observation and understanding of today’s operational environment as well as identification and positioning of future opportunities. Lots of technology fo...
Helinä Melkas, Tuomo Uotila