Sciweavers

2827 search results - page 113 / 566
» Marking Text Documents
Sort
View
ICDAR
2005
IEEE
14 years 1 months ago
Text Degradations and OCR Training
Printing and scanning of text documents introduces degradations to the characters which can be modeled. Interestingly, certain combinations of the parameters that govern the degra...
Elisa H. Barney Smith, Tim L. Andersen
TSD
2009
Springer
14 years 21 days ago
Combining Text Vector Representations for Information Retrieval
Abstract. This paper suggests a novel representation for documents that is intended to improve precision. This representation is generated by combining two central techniques: Rand...
Maya Carrillo, Chris Eliasmith, Aurelio Lóp...
ACL
2008
13 years 9 months ago
Automatic Editing in a Back-End Speech-to-Text System
Written documents created through dictation differ significantly from a true verbatim transcript of the recorded speech. This poses an obstacle in automatic dictation systems as s...
Maximilian Bisani, Paul Vozila, Olivier Divay, Jef...
LWA
2008
13 years 9 months ago
Rule-Based Information Extraction for Structured Data Acquisition using TextMarker
Information extraction is concerned with the location of specific items in (unstructured) textual documents, e.g., being applied for the acquisition of structured data. Then, the ...
Martin Atzmüller, Peter Klügl, Frank Pup...
KYOTODL
2000
140views more  KYOTODL 2000»
13 years 9 months ago
Text Data Mining: Discovery of Important Keywords in the Cyberspace
This paper describes applications of the optimized pattern discover),framework to text and Webmining. In particular; we introduce a class of simple combinatorialpatterns over phra...
Hiroki Arimura, Jun-ichiro Abe, Hiroshi Sakamoto, ...