Sciweavers

563 search results - page 41 / 113
» Assessing the Quality of Natural Language Text Data
Sort
View
EMNLP
2011
12 years 8 months ago
Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation
We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...
Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...

Book
695views
15 years 4 months ago
The Scheme Programming Language
"Scheme is a general-purpose computer programming language. It is a high-level language, supporting operations on structured data such as strings, lists, and vectors, as well ...
R. Kent Dybvig
ICDAR
2005
IEEE
14 years 2 months ago
Word Separation of Unconstrained Handwritten Text Lines in PCR Forms
An approach for segmenting handwritten text in a Pre-Hospital Care Report (PCR) is presented. Segmentation of lines and words in a PCR is extremely challenging due to the nature o...
Ifeoma Nwogu, Gyeonghwan Kim
DBISP2P
2008
Springer
124views Database» more  DBISP2P 2008»
13 years 10 months ago
Exploiting Distribution Skew for Scalable P2P Text Clustering
K-Means clustering is widely used in information retrieval and data mining. Distributed K-Means variants have already been proposed, but none of the past algorithms scales to large...
Odysseas Papapetrou, Wolf Siberski, Fabian Leitrit...
DOCENG
2005
ACM
13 years 10 months ago
Integrating translation services within a structured editor
Fully automatic machine translation cannot produce high quality translation; Dialog-Based Machine Translation (DBMT) is the only way to provide authors with a means of translating...
Ali Choumane, Hervé Blanchon, Cécile...