Sciweavers

577 search results - page 67 / 116
» Improved Text Generation Using N-gram Statistics
Sort
View
ISIWI
2000
13 years 11 months ago
Automatic Document Classification - A thorough Evaluation of various Methods
(Automatic) document classification is generally defined as content-based assignment of one or more predefined categories to documents. Usually, machine learning, statistical patt...
Christoph Goller, J. Löning, T. Will, W. Wolf...
AND
2009
13 years 7 months ago
Tools for monitoring, visualizing, and refining collections of noisy documents
Developing better systems for document image analysis requires understanding errors, their sources, and their effects. The interactions between various processing steps are comple...
Daniel P. Lopresti, George Nagy
EMNLP
2009
13 years 7 months ago
Discriminative Corpus Weight Estimation for Machine Translation
Current statistical machine translation (SMT) systems are trained on sentencealigned and word-aligned parallel text collected from various sources. Translation model parameters ar...
Spyros Matsoukas, Antti-Veikko I. Rosti, Bing Zhan...
ICML
2007
IEEE
14 years 10 months ago
Self-taught learning: transfer learning from unlabeled data
We present a new machine learning framework called "self-taught learning" for using unlabeled data in supervised classification tasks. We do not assume that the unlabele...
Rajat Raina, Alexis Battle, Honglak Lee, Benjamin ...
SAC
2004
ACM
14 years 3 months ago
Guiding motif discovery by iterative pattern refinement
In this paper, we demonstrate that the performance of a motif discovery algorithm can be significantly improved by embedding it into a novel framework that effectively guides the ...
Zhiping Wang, Mehmet M. Dalkilic, Sun Kim