Sciweavers

563 search results - page 26 / 113
» Assessing the Quality of Natural Language Text Data
Sort
View
EMNLP
2010
13 years 6 months ago
Negative Training Data Can be Harmful to Text Classification
This paper studies the effects of training data on binary text classification and postulates that negative training data is not needed and may even be harmful for the task. Tradit...
Xiaoli Li, Bing Liu, See-Kiong Ng
FINTAL
2006
14 years 12 days ago
Morphological Lexicon Extraction from Raw Text Data
The tool extract enables the automatic extraction of lemma-paradigm pairs from raw text data. The tool uses search patterns that consist of regular expressions and propositional lo...
Markus Forsberg, Harald Hammarström, Aarne Ra...
DAWAK
2008
Springer
13 years 10 months ago
The Evaluation of Sentence Similarity Measures
The ability to accurately judge the similarity between natural language sentences is critical to the performance of several applications such as text mining, question answering, an...
Palakorn Achananuparp, Xiaohua Hu, Xiajiong Shen
EMNLP
2010
13 years 6 months ago
Assessing Phrase-Based Translation Models with Oracle Decoding
Extant Statistical Machine Translation (SMT) systems are very complex softwares, which embed multiple layers of heuristics and embark very large numbers of numerical parameters. A...
Guillaume Wisniewski, Alexandre Allauzen, Fran&cce...
CORR
2002
Springer
93views Education» more  CORR 2002»
13 years 8 months ago
Ellogon: A New Text Engineering Platform
This paper presents Ellogon, a multi-lingual, cross-platform, general-purpose text engineering environment. Ellogon was designed in order to aid both researchers in natural langua...
Georgios Petasis, Vangelis Karkaletsis, Georgios P...