Sciweavers

563 search results - page 20 / 113
» Assessing the Quality of Natural Language Text Data
Sort
View
KDD
2003
ACM
118views Data Mining» more  KDD 2003»
14 years 9 months ago
Generating English summaries of time series data using the Gricean maxims
We are developing technology for generating English textual summaries of time-series data, in three domains: weather forecasts, gas-turbine sensor readings, and hospital intensive...
Somayajulu Sripada, Ehud Reiter, Jim Hunter, Jin Y...
CICLING
2006
Springer
14 years 13 days ago
Improving kNN Text Categorization by Removing Outliers from Training Set
We show that excluding outliers from the training data significantly improves kNN classifier, which in this case performs about 10% better than the best know method--Centroid-based...
Kwangcheol Shin, Ajith Abraham, Sang-Yong Han
WWW
2008
ACM
14 years 9 months ago
Size matters: word count as a measure of quality on wikipedia
Wikipedia, "the free encyclopedia", now contains over two million English articles, and is widely regarded as a highquality, authoritative encyclopedia. Some Wikipedia a...
Joshua E. Blumenstock
EMNLP
2010
13 years 6 months ago
Automatic Evaluation of Translation Quality for Distant Language Pairs
Automatic evaluation of Machine Translation (MT) quality is essential to developing highquality MT systems. Various evaluation metrics have been proposed, and BLEU is now used as ...
Hideki Isozaki, Tsutomu Hirao, Kevin Duh, Katsuhit...
EWNLG
1993
14 years 25 days ago
Choosing a Set of Coherence Relations for Text Generation: A Data-Driven Approach
Abstract. An active research programme in Natural Language Generation has grown up around the notion of `coherence relations'. Relations are being used in a variety of roles i...
Alistair Knott, Robert Dale