Sciweavers

650 search results - page 32 / 130
» Challenges in Natural Language Processing: The Case of Metap...
Sort
View
FINTAL
2006
14 years 21 days ago
MEDITE: A Unilingual Textual Aligner
This paper addresses a problem of natural language text alignment, from a humanities discipline called textual genetic criticism where different text versions must be compared. The...
Julien Bourdaillet, Jean-Gabriel Ganascia
IJCNLP
2005
Springer
14 years 2 months ago
Instance-Based Generation for Interactive Restricted Domain Question Answering Systems
Abstract. One important component of interactive systems is the generation component. While template-based generation is appropriate in many cases (for example, task oriented spoke...
Matthias Denecke, Hajime Tsukada
CICLING
2006
Springer
14 years 23 days ago
Improving kNN Text Categorization by Removing Outliers from Training Set
We show that excluding outliers from the training data significantly improves kNN classifier, which in this case performs about 10% better than the best know method--Centroid-based...
Kwangcheol Shin, Ajith Abraham, Sang-Yong Han
EMNLP
2009
13 years 6 months ago
Unsupervised Tokenization for Machine Translation
Training a statistical machine translation starts with tokenizing a parallel corpus. Some languages such as Chinese do not incorporate spacing in their writing system, which creat...
Tagyoung Chung, Daniel Gildea
WWW
2011
ACM
13 years 4 months ago
Web scale NLP: a case study on url word breaking
This paper uses the URL word breaking task as an example to elaborate what we identify as crucialin designingstatistical natural language processing (NLP) algorithmsfor Web scale ...
Kuansan Wang, Christopher Thrasher, Bo-June Paul H...