Statistical machine translation systems are usually trained on large amounts of bilingual text (used to learn a translation model), and also large amounts of monolingual text in th...
This paper describes a text normalization system for deletion-based abbreviations in informal text. We propose using statistical classifiers to learn the probability of deleting ...
Unsupervised learning algorithms have been derived for several statistical models of English grammar, but their computational complexity makes applying them to large data sets int...
Most of existing ontologies construction tools support construction of ontological relations (e.g. taxonomy, equivalence, etc.) but they do not support construction of domain rela...
Mohamed Yehia Dahab, Hesham A. Hassan, Ahmed A. Ra...
Cross-language Text Categorization is the task of assigning semantic classes to documents written in a target language (e.g. English) while the system is trained using labeled doc...