Sciweavers

281 search results - page 50 / 57
» Introducing the Enron Corpus
Sort
View
ACL
2010
13 years 5 months ago
Bilingual Lexicon Generation Using Non-Aligned Signatures
Bilingual lexicons are fundamental resources. Modern automated lexicon generation methods usually require parallel corpora, which are not available for most language pairs. Lexico...
Daphna Shezaf, Ari Rappoport
ACL
2010
13 years 5 months ago
Cross Lingual Adaptation: An Experiment on Sentiment Classifications
In this paper, we study the problem of using an annotated corpus in English for the same natural language processing task in another language. While various machine translation sy...
Bin Wei, Christopher Pal
EMNLP
2010
13 years 5 months ago
A Probabilistic Morphological Analyzer for Syriac
We define a probabilistic morphological analyzer using a data-driven approach for Syriac in order to facilitate the creation of an annotated corpus. Syriac is an under-resourced S...
Peter McClanahan, George Busby, Robbie Haertel, Kr...
EMNLP
2009
13 years 5 months ago
Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora
A significant portion of the world's text is tagged by readers on social bookmarking websites. Credit attribution is an inherent problem in these corpora because most pages h...
Daniel Ramage, David Hall, Ramesh Nallapati, Chris...
FLAIRS
2009
13 years 5 months ago
Improving Biomedical Document Retrieval by Mining Domain Knowledge
When research articles introduce new findings or concepts they typically relate them only to knowledge and domain concepts of immediate relevance. However, many domain concepts re...
Shuguang Wang, Milos Hauskrecht