Sciweavers

563 search results - page 46 / 113
» Assessing the Quality of Natural Language Text Data
LREC
2010
13 years 10 months ago
Named Entity Recognition in Questions: Towards a Golden Collection
Named Entity Recognition (NER) plays a relevant role in several Natural Language Processing tasks. Question-Answering (QA) is an example of such, since answers are frequently name...
Ana Cristina Mendes, Luísa Coheur, Paula Va...
EMNLP
2010
13 years 6 months ago
Self-Training with Products of Latent Variable Grammars
We study self-training with products of latent variable grammars in this paper. We show that increasing the quality of the automatically parsed data used for self-training gives h...
Zhongqiang Huang, Mary P. Harper, Slav Petrov
CICLING
2009
Springer
14 years 9 months ago
Exploiting Parallel Treebanks to Improve Phrase-Based Statistical Machine Translation
We use existing tools to automatically build two parallel treebanks from existing parallel corpora. We then show that combining the data extracted from both the treebanks and the ...
John Tinsley, Mary Hearne, Andy Way
ICWSM
2010
13 years 10 months ago
Trading Strategies to Exploit Blog and News Sentiment
We use quantitative media (blogs, and news as a comparison) data generated by a large-scale natural language processing (NLP) text analysis system to perform a comprehensive and c...
Wenbin Zhang, Steven Skiena
LREC
2010
13 years 10 months ago
New Tools for Web-Scale N-grams
We introduce a new set of tools for working with web-scale N-gram data. These tools lower the barrier for working with web-scale text, and create a new platform for acquiring larg...
Dekang Lin, Kenneth Ward Church, Heng Ji, Satoshi ...