Sciweavers

3145 search results - page 376 / 629
» A Computational Psycholinguistic Model of Natural Language P...
Sort
View
EMNLP
2010
15 years 2 months ago
Negative Training Data Can be Harmful to Text Classification
This paper studies the effects of training data on binary text classification and postulates that negative training data is not needed and may even be harmful for the task. Tradit...
Xiaoli Li, Bing Liu, See-Kiong Ng
NLPRS
2001
Springer
15 years 8 months ago
A Hierarchical EM Approach to Word Segmentation
We propose a simple two-level hierarchical probability model for unsupervised word segmentation. By treating words as strings composed of morphemes/phonemes which are themselves c...
Fuchun Peng, Dale Schuurmans
IR
2008
15 years 4 months ago
A probability ranking principle for interactive information retrieval
The classical Probability Ranking Principle (PRP) forms the theoretical basis for probabilistic Information Retrieval (IR) models, which are dominating IR theory since about 20 ye...
Norbert Fuhr
IR
2006
15 years 4 months ago
Table extraction for answer retrieval
The ability to find tables and extract information from them is a necessary component of many information retrieval tasks. Documents often contain tables in order to communicate d...
Xing Wei, W. Bruce Croft, Andrew McCallum
EMNLP
2009
15 years 2 months ago
Generalized Expectation Criteria for Bootstrapping Extractors using Record-Text Alignment
Traditionally, machine learning approaches for information extraction require human annotated data that can be costly and time-consuming to produce. However, in many cases, there ...
Kedar Bellare, Andrew McCallum