Sciweavers

» Text Classification by Labeling Words
ROCAI
2004
Springer
14 years 27 days ago
Learning Interestingness Measures in Terminology Extraction. A ROC-based approach
Abstract. In the field of Text Mining, a key phase in data preparation is concerned with the extraction of terms, i.e. collocations of words attached to specific concepts (e.g. Ph...
Mathieu Roche, Jérôme Azé, Yve...
ICDAR
2003
IEEE
14 years 25 days ago
Automatic Feature Selection with Applications to Script Identification of Degraded Documents
Current approaches to script identification rely on hand-selected features and often require processing a significant part of the document to achieve reliable identification. We p...
Vitaly Ablavsky, Mark R. Stevens
ACL
2004
13 years 9 months ago
Unsupervised Sense Disambiguation Using Bilingual Probabilistic Models
We describe two probabilistic models for unsupervised word-sense disambiguation using parallel corpora. The first model, which we call the Sense model, builds on the work of Diab ...
Indrajit Bhattacharya, Lise Getoor, Yoshua Bengio
EMNLP
2004
13 years 9 months ago
Object-Extraction and Question-Parsing using CCG
Accurate dependency recovery has recently been reported for a number of wide-coverage statistical parsers using Combinatory Categorial Grammar (CCG). However, overall figures give...
Stephen Clark, Mark Steedman, James R. Curran
NIPS
2001
13 years 9 months ago
Grammatical Bigrams
Unsupervised learning algorithms have been derived for several statistical models of English grammar, but their computational complexity makes applying them to large data sets int...
Mark A. Paskin