Sciweavers

2827 search results - page 514 / 566
» Marking Text Documents
Sort
View
LREC
2008
111views Education» more  LREC 2008»
13 years 10 months ago
Low-Density Language Bootstrapping: the Case of Tajiki Persian
Low-density languages raise difficulties for standard approaches to natural language processing that depend on large online corpora. Using Persian as a case study, we propose a no...
Karine Megerdoomian, Dan Parvaz
LREC
2008
117views Education» more  LREC 2008»
13 years 10 months ago
Semantic Vectors: a Scalable Open Source Package and Online Technology Management Application
This paper describes the open source SemanticVectors package that efficiently creates semantic vectors for words and documents from a corpus of free text articles. We believe that...
Dominic Widdows, Kathleen Ferraro
LREC
2008
127views Education» more  LREC 2008»
13 years 10 months ago
An Automatic Close Copy Speech Synthesis Tool for Large-Scale Speech Corpus Evaluation
The production of rich multilingual speech corpus resources on a large scale is a requirement for many linguistic, phonetic and technological tasks, in both research and applicati...
Dafydd Gibbon, Jolanta Bachan
IJCAI
2007
13 years 10 months ago
Learning to Identify Unexpected Instances in the Test Set
Traditional classification involves building a classifier using labeled training examples from a set of predefined classes and then applying the classifier to classify test instan...
Xiaoli Li, Bing Liu, See-Kiong Ng
IJCAI
2007
13 years 10 months ago
Supervised Latent Semantic Indexing Using Adaptive Sprinkling
Latent Semantic Indexing (LSI) has been shown to be effective in recovering from synonymy and polysemy in text retrieval applications. However, since LSI ignores class labels of t...
Sutanu Chakraborti, Rahman Mukras, Robert Lothian,...