Sciweavers

910 search results - page 33 / 182
» Standardization of Speech Corpus
Sort
View
ACL
2012
11 years 10 months ago
Syntactic Annotations for the Google Books NGram Corpus
We present a new edition of the Google Books Ngram Corpus, which describes how often words and phrases were used over a period of five centuries, in eight languages; it reflects...
Yuri Lin, Jean-Baptiste Michel, Erez Aiden Lieberm...
LREC
2008
117views Education» more  LREC 2008»
13 years 9 months ago
Tagging a Hebrew Corpus: the Case of Participles
We report on an effort to build a corpus of Modern Hebrew tagged with parts of speech and morphology. We designed a tagset specific to Hebrew while focusing on four aspects: the t...
Meni Adler, Yael Dahan Netzer, Yoav Goldberg, Davi...
ACL
1994
13 years 9 months ago
A Corpus-Based Approach to Automatic Compound Extraction
An automatic compound retrieval method is proposed to extract compounds within a text message. It uses n-gram mutual information, relative frequency count and parts of speech as t...
Keh-Yih Su, Ming-Wen Wu, Jing-Shin Chang
KDD
2009
ACM
211views Data Mining» more  KDD 2009»
14 years 8 months ago
Address standardization with latent semantic association
Address standardization is a very challenging task in data cleansing. To provide better customer relationship management and business intelligence for customer-oriented cooperates...
Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang,...
ICMCS
2005
IEEE
128views Multimedia» more  ICMCS 2005»
14 years 1 months ago
Low-complexity automatic speaker recognition in the compressed GSM AMR domain
This paper presents an experimental implementation of a low-complexity speaker recognition algorithm working in the compressed speech domain. The goal is to perform speaker modeli...
Matteo Petracca, Antonio Servetti, Juan Carlos De ...