Sciweavers

910 search results - page 166 / 182
» Standardization of Speech Corpus
Sort
View
FLAIRS
2009
13 years 6 months ago
Improving Biomedical Document Retrieval by Mining Domain Knowledge
When research articles introduce new findings or concepts they typically relate them only to knowledge and domain concepts of immediate relevance. However, many domain concepts re...
Shuguang Wang, Milos Hauskrecht
COLING
2010
13 years 3 months ago
Discriminative Induction of Sub-Tree Alignment using Limited Labeled Data
We employ Maximum Entropy model to conduct sub-tree alignment between bilingual phrasal structure trees. Various lexical and structural knowledge is explored to measure the syntac...
Jun Sun, Min Zhang, Chew Lim Tan
EMNLP
2011
12 years 8 months ago
Named Entity Recognition in Tweets: An Experimental Study
People tweet more than 100 Million times daily, yielding a noisy, informal, but sometimes informative corpus of 140-character messages that mirrors the zeitgeist in an unprecedent...
Alan Ritter, Sam Clark, Mausam, Oren Etzioni
WSDM
2012
ACM
236views Data Mining» more  WSDM 2012»
12 years 4 months ago
Effective query formulation with multiple information sources
Most standard information retrieval models use a single source of information (e.g., the retrieval corpus) for query formulation tasks such as term and phrase weighting and query ...
Michael Bendersky, Donald Metzler, W. Bruce Croft

Dataset
924views
14 years 4 months ago
SCUT-COUCH2009 - A Comprehensive Online Unconstrained Handwriting Database
SCUT-COUCH 2009 database is a comprehensive database that consists of 12 datasets, namely GB1, GB2, TradGB1, Big5, Pinyin, Letters, Digit, Symbol, Word8888, Word17366, Word44208 an...
Lianwen Jin