Sciweavers

79 search results - page 15 / 16
» Self-Supervised Chinese Word Segmentation
NAACL
2003
13 years 8 months ago
A Context-Sensitive Homograph Disambiguation in Thai Text-to-Speech Synthesis
Homograph ambiguity is an original issue in Text-to-Speech (TTS). To disambiguate homographs, several efficient approaches have been proposed, such as part-of-speech (POS) n-gram, B...
Virongrong Tesprasit, Paisarn Charoenpornsawat, Vi...
ACL
2009
13 years 4 months ago
Mining Bilingual Data from the Web with Adaptively Learnt Patterns
Mining bilingual data (including bilingual sentences and terms) from the Web can benefit many NLP applications, such as machine translation and cross language information retrie...
Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, ...
EMNLP
2006
13 years 8 months ago
A Hybrid Markov/Semi-Markov Conditional Random Field for Sequence Segmentation
Markov order-1 conditional random fields (CRFs) and semi-Markov CRFs are two popular models for sequence segmentation and labeling. Both models have advantages in terms of the typ...
Galen Andrew
ESWA
2008
13 years 7 months ago
A weighted string pattern matching-based passage ranking algorithm for video question answering
Video question answering aims to pinpoint answers in response to user-specified questions. However, most question answering technologies involve integrating rich specifi...
Yu-Chieh Wu, Jie-Chi Yang, Yue-Shi Lee
IDEAL
2003
Springer
14 years 6 days ago
Towards a Terabyte Digital Library System
In the China-US Million Book Digital Library, the output of the digitization process is more than one terabyte of text in OEB and PDF formats. To access these data quickly and accurately,...
Hao Ding, Yun Lin, Bin Liu