Sciweavers

144 search results - page 17 / 29
» Improved Source-Channel Models for Chinese Word Segmentation
Sort
View
ICDAR
2011
IEEE
12 years 7 months ago
A Handwritten Character Extraction Algorithm for Multi-language Document Image
—In this paper, we propose a novel method for extracting handwritten characters from multi-language document images, which may contain various types of characters, e.g. Chinese, ...
Yonghong Song, Guilin Xiao, Yuanlin Zhang, Lei Yan...
ACL
2012
11 years 10 months ago
SITS: A Hierarchical Nonparametric Model using Speaker Identity for Topic Segmentation in Multiparty Conversations
One of the key tasks for analyzing conversational data is segmenting it into coherent topic segments. However, most models of topic segmentation ignore the social aspect of conver...
Viet-An Nguyen, Jordan L. Boyd-Graber, Philip Resn...
FSKD
2009
Springer
145views Fuzzy Logic» more  FSKD 2009»
14 years 2 months ago
Chinese Web Comments Clustering Analysis with a Two-phase Method
Usually a meaningful web topic has tens of thousands of comments, especially the hot topics. It is valuable if we congregate the comments into clusters and find out the mainstrea...
Yexin Wang, Li Zhao, Yan Zhang
EMNLP
2011
12 years 7 months ago
Quasi-Synchronous Phrase Dependency Grammars for Machine Translation
We present a quasi-synchronous dependency grammar (Smith and Eisner, 2006) for machine translation in which the leaves of the tree are phrases rather than words as in previous wor...
Kevin Gimpel, Noah A. Smith
ICASSP
2011
IEEE
12 years 11 months ago
Discriminative duration modeling for speech recognition with segmental conditional random fields
This paper describes a new approach to modeling duration for LVCSR using SCARF, a toolkit for speech recognition with segmental conditional random fields. We utilize SCARF’s abi...
Justine T. Kao, Geoffrey Zweig, Patrick Nguyen