Sciweavers

92 search results - page 3 / 19
» Optimal Multi-Paragraph Text Segmentation by Dynamic Program...
Sort
View
ACL
2012
11 years 10 months ago
Text Segmentation by Language Using Minimum Description Length
The problem addressed in this paper is to segment a given multilingual document into segments for each language and then identify the language of each segment. The problem was mot...
Hiroshi Yamaguchi, Kumiko Tanaka-Ishii
SIGMOD
2009
ACM
190views Database» more  SIGMOD 2009»
14 years 8 months ago
Optimizing complex extraction programs over evolving text data
Most information extraction (IE) approaches have considered only static text corpora, over which we apply IE only once. Many real-world text corpora however are dynamic. They evol...
Fei Chen 0002, Byron J. Gao, AnHai Doan, Jun Yang ...
ICDE
2012
IEEE
205views Database» more  ICDE 2012»
11 years 10 months ago
Optimizing Statistical Information Extraction Programs over Evolving Text
—Statistical information extraction (IE) programs are increasingly used to build real-world IE systems such as Alibaba, CiteSeer, Kylin, and YAGO. Current statistical IE approach...
Fei Chen, Xixuan Feng, Christopher Re, Min Wang
CIARP
2006
Springer
13 years 11 months ago
Alignment of Paragraphs in Bilingual Texts Using Bilingual Dictionaries and Dynamic Programming
Parallel text alignment is a special type of pattern recognition task aimed to discover the similarity between two sequences of symbols. Given the same text in two different langua...
Alexander F. Gelbukh, Grigori Sidorov
NLE
2007
180views more  NLE 2007»
13 years 7 months ago
Segmentation and alignment of parallel text for statistical machine translation
We address the problem of extracting bilingual chunk pairs from parallel text to create training sets for statistical machine translation. We formulate the problem in terms of a s...
Yonggang Deng, Shankar Kumar, William Byrne