Sciweavers

167 search results - page 25 / 34
» Text Alignment with Handwritten Documents
Sort
View
ACL
2006
13 years 9 months ago
A DOM Tree Alignment Model for Mining Parallel Data from the Web
This paper presents a new web mining scheme for parallel data acquisition. Based on the Document Object Model (DOM), a web page is represented as a DOM tree. Then a DOM tree align...
Lei Shi, Cheng Niu, Ming Zhou, Jianfeng Gao
ANLP
2000
107views more  ANLP 2000»
13 years 9 months ago
Cut and Paste Based Text Summarization
We present a cut and paste based text summarizer, which uses operations derived from an analhuman written abstracts. The summarizer edits extracted sentences, using reduction to r...
Hongyan Jing, Kathleen McKeown
ICDAR
2007
IEEE
14 years 2 months ago
Content-level Annotation of Large Collection of Printed Document Images
A large annotated corpus is critical to the development of robust optical character recognizers (OCRs). However, creation of annotated corpora is a tedious task. It is laborious, ...
Anand Kumar 0002, C. V. Jawahar
COLING
2002
13 years 7 months ago
A Robust Cross-Style Bilingual Sentences Alignment Model
Most current sentence alignment approaches adopt sentence length and cognate as the alignment features; and they are mostly trained and tested in the documents with the same style...
Tz-Liang Kueng, Keh-Yih Su
ICAIL
2009
ACM
14 years 15 days ago
Segmentation of legal documents
An overwhelming number of legal documents is available in digital form. However, most of the texts are usually only provided in a semi-structured form, i.e. the documents are stru...
Eneldo Loza Mencía