Sciweavers

» Segmentation of Compressed Documents
ACL
2007
13 years 9 months ago
Japanese Dependency Parsing Using Sequential Labeling for Semi-spoken Language
The amount of documents directly published by end users is increasing along with the growth of Web 2.0. Such documents often contain spoken-style expressions, which are difficult...
Kenji Imamura, Gen-ichiro Kikui, Norihito Yasuda
IPM
2007
13 years 7 months ago
Using structural contexts to compress semistructured text collections
We describe a compression model for semistructured documents, called Structural Contexts Model (SCM), which takes advantage of the context information usually implicit in the stru...
Joaquín Adiego, Gonzalo Navarro, Pablo de l...
ICDAR
2009
IEEE
13 years 5 months ago
A Tool for Ground-Truthing Text Lines and Characters in Off-Line Handwritten Chinese Documents
Annotating the regions, text lines and characters of document images is an important, but tedious and expensive task. A ground-truthing tool may largely alleviate the human burden...
Fei Yin, Qiu-Feng Wang, Cheng-Lin Liu
ICCPOL
2009
Springer
14 years 2 months ago
Text Editing for Lecture Speech Archiving on the Web
It is very significant in the knowledge society to accumulate spoken documents on the web. However, because of the high redundancy of spontaneous speech, the transcribed text in i...
Masashi Ito, Tomohiro Ohno, Shigeki Matsubara
DAS
2006
Springer
13 years 11 months ago
On Benchmarking of Invoice Analysis Systems
An approach is presented to guide the benchmarking of invoice analysis systems, a specific, applied subclass of document analysis systems. The state of the art of benchma...
Bertin Klein, Stefan Agne, Andreas Dengel