Sciweavers

ITCC
2005
IEEE
14 years 1 month ago
Elimination of Redundant Information for Web Data Mining
These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...
Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang
WECWIS
2003
IEEE
14 years 21 days ago
Page Digest for Large-Scale Web Services
The rapid growth of the World Wide Web and the Internet has fueled interest in Web services and the Semantic Web, which are quickly becoming important parts of modern electronic c...
Daniel Rocco, David Buttler, Ling Liu
ECCV
2008
Springer
14 years 9 months ago
Learning Visual Shape Lexicon for Document Image Content Recognition
Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content catego...
Guangyu Zhu, Xiaodong Yu, Yi Li, David S. Doermann
ICDAR
1999
IEEE
13 years 11 months ago
Preattentive Reading and Selective Attention for Document Image Analysis
PixED (from Pixel to Electronic Document) is aimed at converting document images into structured electronic documents which can be read by a machine for information retrieval. The...
Claudie Faure
PCM
2010
Springer
13 years 5 months ago
A New Text Detection Algorithm for Content-Oriented Line Drawing Image Retrieval
Abstract. Content retrieval of scanned line drawing images is a difficult problem, especially from real-life large scale databases. Existing algorithms don’t work well due to th...
Zhenyu Zhang, Tong Lu, Feng Su, Ruoyu Yang