Sciweavers

502 search results - page 35 / 101
» Extracting Partial Structures from HTML Documents
ICDAR
2003
IEEE
14 years 1 month ago
Localization, Extraction and Recognition of Text in Telugu Document Images
In this paper we present a system to locate, extract and recognize Telugu text. The circular nature of Telugu script is exploited for segmenting text regions using the Hough Trans...
Atul Negi, K. Nikhil Shanker, Chandra Kanth Chered...
JCDL
2003
ACM
14 years 1 month ago
Automatic Document Metadata Extraction Using Support Vector Machines
Automatic metadata generation provides scalability and usability for digital libraries and their collections. Machine learning methods offer robust and adaptable automatic metadat...
Hui Han, C. Lee Giles, Eren Manavoglu, Hongyuan Zh...
ICMCS
2006
IEEE
14 years 2 months ago
A Measure for Evaluating Retrieval Techniques based on Partially Ordered Ground Truth Lists
For the RISM A/II collection of musical incipits (short extracts of scores, taken from the beginning), we have established a ground truth based on the opinions of human experts. I...
Rainer Typke, Remco C. Veltkamp, Frans Wiering
PRICAI
2004
Springer
14 years 1 month ago
Coherent Arrangement of Sentences Extracted from Multiple Newspaper Articles
Multi-document summarization addresses the information overload problem by providing a condensed text for a number of documents. Most multi-document summarization systems make u...
Naoaki Okazaki, Yutaka Matsuo, Mitsuru Ishizuka
ER
2007
Springer
14 years 2 months ago
VERT: A Semantic Approach for Content Search and Content Extraction in XML Query Processing
Processing a twig pattern query in an XML document includes structural search and content search. Most existing algorithms only focus on structural search. They treat content nodes th...
Huayu Wu, Tok Wang Ling, Bo Chen