Sciweavers

332 search results - page 30 / 67
» Document Content Extraction Using Automatically Discovered F...
Sort
View
HIPC
2009
Springer
13 years 5 months ago
Extracting the textual and temporal structure of supercomputing logs
Supercomputers are prone to frequent faults that adversely affect their performance, reliability and functionality. System logs collected on these systems are a valuable resource o...
Sourabh Jain, Inderpreet Singh, Abhishek Chandra, ...
NAACL
2004
13 years 9 months ago
A Statistical Model for Multilingual Entity Detection and Tracking
Entity detection and tracking is a relatively new addition to the repertoire of natural language tasks. In this paper, we present a statistical language-independent framework for ...
Radu Florian, Hany Hassan, Abraham Ittycheriah, Ho...
WIDM
2003
ACM
14 years 24 days ago
Datarover: a taxonomy based crawler for automated data extraction from data-intensive websites
The advent of e-commerce has created a trend that brought thousands of catalogs online. Most of these websites are “taxonomy-directed”. A Web site is said to be ``taxonomydire...
Hasan Davulcu, S. Koduri, Saravanakumar Nagarajan
ICPR
2008
IEEE
14 years 2 months ago
Automatic video annotation with adaptive number of key words
Retrieving videos using key words requires obtaining the semantic features of the videos. Most work reported in the literature focuses on annotating a video shot with a fixed numb...
Fangshi Wang, Wei Lu, Jingen Liu, Mubarak Shah, De...
ICCV
2005
IEEE
14 years 1 months ago
Learning Non-Generative Grammatical Models for Document Analysis
— We present a general approach for the hierarchical segmentation and labeling of document layout structures. This approach models document layout as a grammar and performs a glo...
Michael Shilman, Percy Liang, Paul A. Viola