Sciweavers

1287 search results - page 130 / 258
» Integrating document and data retrieval based on XML
Sort
View
ICDAR
2011
IEEE
12 years 9 months ago
Functional-Based Table Category Identification in Digital Library
– Better understanding the document logical components is crucial to many applications, e.g., document classification or data integration. As the development of digital libraries...
Seongchan Kim, Ying Liu
DKE
2006
139views more  DKE 2006»
13 years 10 months ago
Information extraction from structured documents using k-testable tree automaton inference
Information extraction (IE) addresses the problem of extracting specific information from a collection of documents. Much of the previous work on IE from structured documents, suc...
Raymond Kosala, Hendrik Blockeel, Maurice Bruynoog...
ICDAR
2009
IEEE
13 years 8 months ago
PENTOOLS - A MATLAB Toolkit for On-line Pen-Based Data Experimentation
MATLAB provides a powerful environment for rapid prototyping of research methods and techniques. Across the wide range of on-line pen computing applications there exists a series ...
Richard M. Guest
KDD
2007
ACM
186views Data Mining» more  KDD 2007»
14 years 10 months ago
Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus
We present a document routing and index partitioning scheme for scalable similarity-based search of documents in a large corpus. We consider the case when similarity-based search ...
Deepavali Bhagwat, Kave Eshghi, Pankaj Mehra
ICDM
2009
IEEE
162views Data Mining» more  ICDM 2009»
13 years 8 months ago
Towards a Universal Text Classifier: Transfer Learning Using Encyclopedic Knowledge
Document classification is a key task for many text mining applications. However, traditional text classification requires labeled data to construct reliable and accurate classifie...
Pu Wang, Carlotta Domeniconi